The course is a gentle introduction to the field of Natural Language Processing for students with primarily humanities backgrounds. The course (FIMS 9732) is getting a user-friendly name, tentatively phrased as “How Computers Talk to Us: Conversational AI Design and Uses.” The course is open to all FIMS graduate students that work in information professions. No prior background in languages or programming expected.

Next time taught: Fall 2024 term. Open to all FIMS students.
Location: FIMS & Nursing Building at the University of Western Ontario, London, CANADA

Last update: as of 05/27/2024 (please expect more information to be available by late July/early August)

Objectives:

  1. To gain an awareness and appreciation of the complexity of natural language.
  2. To analyze the research literature on linguistic and computational aspects of natural language processing techniques.
  3. To critically evaluate a variety of applications that use natural language processing technologies.
  4. To connect NLP technologies and library applications in an innovative way.
  5. (Optional: To gain practical experience in basic text analysis with NLP techniques and/or in advanced NLP applications).

This course was previously offered as LIS 9732/9832 (Language & Computer Technologies for Libraries & Beyond.) The sample content below reflects its initial offering as a ‘soft’ introduction to Natural Language Processing (NLP) for MLIS students with non-technical backgrounds.

A new FIMS-wide course will now be more inclusive of wider FIMS audience’s interests. (See a minimal Western University Course catalogue description). The revamped and much updated version of the course will emphasize the grasp of technical details. We will not shy away from trying some practical data analysis skills (with Python in a series of optional tutorials). NLP technologies have made considerable advances in the recent years especially in the realm of Conversational AI. Just think about the popularity and controversies around ChatGPT! A few other types of NLP or AI-based language technologies are Alexa, Google-translate, predictive auto-correction in search, and recommender systems. There is increased demand for specialists that can understand the inner workings of such tools, apps, and their methods. This coming Fall 2024, our FIMS students will be positioned to connect the two perspectives: that of the users and developers. Students will be able to inform themselves and comment intelligibly on the pros and cons of Conversational AI, its design and adoption, potential harms, and other related issues such as misuse and abuse of privacy. (Last update: 02/28/2024)


Course Description:

Introduction to linguistic and computational aspects of natural language processing technologies. Familiarity with underlying principles and techniques required to perform all levels of language understanding and processing of naturally occurring text. Critical assessment of the use of language technologies in a variety of applications.


Course Content:
Linguistic and Computing Basics:

  • Computing with Words.
  • Phonetics: Speech. Sound Structure. Phoneme Classifications. Statistical vs. Symbolic NLP
  • Corpus Linguistics: Collocations. Concordances. Annotation.
  • Lexicology: Corpora. Lexicons. WordNet
  • Morphology: Components of Words. Informative Affixes. Stemming and Lemmatizing.
  • Part-of-Speech Tagging: Challenges. Approaches. Accuracy.
  • Parsing: Phrase Structures. Context Free Grammars. Methods.
  • Semantic Networks. Thematic Roles. Frames. Case Grammars. Conceptual Graphs.
  • Discourse: Cohesion. Anaphora. Co-reference resolution. Discourse Structure. Sublanguage
  • Pragmatics: Speech Act Theory. Gricean Maxims. Dialogues. Plan Recognition. Subjectivity.
  • Final Thoughts: Myths & Reality

NLP Applications:

  • The focus of Fall 2024 term is on Conversational AI.
    • Natural Language Interaction: Dialogue Systems. Chatbots. Speech Recognition.
  • But also, other NLP/AI applications include:
    • Machine Translation. Automated Summarization. Question and Answering.
    • NLP in Information Retrieval (IR). Cross-language IR. Multimedia. NLP in Image IR.
    • Mining Content of Social Software Sites. Analysis of Social Tags. Information Extraction. Text Mining.
    • Computer Assisted Language Learning. Language Identification. Terminology Alignment and Comprehension Aids.
    • Authoring Aids. Automatic Indexing.
    • Assistive Technologies (for disabled).

Assignments:

Details: To Be Announced

Students will experiment with Conversational AI/NLP applications and write papers and do presentations related to the chosen language technology application. Each paper will be presented to class prior to submission.

Students’ posters are to be presented in an open poster session in the last class. Expect to have a 5-10 minute interaction with the instructor and various length interactions with fellow students and interested faculty, if available.


Methods of Evaluation: To Be Announced by early August 2024


More Informally About the Course:

What’s it about, more broadly? In essence, this course is a gentle transition from a humanities background towards a more technologically oriented way of thinking. I invite you to think about the role of computers in acquiring, analyzing, organizing, providing access, and making sense of textual information. We will concentrate on understanding capabilities and limits of current natural language technologies. We will discuss ways that people have thought to utilize language analysis in order to organize textual information in a meaningful way.

You will be exploring and thinking about how “intelligent computers” can assist us in libraries. Are there text-intensive environments (directly or indirectly related to libraries) beyond library applications for language and information technologies?

Have you, as an individual or professional, ever come across the information overload program? Are you familiar with machine translation, automated summarization, question-and-answering and retrieval, extraction, or auto-indexing? What are their current state-of-the-art, what are the advantages and limitations? Can these be of any help to you in your professional capacity as a librarian or information scientist?

The class is a combination of lectures, in class discussions, projects, independent reading, presentations, and papers. We learn actively in class by doing hand-on exercises.

While there will be no programming required, be prepared to look at sequences of pseudo-code steps necessary in understanding how computer programs work on the conceptual level. No prior linguistic background necessary. Bring your curiosity about languages and computers and, most importantly, keep your mind open!

See also a Prezi: How are Language Technologies and LIS(&T) interlinked?