Oct 17, 2024  
2022-2023 Academic Catalog 
    
2022-2023 Academic Catalog [ARCHIVED CATALOG]

Add to My Bookmark (opens a new window)

DATA 152 - Natural Language Processing


Upper Division

Prerequisites
DATA 040  , DATA 137  , DATA 139  , MATH 120   and CS 021  .

This course is intended as a practical introduction to the most widely used techniques, strategies and toolkits for natural language processing. The text classification task is one of the most popular tasks that we deal with in real life. We use it in classify news, spam filtering, sentiment analysis, etc. You will learn how to go from raw texts to predicted classes both with traditional methods (e.g. Linear Classifiers) and deep learning techniques (e.g. Convolutional Neural Nets). In addition, you will learn how to treat texts as a sequence of words, which is called the language modeling task in NLP. In particular, how to predict next words given some previous words. This is used for suggestions in searches, machine translation, chatbots, and so on. Finally, students will learn about vectors that represent meanings using modern tools for word and sentence embeddings, such as word2vec, and will discuss how to embed the whole documents with topic models.

Term Offered
Spring

Cross-Listing
No

Course credits: 1.0



Add to My Bookmark (opens a new window)