Please note that course schedules may be amended due to low enrollment, faculty availability, and/or other factors.
MSDS 420-DL: Database Systems and Data Preparation
Description
In this course, students explore the fundamental concepts of
database management and data preparation. With a focus on
applications in large-scale data analytics projects, the course
introduces relational database systems, the relational model,
the normalization process, and Structured Query Language (SQL). The
course also covers data integration and cleaning, as well as
database programming for extract, transform, and load (ETL)
operations. Students learn NoSQL technologies for working with
unstructured data and document-oriented information retrieval
systems. They learn how to index and score documents for effective
and relevant responses to user queries. Students acquire hands-on
programming experience for data preparation and data extraction
using various data sources and file formats.
Recommended: Prior programming experience or MSDS
430-DL Python for Data Science.
Prerequisite: MSDS 402-DL Introduction to Data Science or
MSDS 403-DL Data Science in Practice.