Achieving the full capability of AI requires a system perspective, extending beyond the models, to effectively leverage algorithms, data, and computing power. Creating AI-enabled systems includes thoughtful consideration of an operational decomposition for AI solutions, engineering data for algorithm development, and deployment strategies. The objective of this course is to bring a system perspective to creating AI-enabled systems. The course will explore the full-lifecycle of creating AI-enabled systems starting with problem decomposition and addressing data, development, design, diagnostic, and deployment phases. Each module will either introduce a domain in Machine Learning (Tabular, Computer Vision, Natural Language Processing, and Physical Systems) or delve into the end-to-end development of a specific AI system. Students will be exposed to the common technologies and resources practitioners use to develop these systems.
The course materials are divided into five Topics. Each Topic will be dedicated to study a specific system, culminating in a case study submission for the student to demonstrate an in-depth understanding of a specific ML system in its real-life context. This course heavily relies on a case study structure to explore complex issues, generate insights, and illustrate theories or concepts through detailed analysis.
Each Topic's materials will be accessible at the beginning of their corresponding Module. You are encouraged to preview all sections of the topic as early as possible:
| Module | Topic | Submissions |
| 1 | Introduction to ML Systems (Part 1): Requirements Engineering; Data Engineering; Dataset Design | Discussion 1 |
| 2 | Introduction to ML Systems (Part 2): Requirements Engineering; Data Engineering; Dataset Design | Tool Assessment Assignment Discussion 2 |
| 3 | Object Detection (Multi-Objective) Systems (Part 1): | Discussion 3 |
| 4 | Object Detection (Multi-Objective) Systems (Part 2): Model; Metrics; Deployment | Discussion 4 |
| 5 | Object Detection (Multi-Objective) Systems (Part 3): Case Study Analysis | TechTrack Implementation |
| 6 | Vector Search Systems (Part 1): | Discussion 6 |
| 7 | Vector Search Systems (Part 2): Model; Metrics; Deployment | Discussion 7 |
| 8 | Vector Search Systems (Part 3): Case Study Analysis | IronClad Implementation IronClad Design and Analysis Discussion 8 |
| 9 | Retrieval Augmented Generation Systems (Part 1): Problem Statement; Requirements Engineering; Data | Discussion 9 |
| 10 | Retrieval Augmented Generation Systems (Part 2): Model; Metrics; Deployment | Discussion 10 |
| 11 | Retrieval Augmented Generation Systems (Part 3): Case Study Analysis | TextWave Implementation TextWave Design and Analysis Discussion 11 |
| 12 | Recommender Systems (Part 1): Problem Statement; Requirements Engineering; Data | Discussion 12 |
| 13 | Recommender Systems (Part 2): Model; Metrics; Deployment | Discussion 13 |
| 14 | Recommender Systems (Part 3): Case Study Analysis | MovieMate Design and Analysis SecureBank Final Case Study |
This course prepares students for creating AI-enabled systems by considering the full-lifecycle for building and designing practical AI solutions. We will study issues involving requirements, data, metrics, model, and deployment. This course demonstrates how this fulfills a deployable pipeline to construct working AI systems.
No textbook is required. Each module will have a reading list that will include papers and textbooks chapters from O’Reilly online. Please use your JHED ID for logging into O’Reilly (https://www.oreilly.com). This is an excellent source for books and interactive tutorials. I encourage your exploration of such a valuable tool. If students are interested in learning more about the materials related to the basic foundations of this course, these books are recommended:
Assignments require a python development environment to generate Jupyter notebooks and python scripts. A tutorial on Jupyter notebooks can be found here.
Docker (docker.com) is required to create the necessary development environment and more. The class provides the necessary instructions in Module 1. This allows the creation of a docker container that contains jupyter lab, an environment to create and run jupyter notebooks and python scripts. This container also includes most of the python libraries required for the assignments and git, a version control for your created software. A video tutorial can be found here. You will also use docker to create deployable images to run your work. There is an interactive docker tutorial available here.
Docker containers avoid library and operating system conflicts to one to also run the containers regardless of their environment (as long as it supports docker). With git and docker you will build an accessible and managed machine learning portfolio of your classwork. Docker also allows straightforward and isolated installation of other software discussed in the course such as NoSQL databases (e.g. MongoDB) and messaging software (e.g. Kafka). Docker images/containers are easy to remove when no longer needed. Docker containers are far superior than installing and deinstalling software and all the various python libraries directly in your computer operating system.
You are required to have access to an operating system that supports Docker. Current Windows, Mac, and Linux all support Docker. If this is not available, you can access cloud virtual machines and install docker there.
It is expected that each module will take approximately 10-15 hours per week to complete. Here is an approximate breakdown:
All submissions are due according to the dates in the Calendar. Unless specified, all submissions should be submitted to GitHub and/or Canvas. You will be instructed as to the structure of your provided repository. This course will consist of the following basic student requirements:
Assignments (65% of Final Grade)
Assignments will be comprised of data analysis, software delivery, and a written explanation of the system design focused on the implementation your system. The assignments are meant to act as a way for students to receive early feedback for your case studies, and are encouraged to edit your system design and analysis based on the feedback your receive. Late submissions will be reduced by 10 points (out of 100) letter grade for each week late (no exceptions without prior coordination with the instructors). All deductions will not be prorated.
Final Project (25% of Final Grade)
The purpose of the final case study is to provide an in-depth understanding of a specific ML system. Each case study will explore systems to uncover insights about a problem and a proposed solution. These case studies begins with an open-ended description of a hypothetical problem, and students must propose an ML-system solution to address the presented problem. The general process to tackle case studies will include:
Case study submissions will be in the form of a Markdown file containing problem and system design descriptions. A good case study submission should include: 1) a description of the problem, 2) a description of your proposed design, 3) an argument for how your design addresses each requirement, and 4) test/evaluate hypotheses of your proposed design. This file should contain diagrams, tables, charts, etc. to argue for your design decisions.
Discussion Participation (10% of Final Grade)The discussions supplements the Final Project by allowing students to delve deeper into their chosen project through question prompts. The weekly discourse is composed of two parts (10 points total):
I will monitor module discussions and will respond to some of the posts to provide feedback. You are encouraged to edit your responses based of the feedback you receive.
Grading Policy:Assignments are due according to the dates posted in your Canvas course site. You may check these due dates in the Course Calendar or the Assignments in the corresponding modules.
Grades are typically posted one week after assignment due dates.
The course will use the following grading system:
100-90 = A,
89-80 = B
79-70 = C
69-63 = D
<63 = F
Final grades will be determined by the following weighting:
| Submission | Weight |
|---|---|
| Assignments | 65% |
| Final Project | 25% |
| Discussions | 10% |
Assignments are due according to the dates posted in your Canvas course site. You may check these due dates in the Course Calendar or the Assignments in the corresponding modules.
Grades are typically posted one week after assignment due dates.
The course will use the following grading system:
100-90 = A,
89-80 = B
79-70 = C
69-63 = D
<63 = F
Deadlines for Adding, Dropping, and Withdrawing from Courses
Students may add a course up to one week after the start of the term for that particular course. Students may drop courses according to the drop deadlines outlined in the EP academic calendar. Between the 6th week of the class and prior to the final withdrawal deadline, a student may withdraw from a course with a W on their academic record. A record of the course will remain on the academic record with a W appearing in the grade column to indicate that the student registered and withdrew from the course.
Academic Misconduct Policy
Students with Disabilities - Accommodations and Accessibility
Student Conduct Code
Classroom Climate
JHU is committed to creating a classroom environment that values the diversity of experiences and perspectives that all students bring. Everyone has the right to be treated with dignity and respect. Fostering an inclusive climate is important. Research and experience show that students who interact with peers who are different from themselves learn new things and experience tangible educational outcomes. At no time in this learning process should someone be singled out or treated unequally on the basis of any seen or unseen part of their identity. If you have concerns in this course about harassment, discrimination, or any unequal treatment, or if you seek accommodations or resources, please reach out to the course instructor directly. Reporting will never impact your course grade. You may also share concerns with your program chair, the Assistant Dean for Diversity and Inclusion, or the Office of Institutional Equity. In handling reports, people will protect your privacy as much as possible, but faculty and staff are required to officially report information for some cases (e.g. sexual harassment).
Course Auditing
When a student enrolls in an EP course with “audit” status, the student must reach an understanding with the instructor as to what is required to earn the “audit.” If the student does not meet those expectations, the instructor must notify the EP Registration Team (EP-Registration@exchange.johnshopkins.edu) in order for the student to be retroactively dropped or withdrawn from the course (depending on when the "audit" was requested and in accordance with EP registration deadlines). All lecture content will remain accessible to auditing students, but access to all other course material is left to the discretion of the instructor.