INFO 3401

Information Exploration

INFO 3401 “Information Exploration” is a semester-length, lecture-based undergraduate course. The volume, velocity, variety, and variability of data challenges our ability to collect, analyze, interpret, and act. This course will develop students’ skills and sensibilities for conducting exploratory data analysis across seven kinds of data. Students will learn to collect, analyze, visualize, evaluate, and communicate data to motivate new questions, make predictions, and work towards solutions. This course will call upon the quantitative and computational skills students have developed in previous courses and will increase their confidence and autonomy as data analysts and scientists who can deliver insights from diverse kinds of data.

Learning objectives

  • Improve students’ confidence analyzing diverse kinds of data
  • Develop students’ ability to match questions to data to solutions
  • Understand professional data science tools and methods
  • Think critically about the opportunities and limitations of data

Course outline

Module Week Type Skills
  1   Introductions
Tabular 2 Fundamentals Single tables, pandas
  3 Applications Population data
       
Relational 4 Fundamentals Multiple tables, sqlite
  5 Applications Sports data
       
Temporal 6 Fundamentals Time series, prophet
  7 Applications Economic data
       
Spatial 8 Fundamentals Mapping, geopandas
  9 Applications Political data
       
Dyadic 10 Fundamentals Networks, networkx
  11 Applications social network data
       
Structured 12 Fundamentals JSON \& XML, beautifulsoup
  13 Applications API data
       
  14   Fall Break
       
Unstructured 15 Fundamentals Text, nltk
  16 Applications Text data