Lab 1 - Introduction to Ethics

Due: Friday by 3:30pm

Data mining carries with it several ethical dimensions that we will explore in the first two labs of the class. Today's lab will focus on historical background, particularly with respect to scientific and medical research that involves data from humans (human subjects research).

There will be a brief lecture at the start of lab. Here are some reading assignments that will give background on the lecture:

After the lecture, break into groups and discuss whether each of the following scenarios would be considered human subjects research. Each group should write a report on its discussion.
  1. Alice is a medical researcher at a major public university. Alice goes to a nationally recognized data center and downloads a public, anonymous database to train neural networks to detect warning signs of heart disease. The original database creator obtained informed consent from all the people whose data is in the database.
  2. Bob is a master's student in science education who is studying the causes of dropping out of college. Bob uses Facebook to gather information about college-aged people in order to determine whether social media postings have any predictive value in determining who graduates from college and who drops out. Bob anonymizes this data and posts it online.
  3. Carol is a programmer for a major retailer. Carol uses transaction data from the retailer's loyalty program to tailor marketing campaigns to specific customers' shopping habits.
After the groups have had sufficient time for discussion, we will get back together and discuss the three scenarios as a class.

Each group should choose one group member to upload their report to Moodle. Make sure every group member's name is on the report.