OK, my geek friends: a challenge. I have been assigned to teach a course next semester entitled "Dealing with Data". It is a required course for the (new) information science and arts school, it has never been taught before, and basically I was told, "Here is the course title. It should cover technical, legal, and ethical aspects of data. Design it."
I could use a little help; details follow.
Oh yes, and the prerequisites are basic algebra.
So. I'm on the lookout for topics, interesting datasets, and analysis tools suitable for the uninitiated. Any articles or books or readings of any kind would be __GREATLY__ appreciated.
Topics: A mixture of ethical/legal (examples: HIPAA, "Climategate", Facebook, Netflix) and technical (kinds of data, kinds of data storage, data cleaning and mining, search and retrieval).
Datasets: Everything from unstructured text to social network data to MRI (such as my brain, which I will probably distribute). Controversial datasets would be great, but they might be hard to analyze (Enron corpus; "Climategate" emails and/or datasets)
Analysis: Primarily visual, given the prereqs. Examples: Gapminder, Tableau Public, Excel (unfortunately ubiquitous), targeted VTK/Python scripts.
Any other suggestions?
(PS - I did have a bit more guidance than the title; SISTA's director and Assoc. Director and I met several times. My intro paragraph was purely for dramatic effect)