Thursday, May 19, 2011
5/20 Presentation
Ahead last week, behind this week.
Thursday, May 12, 2011
First Round Run
Thursday, May 5, 2011
Second Draft
Thursday, April 28, 2011
Updated Experiment Info--Pilot Testing
[Measurements]
1. The number of heuristics that the subject used to refine the search. (How can I measure this besides watching?)
2. How confident the user is about his/her selections.
3. How well the selections perform, as rated by independent raters.*
4. Whether the user feels like he/she needed more time.
*I intend to ask Mechanical Turk workers to rate the dress and shirt selections. I will give them the same prompts that the subjects received, show them two selections at a time, and ask which one seems more appropriate (or which is the better choice).
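The two-at-a-time rating described above could be set up roughly as follows. This is only a sketch under my own assumptions: that each pair puts one small-screen selection against one large-screen selection, and that left/right position is randomized so raters cannot infer the condition. The function name and the placeholder selection IDs are hypothetical.

```python
import random

def build_rating_pairs(small_screen, large_screen, seed=0):
    """Pair every small-screen selection with every large-screen selection.

    Each pair is shuffled so the condition does not always appear on the
    same side, and the list of pairs is shuffled so raters see them in a
    mixed order. Items are (condition, selection_id) tuples.
    """
    rng = random.Random(seed)  # fixed seed so the pairing is reproducible
    pairs = []
    for s in small_screen:
        for l in large_screen:
            pair = [("small", s), ("large", l)]
            rng.shuffle(pair)  # randomize left/right position
            pairs.append(tuple(pair))
    rng.shuffle(pairs)  # randomize presentation order
    return pairs

# Placeholder IDs, not real selections from the pilot.
example_pairs = build_rating_pairs(["dress_a", "dress_b"], ["dress_c", "dress_d"])
```

Each rater response would then be recorded as the condition label of the item they picked, which keeps the tally independent of which side the item appeared on.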
[Calculations & Results]
I don't have meaningful results yet because I haven't run anything on Mechanical Turk. I intend to use chi-square tests once I have ratings from my independent raters.
I will also use chi-square tests to analyze confidence, category heuristics, and time.
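As a rough illustration of what such a test could look like, here is a minimal sketch of a chi-square test of independence on a 2x2 table, for example screen condition against whether the subject felt he/she needed more time. The counts are placeholders I made up for illustration, not pilot data, and the function name is hypothetical. It uses no continuity correction, and the p-value formula is valid only for one degree of freedom (a 2x2 table).

```python
import math

def chi_square_2x2(table):
    """Pearson chi-square test of independence for a 2x2 table of counts.

    Returns (statistic, p_value). No Yates continuity correction is applied.
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    # For 1 degree of freedom, P(X > stat) = erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value

# Placeholder counts (NOT real results):
# rows = small screen, large screen; columns = needed more time, did not.
stat, p = chi_square_2x2([[20, 10], [10, 20]])
```

With only a handful of pilot subjects, expected counts below about 5 make the chi-square approximation unreliable, so an exact test such as Fisher's may be a safer choice until the sample is larger.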
My pilot results:
Average small-screen confidence: 2.5
Average large-screen confidence: 3.6
[Further Questions]
How can I track the ways in which the subject refined his/her search besides watching?
How should I have Mechanical Turk workers rate the selections? (Ask which is ‘more appropriate’, ‘better’, or which one ‘Jamie/Matt will like more’?)
Should I have them choose between two at a time? (Strict ordering does not give me as much information)
Should I have both conditions fill out the pre & post task on the big monitor? Or do all of it on the small device?
Should I use a laptop/monitor instead of phone/monitor to help control for processing & network speed?