We ask fellows to work on a small challenge problem to assess problem solving and coding capabilities
Ask yourself why would they have selected this problem for the challenge? What are some gotchas in this domain I should know about?
What is the highest level of accuracy that others have achieved with this dataset or similar problems / datasets ?
What types of visualizations will help me grasp the nature of the problem / data?
What feature engineering might help improve the signal?
Which modeling techniques are good at capturing the types of relationships I see in this data?
Now that I have a model, how can I be sure that I didn't introduce a bug in the code? If results are too good to be true, they probably are!
What are some of the weaknesses of the model and and how can the model be improved with additional work.
After finishing the challenge, we ask fellows to submit a brief video (not more than 7 minutes) on YouTube and share the link with us. The video should contain the following
Please do not read from a script written beforehand.
Do not add any background music
Please ensure that the video is unlisted and not private