1 of 7

Data Science Competitions: How to Participate

Ashis Kumer Biswas, Ph.D.

Assistant Professor, CSE @ CU Denver

http://ml.cse.ucdenver.edu/

It’s not the destination, it’s the journey.

2 of 7

Competitive Programming

Given a problem, solve it with better performance than your fellow competitors, before a target deadline.

A Problem

Solve it

Breaking records

Before Deadline

3 of 7

Competition Venues

  • Kaggle [https://www.kaggle.com]
  • DrivenData [https://www.drivendata.org/]
  • Most Data Science conferences host competitions.

4 of 7

General tips for staying competitive

  1. Start early: read, read, and read, and discuss with your partner(s). Understand the problem.
  2. Quickly identify the problem type and solve it first with the simplest idea that comes to mind.
  3. Discipline yourself. The number of hours you put in matters.
  4. Analyze your algorithm. If you later think of something better, choose a different algorithm, customize an existing one, or move to a more advanced one. Go for incremental (tiny) improvements.
  5. Master your programming language(s) and the associated libraries.
  6. Master the art of testing code.
  7. Work as a team.
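
As an illustration of tips 2 and 6, here is a minimal sketch of a "simplest idea first" baseline: always predict the most frequent training label, then score it. The function names and the toy labels are assumptions made up for this example and are not tied to any particular competition. A baseline like this gives a score that every later, smarter model has to beat.

```python
from collections import Counter


def fit_majority_baseline(labels):
    """Return the most frequent label seen in the training labels."""
    if not labels:
        raise ValueError("labels must not be empty")
    return Counter(labels).most_common(1)[0][0]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Toy labels, invented for illustration only.
train_labels = ["cat", "dog", "cat", "cat", "dog"]
valid_labels = ["cat", "dog", "cat", "dog"]

# "Train" the baseline and predict the same label for every validation row.
majority = fit_majority_baseline(train_labels)
baseline_preds = [majority] * len(valid_labels)
baseline_score = accuracy(valid_labels, baseline_preds)

# A tiny self-check in the spirit of tip 6: a perfect prediction scores 1.0.
assert accuracy(valid_labels, valid_labels) == 1.0
```

Submitting even this trivial baseline early confirms that the whole pipeline (loading, predicting, writing the submission) works before time is spent on better models.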

5 of 7

Some specific tips

  • Become proficient at preprocessing the dataset, including feature extraction.
  • Learn to explore the given dataset. If the problem comes from a field different from yours, gather relevant domain knowledge where possible.
  • Learn about hyper-parameter tuning.
  • Learn about ensemble strategies.
  • Learn to beat overfitting. Start developing good habits in each contest.
  • Research exhaustively! A winning submission consists of many tiny, meticulously performed steps that would be ignored in a less competitive venue.
  • Practice, practice, and practice!
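
Two of the tips above, hyper-parameter tuning and beating overfitting, are often handled together with k-fold cross-validation. The following is a rough sketch under stated assumptions: the data is synthetic, the model is a hand-written 1-D k-nearest-neighbour classifier, and all function names are hypothetical. In a real contest you would more likely rely on an established library than on this code.

```python
import random
from collections import Counter


def k_fold_indices(n_samples, n_folds, seed=0):
    """Shuffle the indices once and split them into disjoint validation folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::n_folds] for i in range(n_folds)]


def knn_predict(train_x, train_y, query, k):
    """Predict the majority label among the k nearest 1-D training points."""
    order = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - query))
    return Counter(train_y[i] for i in order[:k]).most_common(1)[0][0]


def cross_val_accuracy(xs, ys, k, n_folds=5, seed=0):
    """Mean held-out accuracy of k-NN across the folds."""
    scores = []
    for fold in k_fold_indices(len(xs), n_folds, seed):
        held_out = set(fold)
        # Train only on rows outside the current validation fold.
        train_x = [x for i, x in enumerate(xs) if i not in held_out]
        train_y = [y for i, y in enumerate(ys) if i not in held_out]
        correct = sum(knn_predict(train_x, train_y, xs[i], k) == ys[i] for i in fold)
        scores.append(correct / len(fold))
    return sum(scores) / len(scores)


# Synthetic data (an assumption for this sketch): the label is 1 when x > 0.5,
# with roughly 10% of the labels flipped as noise.
rng = random.Random(42)
xs = [rng.random() for _ in range(100)]
ys = [int(x > 0.5) if rng.random() > 0.1 else int(x <= 0.5) for x in xs]

# Tune the hyper-parameter k on held-out folds, never on the training rows.
candidate_ks = [1, 3, 5, 9, 15]
cv_scores = {k: cross_val_accuracy(xs, ys, k) for k in candidate_ks}
best_k = max(cv_scores, key=cv_scores.get)
print(cv_scores, "best k:", best_k)
```

Scoring each candidate only on rows the model did not see during training is what guards against picking a setting that merely memorizes noise. The same loop can compare preprocessing choices or ensemble members.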

6 of 7

Final tips

  • Set incremental goals.
  • Watch or read interviews with the authors of winning entries.
  • Review the most-voted kernels and discussion notes.
  • Don’t worry about who is competing, whether they are PhD researchers, industry experts, etc.
  • Don’t worry about a low rank. Stay focused and determined to solve the problem and submit your entry.
  • Participate in many competitions to develop your core skills.

7 of 7

Thanks

  • Any questions?