How to ML Paper - A brief Guide
Feel free to comment / share and happy paper writing! Also, please see caveats* below.
If you like this, why not follow How to ML on Twitter and share the advice/love?
Abstract (TL;DR of paper):
X: What are we trying to do and why is it relevant?
Y: Why is this hard?
Z: How do we solve it (i.e. our contribution!)
1: How do we verify that we solved it:
1a) Experiments and results
1b) Theory
Introduction (Longer version of the Abstract, i.e. of the entire paper):
X: What are we trying to do and why is it relevant?
Y: Why is this hard?
Z: How do we solve it (i.e. our contribution!)
1: How do we verify that we solved it:
1a) Experiments and results, including comparison to prior SOTA if applicable
1b) Theory
2: New trend: specifically list your contributions as bullet points (credits to Brendan)
Extra space? Future work!
Extra points for having Figure 1 on the first page
Related Work:
Academic siblings of our work, i.e. alternative attempts in literature at trying to solve the same problem.
Goal is to “Compare and contrast” - how does their approach differ in either assumptions or method? If their method is applicable to our Problem Setting I expect a comparison in the experimental section. If not, there needs to be a clear statement why a given method is not applicable.
Note: Just describing what another paper is doing is not enough. We need to compare and contrast.
Background:
Academic Ancestors of our work, i.e. all concepts and prior work that are required for understanding our method.
Usually includes a subsection, Problem Setting, which formally introduces the problem setting and notation (Formalism) for our method. Highlights any specific assumptions that are made that are unusual.
Note: If our paper introduces a novel problem setting as part of its contributions, it’s best to have a separate Section.
[Problem Setting as separate section – only if the Problem Setting is novel, i.e. a contribution]
Method:
What we do. Why we do it. All described using the general Formalism introduced in the Problem Setting and building on top of the concepts / foundations introduced in Background.
Experimental Setup:
How do we test that our stuff works? Introduces a specific instantiation of the Problem Setting and specific implementation details of our Method for this Problem Setting.
Results and Discussion:
Shows the results of running Method on our problem described in Experimental Setup. Compares to baselines mentioned in Related Work. Includes statistics and confidence intervals. Includes statements on hyperparameters and other potential issues of fairness. Includes ablation studies to show that specific parts of the method are relevant. Discusses limitations of the method.
Conclusion:
We did it. This paper rocks and you are lucky to have read it (i.e. brief recap of the entire paper). Also, we’ll do all these other amazing things in the future.
To keep going with the analogy, you can think of future work as (potential) academic offspring (credits to James).
Start with an outline rather than full text. Each line in the outline captures one idea and will correspond to one paragraph in the final version. It is much easier to change the outline of a building before building it. This is a great point in time to have conversations with others if you are unsure.
Next, expand the outline, but keep the summary line as Latex comments ahead of every paragraph:
%TL;DR of paragraph
Lorem ipsum dolorem… (expansion of the TL;DR), a full paragraph of text that says the TL;DR as easily and clearly as possible. No fluff, not trying to impress, just saying what is and what is not and why think so. Simple, no?
This will a) keep you on track and b) make it easy for anyone providing feedback to quickly see what the overall flow is.
I recommend writing the Abstract as soon as possible, even before results are finalised. Writing the Abstract first helps sharpen your focus and highlights issues with the paper / method / experiments. Of course you might have to make changes to the writing later, but that's much easier than trying to fix experiments / methods because you realise late that your story doesn't make sense.
A quick note on author ordering / inclusion, since this keeps coming up:
Example:
“The bank loan problem can be reformulated as a special subset of the contextual bandit problem” =>
“The bank loan problem is a special instance of a contextual bandit problem”
✔ The relationship between the radius and area of a circle is A = r^2 \pi.
❌ The relationship between the radius and area of a circle is: A = r^2 \pi.
lazy.
Last not least - communicate plenty with all authors (i.e. at least daily for the last week) to stay on track and have fun!!
PS: Nothing here is binding but I think it makes it much easier for everyone if we stick to a basic structure when writing papers. Think of it like a broad convention that allows readers to quickly process papers.
PPS: These broad best-practice suggestions are the result of writing papers with a fantastic set of mentors, students and other collaborators - all credits go to them! Special shout-out to my former PhD advisor, Shimon, who inspired many of the points here!
* Caveats:
I am sure there are many stylistic subtleties around writing that I am missing in this guide, but winning a literature or poetry competition is definitely not the goal. The goal is to write text that gets complicated content across efficiently. If in doubt just ask yourself if what you wrote is the simplest and clearest way to get the idea across. Also, you don’t need to follow every point religiously. But when you disregard one of these rules, you should be able to explain the motivation behind the rule and why your case is different. E.g., “Passive voice is normally a bad idea since it obscures the subject of the sentence. But in this specific sentence, passive voice was used because the object is the focus of the sentence. Also, the object was mentioned previously, so putting it first lessens the reader’s mental load.” (credits to Roger)
Comments / questions? Email jakob at robots dot ox dot ac dot uk or comment on Twitter. You can also let me know further common mistakes and I’ll add them here (w/ credit).
Licence