"What Can We Do to Improve Peer Review in NLP?" ๐Ÿ‘€

AI Coffee Break with Letitia ยท Beginner ยท๐Ÿ“„ Research Papers Explained ยท5y ago

Key Takeaways

This video discusses ways to improve peer review in natural language processing based on the paper by A. Rogers and I. Augenstein

Full Transcript

[Music] hello to everyone miss coffee bean here finally letizia has found some time to animate me and give me my voice back i missed being out here with you guys today we are going to talk about peer review in nlp something that leticia has been busy with during the last few days and in this video we will be presenting this awesome paper by anna rogers and isabel augenstein it is a short paper accepted at mnlp findings this year it discusses how to improve peer review a topic that could change the world well could peer review really change the world yeah why not think about one of the main chains of ideas research with new ideas coming not only from academia but also from industry should be spread to others one way to do this is through publications what ideas are valuable and impactful enough to be published is determined by peer review where other scientists get to say how much they like the research and how much not after publications the ideas can be taken on by others for improvement for further spreading or integration into super cool machine learning applications right but how to do the application part properly if the first part of the chain is flawed because surprise surprise peer review is broken if we want to know how to fix it we also want to know how it is broken so it is time to look at this paper even though the paper is about the natural language processing field we think that the general message can be useful for any kind of peer review so let's see the paper really does not mess around and starts abruptly with harsh claims like peer review does not guarantee quality control ouch but no matter how much this sentence hurts we cannot but agree to what they argue peer review is meant to guarantee that what is written in the paper is correct and high quality but this is an impossible task because the papers are not reproducible especially when thinking about the poor reviewers with limited time and unpaid review work then the others say that peer review fails to detect impactful papers again very harsh but quite true important for paper impact is often not only the science itself but other factors like the topic the promotion on for example the social media or how easy it is to take an idea and build on it further look at for example bird or transformers okay so surprise surprise peer review is hard and ill defined so the authors ask what could peer review realistically do well it could filter out obvious flaws and turn the spotlight on to quality right but the problem is that we are not doing that instead we are selecting or trying to select the best x percent of the papers and accept them what could possibly go wrong here by ah yes we do not know where and how to draw the line the idea that miss coffee being appreciated the most out of this paper is represented in this figure reviewing is like comparing apples with oranges different papers have different strengths one has better methodology the other one has better evaluation or should we select the idea but not so much its implementation or what should we focus actually on i don't know how to judge this i'm not gonna actually my special so while miss coffee bean is having a mental breakdown i can tell you that the other thing that i liked most about this paper is the idea that reviewers choose different ways of coping with this impossible task that is required from them so miss coffee bean now that you recovered from your breakdown what are the coping mechanisms ah writing style it is hard to resist the temptation to think that if there are language errors in the paper the science is bad too but this thinking is so wrong non-native english speakers can produce awesome science too well the next way to cope with the impossible reviewing task is to look at only one table in the paper you know the one containing the result if the proposed contribution is not the best there we reject we want only state of the art right you know but this ignores many things like efficiency interpretability all the other goodies beyond a high score in the evaluation metrics niche topics like you know everything that is not transformer or is not i cite the paper scientifically sexy is so hard to publish especially in natural language processing it is easy to publish work on english language but it is hard on others as if findings on other languages are not as generalizable as the findings on english to this wall of shame the authors further add the problem widespread in the whole machine learning area like already famous work where researchers publish already on archive so the anonymous peer review is not so anonymous in the end also big labs which invest a lot of work into public relations are famous and they get even more famous in my observation the big labs are even more recognizable in review just by you know compute resources or even plotting styles so let's move on to the next coping mechanism we sometimes tend to shame two simple solutions because simple means it is easier right nope it's not over complication is easier than simplicity another way of coping is through spotlighting mainstream approaches like deep learning but no deep learning is not the answer to everything and other methods have their merit too and especially in the big data requirements of deep learning it is quite unintuitive why we do not appreciate resource papers so much so okay we're almost at the end of the wall of shame novel approaches are too often rejected this sounds absurd yes and the authors explain it through the human way of saying no to new stuff because it is hard to comprehend especially at first and the last and i think by far the worst entry of the wall of shame are substitute questions that reviewers tend to ask in the impossible situations of assessing quality if i did this study would i have made the same choices and in our opinion this is the worst coping mechanism because it is the most subjective one and has to do a lot with the different backgrounds and personal ways of thinking of reviewers i mean the other coping mechanisms are at least expected and easy to counter but this last point about the substitute questions is as varied as there are reviewers for example if you write a paper and test your model on data set x and data set y the reviewer says or asks but why did you not test on data set z the reviewer might actually mean if i did this study i would have tested on z and the problem with this question is that it is hard to say if the reason for having data set z is really scientifical or just you know personal so anyway while looking at this wall of shame one might come up with the idea just to abolish peer review but we remind about what we discussed at the beginning of this video where we need to have a filter for publications for non-experts so they know what's quality and what's not so what to do the authors of the paper propose that reviewer work should be more valued and the working hours taken into account by employers review work should really not be done by overworked people in their spare time and then they propose some ways to reduce the uncertainty that comes with the poorer task definition of reviewing for example more organization energy should be invested into reviewer matching for assigning competent reviewers for each research topic then the review task should be more fine-grained to capture the diversity of papers and also the review forms should be able to capture this announcing the conference or journal priorities before submission and reviewers giving not an overall score but specific scores that are weighted differently depending on these editorial priorities should also improve the definition of the task of peer review and the reviewers faced with a well-defined task would be then relying less on coping mechanisms okay so when do we do this many others had the ideas presented in this paper and what i really appreciate most is that the authors managed to remind here by everybody again about the peer review problems so maybe this wake up call will finally have some effect miss coffee bean wanted to spread the word further with this video if you want to help too share this video with anybody interested or share the paper you can find the link to the paper in the description below see you next time okay bye

Original Description

"What Can We Do to Improve Peer Review in NLP?" by A. Rogers and I. Augenstein explained by Ms. Coffee Bean! Enjoy! ... and avoid the coping mechanisms from the wall of shame! ๐Ÿ˜… โžก๏ธ AI Coffee Break Merch! ๐Ÿ›๏ธ https://aicoffeebreak.creator-spring.com/ โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€ ๐Ÿ”ฅ Optionally, pay us a coffee to boost our Coffee Bean production! โ˜• Patreon: https://www.patreon.com/AICoffeeBreak Ko-fi: https://ko-fi.com/aicoffeebreak โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€โ–€ Outline: * 00:00 The importance of peer review * 02:02 Peer review != quality * 03:23 What reviewing really is * 04:39 Coping mechanisms * 07:27 The worst entry on the wall of shame * 09:00 What to do? ๐Ÿ“„ Paper explained: Rogers, Anna, and Isabelle Augenstein. "What Can We Do to Improve Peer Review in NLP?" Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings. 2020. https://www.aclweb.org/anthology/2020.findings-emnlp.112.pdf ๐Ÿ”— Links: YouTube: https://www.youtube.com/AICoffeeBreak Twitter: https://twitter.com/AICoffeeBreak Reddit: https://www.reddit.com/r/AICoffeeBreak/ #AICoffeeBreak #MsCoffeeBean #emnlp2020 #MachineLearning #AI #research Video and thumbnail contain emojis designed by OpenMoji โ€“ the open-source emoji and icon project. License: CC BY-SA 4.0
Watch on YouTube โ†— (saves to browser)
Sign in to unlock AI tutor explanation ยท โšก30

Playlist

Uploads from AI Coffee Break with Letitia ยท AI Coffee Break with Letitia ยท 20 of 60

1 AI Coffee Break - Channel Trailer
AI Coffee Break - Channel Trailer
AI Coffee Break with Letitia
2 How to check if a neural network has learned a specific phenomenon?
How to check if a neural network has learned a specific phenomenon?
AI Coffee Break with Letitia
3 A brief history of the Transformer architecture in NLP
A brief history of the Transformer architecture in NLP
AI Coffee Break with Letitia
4 Our paper at CVPR 2020 - MUL Workshop and ACL 2020 - ALVR Workshop
Our paper at CVPR 2020 - MUL Workshop and ACL 2020 - ALVR Workshop
AI Coffee Break with Letitia
5 The Transformer neural network architecture EXPLAINED. โ€œAttention is all you needโ€
The Transformer neural network architecture EXPLAINED. โ€œAttention is all you needโ€
AI Coffee Break with Letitia
6 Transformer combining Vision and Language? ViLBERT - NLP meets Computer Vision
Transformer combining Vision and Language? ViLBERT - NLP meets Computer Vision
AI Coffee Break with Letitia
7 Pre-training of BERT-based Transformer architectures explained โ€“ language and vision!
Pre-training of BERT-based Transformer architectures explained โ€“ language and vision!
AI Coffee Break with Letitia
8 GPT-3 explained with examples. Possibilities, and implications.
GPT-3 explained with examples. Possibilities, and implications.
AI Coffee Break with Letitia
9 Adversarial Machine Learning explained! | With examples.
Adversarial Machine Learning explained! | With examples.
AI Coffee Break with Letitia
10 BERTology meets Biology | Solving biological problems with Transformers
BERTology meets Biology | Solving biological problems with Transformers
AI Coffee Break with Letitia
11 Can a neural network tell if an image is mirrored? โ€“ Visual Chirality
Can a neural network tell if an image is mirrored? โ€“ Visual Chirality
AI Coffee Break with Letitia
12 The ultimate intro to Graph Neural Networks. Maybe.
The ultimate intro to Graph Neural Networks. Maybe.
AI Coffee Break with Letitia
13 Can language models understand? Bender and Koller argument.
Can language models understand? Bender and Koller argument.
AI Coffee Break with Letitia
14 GANs explained | Generative Adversarial Networks video with showcase!
GANs explained | Generative Adversarial Networks video with showcase!
AI Coffee Break with Letitia
15 What nobody tells you about MULTIMODAL Machine Learning! ๐Ÿ™Š THE definition.
What nobody tells you about MULTIMODAL Machine Learning! ๐Ÿ™Š THE definition.
AI Coffee Break with Letitia
16 Multimodal Machine Learning models do not work. Here is why. Part 1/2 โ€“ The SYMPTOMS
Multimodal Machine Learning models do not work. Here is why. Part 1/2 โ€“ The SYMPTOMS
AI Coffee Break with Letitia
17 Why Multimodal Machine Learning models do not work. Part 2/2 โ€“ The CAUSES
Why Multimodal Machine Learning models do not work. Part 2/2 โ€“ The CAUSES
AI Coffee Break with Letitia
18 An image is worth 16x16 words: ViT | Vision Transformer explained
An image is worth 16x16 words: ViT | Vision Transformer explained
AI Coffee Break with Letitia
19 AI understanding language!? A roadmap to natural language understanding.
AI understanding language!? A roadmap to natural language understanding.
AI Coffee Break with Letitia
โ–ถ "What Can We Do to Improve Peer Review in NLP?" ๐Ÿ‘€
"What Can We Do to Improve Peer Review in NLP?" ๐Ÿ‘€
AI Coffee Break with Letitia
21 The curse of dimensionality. Or is it a blessing?
The curse of dimensionality. Or is it a blessing?
AI Coffee Break with Letitia
22 PCA explained with intuition, a little math and code
PCA explained with intuition, a little math and code
AI Coffee Break with Letitia
23 Data-efficient Image Transformers EXPLAINED! Facebook AI's DeiT paper
Data-efficient Image Transformers EXPLAINED! Facebook AI's DeiT paper
AI Coffee Break with Letitia
24 OpenAI's DALL-E explained. How GPT-3 creates images from descriptions.
OpenAI's DALL-E explained. How GPT-3 creates images from descriptions.
AI Coffee Break with Letitia
25 Leaking training data from GPT-2. How is this possible?
Leaking training data from GPT-2. How is this possible?
AI Coffee Break with Letitia
26 OpenAIโ€™s CLIP explained! | Examples, links to code and pretrained model
OpenAIโ€™s CLIP explained! | Examples, links to code and pretrained model
AI Coffee Break with Letitia
27 Transformers can do both images and text. Here is why.
Transformers can do both images and text. Here is why.
AI Coffee Break with Letitia
28 UMAP explained | The best dimensionality reduction?
UMAP explained | The best dimensionality reduction?
AI Coffee Break with Letitia
29 NVIDIA Jarvis (now NVIDIA Riva) meets Ms. Coffee Bean
NVIDIA Jarvis (now NVIDIA Riva) meets Ms. Coffee Bean
AI Coffee Break with Letitia
30 Transformer in Transformer: Paper explained and visualized | TNT
Transformer in Transformer: Paper explained and visualized | TNT
AI Coffee Break with Letitia
31 [RANT] Adversarial attack on OpenAIโ€™s CLIP? Are we the fools or the foolers?
[RANT] Adversarial attack on OpenAIโ€™s CLIP? Are we the fools or the foolers?
AI Coffee Break with Letitia
32 Pattern Exploiting Training explained! | PET, iPET, ADAPET
Pattern Exploiting Training explained! | PET, iPET, ADAPET
AI Coffee Break with Letitia
33 Deep Learning for Symbolic Mathematics!? | Paper EXPLAINED
Deep Learning for Symbolic Mathematics!? | Paper EXPLAINED
AI Coffee Break with Letitia
34 FNet: Mixing Tokens with Fourier Transforms โ€“ Paper Explained
FNet: Mixing Tokens with Fourier Transforms โ€“ Paper Explained
AI Coffee Break with Letitia
35 Are Pre-trained Convolutions Better than Pre-trained Transformers? โ€“ Paper Explained
Are Pre-trained Convolutions Better than Pre-trained Transformers? โ€“ Paper Explained
AI Coffee Break with Letitia
36 "Please Commit More Blatant Academic Fraud" โ€“ A fellow PhD student's response.
"Please Commit More Blatant Academic Fraud" โ€“ A fellow PhD student's response.
AI Coffee Break with Letitia
37 Scaling Vision Transformers? How much data can a transformer get? #Shorts
Scaling Vision Transformers? How much data can a transformer get? #Shorts
AI Coffee Break with Letitia
38 How cross-modal are vision and language models really? ๐Ÿ‘€ Seeing past words. [Own work]
How cross-modal are vision and language models really? ๐Ÿ‘€ Seeing past words. [Own work]
AI Coffee Break with Letitia
39 Charformer: Fast Character Transformers via Gradient-based Subword Tokenization +Tokenizer explained
Charformer: Fast Character Transformers via Gradient-based Subword Tokenization +Tokenizer explained
AI Coffee Break with Letitia
40 Positional embeddings in transformers EXPLAINED | Demystifying positional encodings.
Positional embeddings in transformers EXPLAINED | Demystifying positional encodings.
AI Coffee Break with Letitia
41 Adding vs. concatenating positional embeddings & Learned positional encodings
Adding vs. concatenating positional embeddings & Learned positional encodings
AI Coffee Break with Letitia
42 Self-Attention with Relative Position Representations โ€“ Paper explained
Self-Attention with Relative Position Representations โ€“ Paper explained
AI Coffee Break with Letitia
43 Saddle points vs. local minima in high dimensional spaces | โ“ #AICoffeeBreakQuiz #Shorts
Saddle points vs. local minima in high dimensional spaces | โ“ #AICoffeeBreakQuiz #Shorts
AI Coffee Break with Letitia
44 What is the model identifiability problem? | Explained in 60 seconds! | โ“ #AICoffeeBreakQuiz #Shorts
What is the model identifiability problem? | Explained in 60 seconds! | โ“ #AICoffeeBreakQuiz #Shorts
AI Coffee Break with Letitia
45 Data leakage during data preparation? | Using AntiPatterns to avoid MLOps Mistakes
Data leakage during data preparation? | Using AntiPatterns to avoid MLOps Mistakes
AI Coffee Break with Letitia
46 Is today's AI smarter than YOU? #Shorts
Is today's AI smarter than YOU? #Shorts
AI Coffee Break with Letitia
47 Convolution vs Cross-Correlation. How most CNNs do not compute convolutions. | โ“ #Shorts
Convolution vs Cross-Correlation. How most CNNs do not compute convolutions. | โ“ #Shorts
AI Coffee Break with Letitia
48 Why do we care about cross-correlations vs convolutions | โ“ #AICoffeeBreakQuiz #Shorts
Why do we care about cross-correlations vs convolutions | โ“ #AICoffeeBreakQuiz #Shorts
AI Coffee Break with Letitia
49 The convolution is not shift invariant. | Invariance vs Equivariance | โ“ #AICoffeeBreakQuiz #Shorts
The convolution is not shift invariant. | Invariance vs Equivariance | โ“ #AICoffeeBreakQuiz #Shorts
AI Coffee Break with Letitia
50 How to increase the receptive field in CNNs? | #AICoffeeBreakQuiz #Shorts
How to increase the receptive field in CNNs? | #AICoffeeBreakQuiz #Shorts
AI Coffee Break with Letitia
51 What is tokenization and how does it work? Tokenizers explained.
What is tokenization and how does it work? Tokenizers explained.
AI Coffee Break with Letitia
52 Foundation Models | On the opportunities and risks of calling pre-trained models โ€œFoundation Modelsโ€
Foundation Models | On the opportunities and risks of calling pre-trained models โ€œFoundation Modelsโ€
AI Coffee Break with Letitia
53 How modern search engines work โ€“ Vector databases explained! | Weaviate open-source
How modern search engines work โ€“ Vector databases explained! | Weaviate open-source
AI Coffee Break with Letitia
54 Eyes tell all: How to tell that an AI generated a face?
Eyes tell all: How to tell that an AI generated a face?
AI Coffee Break with Letitia
55 Swin Transformer paper animated and explained
Swin Transformer paper animated and explained
AI Coffee Break with Letitia
56 Data BAD | What Will it Take to Fix Benchmarking for NLU?
Data BAD | What Will it Take to Fix Benchmarking for NLU?
AI Coffee Break with Letitia
57 SimVLM explained | What the paper doesnโ€™t tell you
SimVLM explained | What the paper doesnโ€™t tell you
AI Coffee Break with Letitia
58 Generalization โ€“ Interpolation โ€“ Extrapolation in Machine Learning: Which is it now!?
Generalization โ€“ Interpolation โ€“ Extrapolation in Machine Learning: Which is it now!?
AI Coffee Break with Letitia
59 Do Transformers process sequences of FIXED or of VARIABLE length? | #AICoffeeBreakQuiz
Do Transformers process sequences of FIXED or of VARIABLE length? | #AICoffeeBreakQuiz
AI Coffee Break with Letitia
60 The efficiency misnomer | Size does not matter | What does the number of parameters mean in a model?
The efficiency misnomer | Size does not matter | What does the number of parameters mean in a model?
AI Coffee Break with Letitia

Related Reads

๐Ÿ“ฐ
On July 1, 2026, arXiv will spin out from Cornell University, its home for the past 25 years, to become an independent nonprofit organization. Major funding support from Simons Foundation and Schmidt Sciences. Ditching the red for their website. [N]
arXiv is becoming an independent nonprofit organization after 25 years at Cornell University, backed by major funding, which will impact the future of research and academia
Reddit r/MachineLearning
๐Ÿ“ฐ
CS-NRRMโ„ข Official Publications: Paper 1 and Paper 2 Are Now Available
Learn about the CS-NRRM's official publications on a 12-year longitudinal human observation archive and its significance in research and development
Medium ยท Data Science
๐Ÿ“ฐ
Found a potential mistake in an ICLR 2026 blogpost [D]
Verify a potential mistake in an ICLR 2026 blog post and learn how to effectively report errors in academic publications
Reddit r/MachineLearning
๐Ÿ“ฐ
Rebuttals Move Peer-Review Scores, but Initial-Review Structure Bounds the Movement
Learn how author rebuttals impact peer-review scores and the factors that influence their effectiveness in ICLR 2024-2025, using LLMs for measurement
ArXiv cs.AI
Up next
Indians Under House Arrest in America? ๐Ÿ˜ฑ Immigration Crisis Explained | SumanTV Classroom
SumanTV Classroom
Watch โ†’