I'm Director of Machine Learning at cohere.ai. My team focus is to make extremely large language models (up to 100B+ parameters) accessible for semantic search and classification.
Natural Language Processing and Machine Learning
I’m Director of Machine Learning at cohere.ai. My team develops state-of-the-art methods for search using (large) transformer networks using various techniques (dense & sparse embeddings, neural re-ranking, query generation). The most common challenge of applying neural search models is the lack of labeled training data in most domains. Hence, we develop novel strategies how to bridge this gap by combining the strengths of various approaches. See the the most recent publications and my talks for our current research focus.
Publications
2023
- DAPR: A Benchmark on Document-Aware Passage Retrieval
Kexin Wang, Nils Reimers, Iryna Gurevych. arxiv 2023. [pdf]
2022
- Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking
Tim Baumgärtner, Leonardo F. R. Ribeiro, Nils Reimers, Iryna Gurevych. EMNLP 2022. [pdf] [software] - MTEB: Massive Text Embedding Benchmark
Niklas Muennighoff, Nouamane Tazi, Loïc Magne, Nils Reimers. arxiv 2022. [pdf] [software]
- Efficient Few-Shot Learning Without Prompts
Lewis Tunstall, Nils Reimers, Unso Eun Seo Jo, Luke Bates, Daniel Korat, Moshe Wasserblat, Oren Pereg. NeurIPS 2022 ENLSP Workshop. [pdf] [software] - Domain Adaptation for Memory-Efficient Dense Retrieval
Nandan Thakur, Nils Reimers, Jimmy Lin. arxiv 2022. [pdf] [software] - UKP-SQUARE: An Online Platform for Question Answering Research
Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, Jonas Pfeiffer, Nils Reimers, Gözde Gül Şahin, Iryna Gurevych. ACL 2022 Demo. [pdf] [software] - OpenAI GPT-3 Text Embeddings – Really a new state-of-the-art in dense text embeddings?
Nils Reimers. medium 2022. [post]
2021
- GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval
Kexin Wang, Nandan Thakur, Nils Reimers, Iryna Gurevych. arxiv 2021. [pdf] [software] - BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
Nandan Thakur, Nils Reimers, Andreas Rücklé, Abhishek Srivastava, Iryna Gurevych. NeurIPS 2021 Datasets and Benchmarks Track. [pdf] [software] - TWEAC: Transformer with Extendable QA Agent Classifiers
Gregor Geigle, Nils Reimers, Andreas Rücklé, Iryna Gurevych. arxiv 2021. [pdf] [software] - TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning
Kexin Wang, Nils Reimers, Iryna Gurevych. EMNLP 2021 Findings. [pdf] [software] - Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval
Gregor Geigle, Jonas Pfeiffer, Nils Reimers, Ivan Vulić, Iryna Gurevych. TACL 2021. [pdf] [software]
2020
- The Curse of Dense Low-Dimensional Information Retrieval for Large Index Sizes
Nils Reimers, Iryna Gurevych. ACL 2021. [pdf]
- Cross-Document Event Coreference Resolution Beyond Corpus-Tailored Systems
Michael Bugert, Nils Reimers, Iryna Gurevych. Journal Computational Linguistics 2021. [pdf] [software] - AdapterDrop: On the Efficiency of Adapters in Transformers
Andreas Rücklé, Gregor Geigle, Max Glockner, Tilman Beck, Jonas Pfeiffer, Nils Reimers, Iryna Gurevych. EMNLP 2021. [pdf] - Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks
Nandan Thakur, Nils Reimers, Johannes Daxenberger, Iryna Gurevych. NAACL 2021. [pdf] [software] - Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation
Nils Reimers, Iryna Gurevych. EMNLP 2020. [pdf] [software] - Breaking the Subtopic Barrier in Cross-Document Event Coreference Resolution
Michael Bugert, Nils Reimers, Shany Barhom, Ido Dagan, Iryna Gurevych. Text2Story@ECIR’20. [pdf]
2019
- Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers, Iryna Gurevych. EMNLP 2019. [pdf] [software] - Classification and Clustering of Arguments with Contextualized Word Embeddings
Nils Reimers, Benjamin Schiller, Tilman Beck, Johannes Daxenberger, Christian Stab, Iryna Gurevych. ACL 2019. [pdf] [software] - Revisiting Joint Modeling of Cross-document Entity and Event Coreference Resolution
Shany Barhom, Vered Shwartz, Alon Eirew, Michael Bugert, Nils Reimers, Ido Dagan. ACL 2019. [pdf] [software] - Alternative Weighting Schemes for ELMo Embeddings
Nils Reimers, Iryna Gurevych. arxiv 2019. [pdf] [software]
2018
- Universal Machine Learning Methods for Detecting and Temporal Anchoring of Events
Nils Reimers. Ph.D. thesis. [pdf]
- Why Comparing Single Performance Scores Does Not Allow to Draw Conclusions About Machine Learning Approaches
Nils Reimers, Iryna Gurevych. arxiv 2018. [pdf]
- Event Time Extraction with a Decision Tree of Neural Classifiers
Nils Reimers, Nazanin Dehghani, Iryna Gurevych. TACL 2018. [pdf] [software]
2017
- Reporting Score Distributions Makes a Difference: Performance Study of LSTM-networks for Sequence Tagging
Nils Reimers, Iryna Gurevych. EMNLP 2017. [pdf] [software]
- Optimal Hyperparameters for Deep LSTM-Networks for Sequence Labeling Tasks
Nils Reimers, Iryna Gurevych. arxiv 2017. [pdf] [software]
2016
- Task-Oriented Intrinsic Evaluation of Semantic Textual Similarity
Nils Reimers, Philip Beyer, Iryna Gurevych. COLING 2016. [pdf]
- Temporal Anchoring of Events for the TimeBank Corpus
Nils Reimers, Nazanin Dehghani, Iryna Gurevych. ACL 2016. [pdf] [software]
- DARIAH‐DKPro‐Wrapper Output Format(DOF) Specification
Fotis Jannidis, Stefan Pernes, Isabella Reger, Nils Reimers, Steffen Pielström, Thorsten Vitt. DARIAH-DE Working Paper. [pdf] [software].
- A Tool for NLP-Preprocessing in Literary Text Analysis
Nils Reimers, Fotis Jannidis, Steffen Pielström, Stefan Pernes, Isabella Reger. Digital Humanities 2016: Conference Abstracts. [software].
- A Novel Attack Model for Collusion Secure Fingerprinting Codes
Marcel Schäfer, Waldemar Berchtold, Nils Reimers, Teetje Stark, Martin Steinebach. EI 2016. [pdf]
2015
- Event Nugget Detection, Classification and Coreference Resolution using Deep Neural Networks and Gradient Boosted Decision Trees
Nils Reimers, Iryna Gurevych. TAC 2015. [pdf] [software]
2014
- GermEval-2014: Nested Named Entity Recognition with Neural Networks
Nils Reimers, Judith Eckle-Kohler, Carsten Schnober, Jungi Kim, Iryna Gurevych. Konvens 2014. [pdf] [software]
2013
- Robust Hash Algorithms for Text
Martin Steinebach, Peter Klöckner, Nils Reimers, Dominik Wienand, Patrick Wolf. Communications and Multimedia Security, Volume 8099. [pdf]
- Computing on Authenticated Data for Adjustable Predicates
Björn Deiseroth, Victoria Fehr, Marc Fischlin, Manuel Maasz, Nils Reimers, Richard Stein. ACNS 2013. [pdf]
Software
- 2021 – GPL
Training procedure to adapt dense retrieval models to new domains. [Github] - 2021 – BEIR
Evaluation framework for out-of-domain Information Retrieval. [Github] - 2021 – EasyNMT
Framework for easy-to-use, state-of-the-art Neural Machine Translation. [Github] - 2019 – Sentence Transformers
Framework for generation of state-of-the-art sentence embeddings using transformer networks (BERT, RoBERTa, XLM-R). [Github] - 2019 – Argument Classification and Clustering
Using BERT to classify sentential arguments and to cluster them based on aspects. [Github] - 2018 – ELMo-BiLSTM-CNN-CRF Network for Sequence Classification
Extension of the BiLSTM-CNN-CRF network for sequence classification using contextualized word embeddings (ELMo embeddings). [Github] - 2017 – BiLSTM-CNN-CRF Network for Sequence Classification
A Python implementation of the state-of-the-art BiLSTM-CNN-CRF network for sequence classification. Up to 10 times faster than comparable implementations. [Github] - 2017 – Event Time Extraction with a Decision Tree of Neural Classifiers
This is experimental software to temporally anchor events in time. The software uses Keras and provides methods to train own neural networks. [Github] - 2016 – Language independent truecaser in Python
Truecasing is the process of restoring the correct capitalization of a text. This software creates a statistical language model to derive the correct casing for an input text. [Github] - 2015 – Event Nugget Detection using Deep Neural Networks
This is experimental software to extract events from English text. The software uses Theano and provides methods to train own neural networks as well as pre-trained models that can be applied out of the box. [Github] - 2015 – DARIAH-DKPro-Wrapper
The DARIAH-DKPro-Wrapper is an easy-to-use tool to extract basic linguistic information from text like Part-of-Speach, Named Entites, and syntax trees. Text as well as XML input files can be processed and the lingustic annotations are written to an easy to process file format. [Github]
Lectures
- Talks: I give on a regular basis talks at various conferences, summer schools, workshops and at research institutes.
- Deep Learning for NLP
In October 2015 I gave a lecture on Deep Learning and how it can be used in NLP. In November 2016, I gave a 1-day seminar on this topic at the University of Duisburg-Essen. The slides, videos and supplementary code can be found on Github. In the sommer semester 2018 I gave this lecture at the TU Darmstadt. - Foundations of Language Technology
In the winter semester 2014/2015 I created and supervised the exercise for the lecture ‘Foundations of Language Technology’, where the students learned the basic principles of NLP.
Top Repositories
15550
2261
2010
1653
1182
Selected Publications
- GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval. 2021. [pdf] [git]
- BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models. 2021. [pdf] [git]
- Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks. 2020. [pdf] [git]
- Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation. 2020. [pdf] [git]
- Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. 2019. [pdf] [git]