A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

ict-bigdatalab/awesome-pretrained-models-for-information-retrieval


awesome-pretrained-models-for-information-retrieval

A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a. pre-training for IR). If any papers are missing, feel free to open a PR to include them. Feedback and contributions are welcome!

Pre-training for IR

Survey Papers

First Stage Retrieval

Sparse Retrieval

Neural term re-weighting

Query or document expansion

Sparse representation learning

Dense Retrieval

Hard negative sampling

Late interaction and multi-vector representation

Knowledge distillation

Pre-training tailored for dense retrieval

Jointly learning retrieval and indexing

Multi-hop dense retrieval

Domain adaptation

Query reformulation

Bias

Hybrid Retrieval

Re-ranking Stage

Basic Usage

Discriminative ranking models

Representation-focused
Interaction-focused

Generative ranking models

Hybrid ranking models

Long Document Processing Techniques

Passage score aggregation

Passage representation aggregation

Designing new architectures

Improving Efficiency

Decoupling the interaction

Knowledge distillation

Partial Fine-tuning

Early exit

Other Topics

Query Expansion

Re-weighting Training Samples

Pre-training Tailored for Re-ranking

Adversarial Attack and Defence

Cross-lingual Retrieval

Jointly Learning Retrieval and Re-ranking

Model-based IR System

LLM and IR

Perspectives or Surveys

Retrieval Augmented LLM

LLM for IR

Synthetic Query Generation

Synthetic Document Generation

LLM for Relevance Scoring

LLM for Generative Retrieval

Retrieval-Augmented Text Generation

Others

Multimodal Retrieval

Unified Single-stream Architecture

Multi-stream Architecture Applied on Input

Other Resources

Some Retrieval Toolkits

Other Resources About Pre-trained Models in NLP

Surveys About Efficient Transformers