Call for Participants: EMNLP 2024 BoF on Embeddings, Reranker & Small LMs for Better Search

At EMNLP 2024 Miami? Join us for a Birds of a Feather session focusing on embeddings, rerankers, and small LMs for better search.

Event poster for "Embedding Reranker, Small LM & Better Search" on Nov 14, 2024, from 10:30 to 12:00 at Miami Lecture Hall. F

Join us on November 14, 2024, from 10:30 AM to 12:00 PM (Miami Time) for a Birds of a Feather (BoF) session focusing on embeddings, rerankers, and small LMs for better search at EMNLP 2024 in Miami. Following the success of our Embeddings BoF at EMNLP 2023 in Singapore, this 1.5-hour in-person session will bring together researchers specializing in embedding models, rerankers, and various topics in information retrieval. The session will feature presentations and a panel discussion, offering an excellent opportunity to explore recent advances in search foundation models, share your work with a specialized audience, and discuss emerging trends in search models in 2024. All EMNLP on-site participants are welcome to attend.

Event Details

  • Date: Thursday, November 14, 2024
  • Time: 10:30 AM - 12:00 PM (Miami Time)
  • Location: Miami Lecture Hall
  • Format: In-person

Registration

EMNLP 2024/BoF: Embeddings, Reranker, Small LMs for Better Search
What: A BoF (Birds of a Feather) in-person session on Embeddings, Reranker and Search LMs, co-organized by EMNLP PC and Jina AI. When & Where: Date: Thursday, November 14, 2024 Time: 10:30 AM - 12:00 PM (Miami Time) Location: Miami Lecture Hall Highlights: multimodal embeddings, long context embeddings, multi- and bilingual embeddings, embeddings for low-resource languages, applications of embeddings, and the interplay between embeddings and prompts. Participation: The session is open to everyone; you can simply walk in to participate. If you’re interested in contributing to this BoF, we welcome you to fill in the details below.

While general attendance registration is not mandatory, we strongly encourage participants to register via our Google Form to help us better organize the session. If you would like to present your work (10-minute slot) or participate as a panelist in this BoF session, please indicate this in the registration form. This is an excellent opportunity to share your research with experts in the field of embeddings and search models.

Topics of Interest

  • Multimodal, multilingual, cross-lingual, and cross-modal embeddings and rerankers
  • Late-interaction models (e.g., ColBERT, ColPali) and late chunking
  • Long-context embedding models
  • Instruction-tuning for embedding and reranker models
  • LLM-based embedding and reranker models
  • Small language models for document reading
  • Efficient and lightweight embedding architectures, attention mechanisms
  • Zero/few-shot retrieval and adaptation methods
  • Contrastive learning approaches for retriever
  • Matryoshka representation learning, embedding compression and quantization
  • Hybrid sparse-dense retrieval systems
  • Task-LoRA, domain adaptation, OOD, fine-tuning for embedding models
  • MTEB, LongMTEB, RAG evaluation metrics and benchmarks
  • Embedding models for code, structured data and time series
  • Privacy-preserving embedding techniques