A Survey on Hypothesis Generation for Scientific Discovery in the Era of Large Language Models Paper • 2504.05496 • Published Apr 7, 2025 • 1
Sparks of Science: Hypothesis Generation Using Structured Paper Data Paper • 2504.12976 • Published Apr 17, 2025 • 1
Article How We Built a Semantic Highlight Model To Save Token Cost for RAG Jan 15 • 65
Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26, 2025 • 183
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 Jul 5, 2024 • 315
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 128
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers Paper • 2409.04109 • Published Sep 6, 2024 • 48
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis Paper • 2206.01062 • Published Jun 2, 2022 • 3
AAAR-1.0: Assessing AI's Potential to Assist Research Paper • 2410.22394 • Published Oct 29, 2024 • 16
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated Dec 31, 2025 • 227
Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 264
Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval Mar 22, 2024 • 128
Graph Neural Prompting with Large Language Models Paper • 2309.15427 • Published Sep 27, 2023 • 1
Matryoshka Embedding Models Collection https://huggingface.co/blog/matryoshka • 14 items • Updated May 13, 2025 • 16
Efficient Estimation of Word Representations in Vector Space Paper • 1301.3781 • Published Jan 16, 2013 • 8
Unifying Large Language Models and Knowledge Graphs: A Roadmap Paper • 2306.08302 • Published Jun 14, 2023 • 3