Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 1 day ago • 134
jina-embeddings-v3 Collection Multilingual multi-task general text embedding model • 6 items • Updated about 12 hours ago • 4
Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources Paper • 2409.08239 • Published 7 days ago • 15
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts? Paper • 2409.07703 • Published 8 days ago • 58
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper • 2409.08264 • Published 7 days ago • 39
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers Paper • 2409.04109 • Published 14 days ago • 37
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications Paper • 2409.07314 • Published 8 days ago • 49
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 8 days ago • 53
LLaMA-Omni: Seamless Speech Interaction with Large Language Models Paper • 2409.06666 • Published 9 days ago • 51
OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs Paper • 2409.05152 • Published 11 days ago • 27
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks Paper • 1908.10084 • Published Aug 27, 2019 • 4
OLMoE Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated 5 days ago • 18
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct Paper • 2409.05840 • Published 10 days ago • 43
Towards a Unified View of Preference Learning for Large Language Models: A Survey Paper • 2409.02795 • Published 15 days ago • 70
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation Paper • 2409.04410 • Published 13 days ago • 23
Building Math Agents with Multi-Turn Iterative Preference Learning Paper • 2409.02392 • Published 16 days ago • 14
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing Paper • 2409.01322 • Published 17 days ago • 94
From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents Paper • 2409.03512 • Published 14 days ago • 25
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark Paper • 2409.02813 • Published 15 days ago • 27
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture Paper • 2409.02889 • Published 15 days ago • 53
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos Paper • 2409.02095 • Published 16 days ago • 32
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming Paper • 2408.16725 • Published 21 days ago • 49
Large-Scale Multi-omic Biosequence Transformers for Modeling Peptide-Nucleotide Interactions Paper • 2408.16245 • Published 22 days ago • 4
VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images Paper • 2408.16176 • Published 22 days ago • 7
SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section Paper • 2408.16444 • Published 22 days ago • 7
InkubaLM: A small language model for low-resource African languages Paper • 2408.17024 • Published 21 days ago • 10
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Paper • 2408.16532 • Published 21 days ago • 44
MemLong: Memory-Augmented Retrieval for Long Text Modeling Paper • 2408.16967 • Published 21 days ago • 1
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA Paper • 2409.02897 • Published 15 days ago • 42
Strategic Chain-of-Thought: Guiding Accurate Reasoning in LLMs through Strategy Elicitation Paper • 2409.03271 • Published 15 days ago • 2
Power-LM Collection Dense & MoE LLMs trained with power learning rate scheduler. • 3 items • Updated 8 days ago • 13
Article Selective fine-tuning of Language Models with Spectrum By anakin87 • 17 days ago • 25
Mini-MOEs - Mixture of Experts 2x, x4 and x8 Collection Tiny but mighty. 1B, 1.1B, 2B (2x,x4,x8) MOE models. Suggest Q8 version, and review of original model page for template (!), usage & help. • 31 items • Updated Aug 9 • 4
CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization Paper • 2408.15914 • Published 22 days ago • 21
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding Paper • 2408.15545 • Published 23 days ago • 32
SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners Paper • 2408.16768 • Published 21 days ago • 25
CogVLM2: Visual Language Models for Image and Video Understanding Paper • 2408.16500 • Published 21 days ago • 55
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models Paper • 2408.15518 • Published 23 days ago • 41