StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation Paper • 2409.12576 • Published about 21 hours ago • 5
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published about 13 hours ago • 3
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 8 days ago • 53
FACT: Learning Governing Abstractions Behind Integer Sequences Paper • 2209.09543 • Published Sep 20, 2022 • 2
LLaMA-Omni: Seamless Speech Interaction with Large Language Models Paper • 2409.06666 • Published 10 days ago • 51
Writing in the Margins: Better Inference Pattern for Long Context Retrieval Paper • 2408.14906 • Published 24 days ago • 137
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models Paper • 2408.15518 • Published 23 days ago • 41
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper • 2408.03314 • Published Aug 6 • 30
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data Paper • 2408.10119 • Published Aug 19 • 15
🧠 Abliteration Collection Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 7 items • Updated Aug 17 • 12
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12 • 114
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2 Paper • 2408.05147 • Published Aug 9 • 36
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters Paper • 2408.04093 • Published Aug 7 • 4
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models Paper • 2408.00735 • Published Aug 1 • 15
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning Paper • 2407.18248 • Published Jul 25 • 30
Adaptive Retrieval-Augmented Generation for Conversational Systems Paper • 2407.21712 • Published Jul 31 • 2
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 8 items • Updated Jul 31 • 32
Bad Data Toolbox Collection PleIAs collection of models for the data processing of challenging document and data sources. • 5 items • Updated Jul 18 • 10
Research projects on top of vLLM Collection Papers cited in https://blog.vllm.ai/2024/07/25/lfai-perf.html • 6 items • Updated Jul 29 • 12
Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models Paper • 2406.11736 • Published Jun 17 • 4
Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models Paper • 2311.09278 • Published Nov 15, 2023 • 7
🍃 MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 49
Llama 3.1 Collection This collection hosts the transformers and original repos of the Meta Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Aug 2 • 570
Stateful Memory-Augmented Transformers for Dialogue Modeling Paper • 2209.07634 • Published Sep 15, 2022 • 1
To Compress or Not to Compress- Self-Supervised Learning and Information Theory: A Review Paper • 2304.09355 • Published Apr 19, 2023 • 5
Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need Paper • 2303.15256 • Published Mar 27, 2023 • 1
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone Paper • 2206.07643 • Published Jun 15, 2022 • 1
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients Paper • 2407.08296 • Published Jul 11 • 31
E5-V: Universal Embeddings with Multimodal Large Language Models Paper • 2407.12580 • Published Jul 17 • 38
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Paper • 2405.18952 • Published May 29 • 10
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5 • 85