Article ⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2 By burtenshaw • Jun 3 • 26
Learning to Move Like Professional Counter-Strike Players Paper • 2408.13934 • Published 25 days ago • 21
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published 28 days ago • 109
Article Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models By isidentical • 24 days ago • 34
Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference Paper • 2202.10408 • Published Feb 21, 2022 • 5
Article Building DoRA Support for Embedding Layers in PEFT By ariG23498 • 27 days ago • 10
Article Using Writer Framework with Hugging Face Spaces By samjulien • about 1 month ago • 30
Article dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified By chansung • 29 days ago • 12
Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14 • 40
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities Paper • 2305.11172 • Published May 18, 2023 • 1
Article The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models Jan 29 • 12
Parler-TTS: fully open-source high-quality TTS Collection If you want to find out more about how these models were trained and even fine-tune them yourself, check out the Parler-TTS repository on GitHub. • 7 items • Updated Aug 8 • 40
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models Paper • 2408.02718 • Published Aug 5 • 60
Article Querying Datasets with the Datasets Explorer Chrome Extension By cfahlgren1 • Jul 19 • 6
BRAG-v0.1 Collection BRAG is a series of SLMs (Small Language Models) specifically trained for RAG tasks. We release models in three sizes: 1.5B, 7B, and 8B. • 4 items • Updated Aug 4 • 12
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA May 24, 2023 • 79
Gemma 2: Improving Open Language Models at a Practical Size Paper • 2408.00118 • Published Jul 31 • 73
Bilateral Reference for High-Resolution Dichotomous Image Segmentation Paper • 2401.03407 • Published Jan 7 • 1
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement Paper • 2408.00653 • Published Aug 1 • 27
Article BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18 • 35
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data Paper • 2401.10891 • Published Jan 19 • 58
Article How to train a new language model from scratch using Transformers and Tokenizers Feb 14, 2020 • 16
Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 By dvilasuero • Jul 30 • 31
Article Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚 By Isayoften • Jul 10 • 31
🍃 MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 49