195 223 270

Lain

not-lain

https://not-lain.github.io

AI & ML interests

custom AI models with HF integration, multimodal rag and open-source contributions

Articles

Organizations

not-lain's activity

upvoted an article 1 day ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

2 days ago

• 100

upvoted an article 2 days ago

Article

Accelerate 1.0.0

7 days ago

• 31

upvoted an article 3 days ago

Article

Introducing Community Tools

4 days ago

• 18

upvoted an article 5 days ago

Article

⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2

•

Jun 3

• 26

upvoted an article 6 days ago

Article

"Diffusers Image Fill" guide

•

7 days ago

• 19

upvoted an article 15 days ago

Article

Hugging Face partners with TruffleHog to Scan for Secrets

16 days ago

• 9

upvoted an article 19 days ago

Article

Scaling robotics datasets with video encoding

24 days ago

• 31

upvoted an article 23 days ago

Article

Understanding Vector Quantization in VQ-VAE

•

23 days ago

• 9

upvoted 2 papers 23 days ago

Learning to Move Like Professional Counter-Strike Players

Paper • 2408.13934 • Published 25 days ago • 21

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published 28 days ago • 109

upvoted an article 24 days ago

Article

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

•

24 days ago

• 34

upvoted an article 26 days ago

Article

Introduction to 3D Gaussian Splatting

Sep 18, 2023

• 23

upvoted a paper 26 days ago

Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference

Paper • 2202.10408 • Published Feb 21, 2022 • 5

upvoted an article 27 days ago

Article

Building DoRA Support for Embedding Layers in PEFT

•

27 days ago

• 10

upvoted 2 articles 28 days ago

Article

Using Writer Framework with Hugging Face Spaces

•

about 1 month ago

• 30

Article

How to communicate in a Pull Request?

•

29 days ago

• 17

upvoted 2 articles 29 days ago

Article

The 5 Most Under-Rated Tools on Hugging Face

29 days ago

• 74

Article

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

•

29 days ago

• 12

upvoted 2 articles about 1 month ago

Article

Tensor Parallelism

•

Aug 20

• 9

Article

Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI

Aug 19

• 17

upvoted a collection about 1 month ago

Hermes 3

Collection

The Hermes 3 Series of Models • 8 items • Updated 28 days ago • 80

upvoted 5 articles about 1 month ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14

• 40

Article

The Workflow of PEFT

•

Aug 14

• 19

Article

Introduction to ggml

Aug 13

• 91

Article

Tool Use, Unified

Aug 12

• 49

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 96

upvoted 2 papers about 1 month ago

ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Paper • 2305.11172 • Published May 18, 2023 • 1

Scalable Nested Optimization for Deep Learning

Paper • 2407.01526 • Published Jul 1 • 4

upvoted 2 articles about 1 month ago

Article

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

Jan 29

• 12

Article

XetHub is joining Hugging Face!

Aug 8

• 76

upvoted a collection about 1 month ago

Parler-TTS: fully open-source high-quality TTS

Collection

If you want to find out more about how these models were trained and even fine-tune them yourself, check-out the Parler-TTS repository on GitHub. • 7 items • Updated Aug 8 • 40

upvoted 2 papers about 1 month ago

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Paper • 2408.02718 • Published Aug 5 • 60

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

upvoted 3 articles about 1 month ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 193

Article

Introducing TextImage Augmentation for Document Images

Aug 6

• 29

Article

Querying Datasets with the Datasets Explorer Chrome Extension

•

Jul 19

• 6

upvoted a collection about 2 months ago

BRAG-v0.1

Collection

BRAG is a series of SLMs (Small Language Models) specifically trained for RAG tasks. We release models with size 1.5b, 7b and 8b. • 4 items • Updated Aug 4 • 12

upvoted an article about 2 months ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 79

upvoted 2 papers about 2 months ago

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1 • 103

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 73

upvoted 2 articles about 2 months ago

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

•

Aug 25, 2023

• 17

Article

Local AI with Docker's Testcontainers

•

Aug 3

• 5

upvoted 2 papers about 2 months ago

Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Paper • 2401.03407 • Published Jan 7 • 1

SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement

Paper • 2408.00653 • Published Aug 1 • 27

upvoted 4 articles about 2 months ago

Article

Inference for PROs

Sep 22, 2023

• 39

Article

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Jun 18

• 35

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Jul 31

• 58

Article

Announcing the Hugging Face Fellowship Program

May 17, 2022

• 5

upvoted a paper about 2 months ago

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Paper • 2401.10891 • Published Jan 19 • 58

upvoted 4 articles about 2 months ago

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30

• 50

Article

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

• 16

Article

🔥 Argilla 2.0: the data-centric tool for AI makers 🤗

•

Jul 30

• 31

Article

Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚

•

Jul 10

• 31

upvoted a collection about 2 months ago

🍃 MINT-1T

Collection

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 49

upvoted 6 articles 2 months ago

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18

• 63

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Jul 18

• 42

Article

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16

• 30

Article

Don't repeat yourself - 🤗 Transformers Design Philosophy

Apr 5, 2022

• 11

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 242

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11

• 92

Lain

AI & ML interests

Articles

RAG chatbot using llama3

Image-based search engine

Train custom AI models with the trainer API and adapt them to 🤗

Custom architectures with HuggingFace 🤗

Organizations

not-lain's activity

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Accelerate 1.0.0

Introducing Community Tools

⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2

"Diffusers Image Fill" guide

Hugging Face partners with TruffleHog to Scan for Secrets

Scaling robotics datasets with video encoding

Understanding Vector Quantization in VQ-VAE

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

Introduction to 3D Gaussian Splatting

Building DoRA Support for Embedding Layers in PEFT

Using Writer Framework with Hugging Face Spaces

How to communicate in a Pull Request?

The 5 Most Under-Rated Tools on Hugging Face

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

Tensor Parallelism

Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI

A failed experiment: Infini-Attention, and why we should keep trying?

The Workflow of PEFT

Introduction to ggml

Tool Use, Unified

Welcome FalconMamba: The first strong attention-free 7B model

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

XetHub is joining Hugging Face!

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Introducing TextImage Augmentation for Document Images

Querying Datasets with the Datasets Explorer Chrome Extension

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

Local AI with Docker's Testcontainers

Inference for PROs

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Announcing the Hugging Face Fellowship Program

Memory-efficient Diffusion Transformers with Quanto and Diffusers

How to train a new language model from scratch using Transformers and Tokenizers

🔥 Argilla 2.0: the data-centric tool for AI makers 🤗

Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚

Docmatix - a huge dataset for Document Visual Question Answering

TGI Multi-LoRA: Deploy Once, Serve 30 Models

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Don't repeat yourself - 🤗 Transformers Design Philosophy

SmolLM - blazingly fast and remarkably powerful

How NuminaMath Won the 1st AIMO Progress Prize