rbiswasfc (Raja Biswas)

upvoted an article 1 day ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

2 days ago

• 100

upvoted a paper 15 days ago

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Paper • 2408.15545 • Published 23 days ago • 32

upvoted an article 18 days ago

Article

The 5 Most Under-Rated Tools on Hugging Face

29 days ago

• 74

upvoted a paper 24 days ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published 29 days ago • 53

upvoted an article about 1 month ago

Article

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

Aug 22, 2023

• 24

upvoted a paper about 1 month ago

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Paper • 2408.06941 • Published Aug 13 • 28

upvoted an article about 1 month ago

Article

Tool Use, Unified

Aug 12

• 49

upvoted a paper about 1 month ago

ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities

Paper • 2408.04682 • Published Aug 8 • 14

upvoted a collection about 2 months ago

Probably function calling datasets

Collection

Created using the https://huggingface.co./spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 35

upvoted a paper 2 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 153

upvoted 2 collections 2 months ago

xLAM models

Collection

xLAM: A Family of Large Action Models to Empower AI Agent Systems • 9 items • Updated 11 days ago • 40

H2O Danube3

Collection

6 items • Updated Jul 16 • 51

upvoted 2 papers 3 months ago

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Paper • 2407.01370 • Published Jul 1 • 84

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Paper • 2406.07522 • Published Jun 11 • 36

upvoted a paper 4 months ago

From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

Paper • 2309.04269 • Published Sep 8, 2023 • 32

upvoted 3 articles 4 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28

• 146

Article

⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2

By

•

Jun 3

• 26

Article

Benchmarking Text Generation Inference

May 29

• 26

upvoted 2 papers 4 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15 • 86

SUTRA: Scalable Multilingual Language Model Architecture

Paper • 2405.06694 • Published May 7 • 37

upvoted 2 articles 5 months ago

Article

Fine-tuning Llama 2 70B using PyTorch FSDP

Sep 13, 2023

• 12

Article

🧑‍⚖️ "Replacing Judges with Juries" using distilabel

By

•

May 3

• 17

upvoted 9 papers 5 months ago

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29 • 68

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 114

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22 • 124

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10 • 103

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 83

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 250

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Paper • 2404.15653 • Published Apr 24 • 26

Pre-training Small Base LMs with Fewer Tokens

Paper • 2404.08634 • Published Apr 12 • 33

upvoted an article 5 months ago

Article

CodeGemma - an official Google release for code LLMs

Apr 9

• 99

upvoted 10 papers 6 months ago

DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows

Paper • 2402.10379 • Published Feb 16 • 29

Benchmarking Large Language Models on Controllable Generation under Diversified Instructions

Paper • 2401.00690 • Published Jan 1 • 1

upvoted 2 collections 6 months ago

💫 StarCoder2

Collection

StarCoder2 models and datasets! • 8 items • Updated Mar 1 • 79

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated Jul 31 • 325

upvoted 8 papers 6 months ago

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20 • 46

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 107

Divide-or-Conquer? Which Part Should You Distill Your LLM?

Paper • 2402.15000 • Published Feb 22 • 22

Orca-Math: Unlocking the potential of SLMs in Grade School Math

Paper • 2402.14830 • Published Feb 16 • 24

Do Large Language Models Latently Perform Multi-Hop Reasoning?

Paper • 2402.16837 • Published Feb 26 • 24

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 590

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5 • 93

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7 • 61

upvoted 4 papers 8 months ago

H2O-Danube-1.8B Technical Report

Paper • 2401.16818 • Published Jan 30 • 16

Weaver: Foundation Models for Creative Writing

Paper • 2401.17268 • Published Jan 30 • 41

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31 • 59

OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1 • 78

upvoted a paper 9 months ago

LLM Augmented LLMs: Expanding Capabilities through Composition

Paper • 2401.02412 • Published Jan 4 • 36

upvoted a collection 9 months ago

Zephyr 7B

Collection

Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12 • 144

upvoted a paper 9 months ago

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

Paper • 2311.03099 • Published Nov 6, 2023 • 28

upvoted a collection 9 months ago

Awesome feedback datasets

Collection

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 64

Raja Biswas PRO

AI & ML interests

Organizations

rbiswasfc's activity

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

The 5 Most Under-Rated Tools on Hugging Face

Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model

Tool Use, Unified

Training and Finetuning Embedding Models with Sentence Transformers v3

⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2

Benchmarking Text Generation Inference

Fine-tuning Llama 2 70B using PyTorch FSDP

🧑‍⚖️ "Replacing Judges with Juries" using distilabel

CodeGemma - an official Google release for code LLMs