Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality • arXiv:2405.21060 • Published May 31, 2024
Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting • arXiv:2404.18911 • Published Apr 29, 2024
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits • arXiv:2402.17764 • Published Feb 27, 2024
GPTVQ: The Blessing of Dimensionality for LLM Quantization • arXiv:2402.15319 • Published Feb 23, 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty • arXiv:2401.15077 • Published Jan 26, 2024
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design • arXiv:2401.14112 • Published Jan 25, 2024
The Impact of Reasoning Step Length on Large Language Models • arXiv:2401.04925 • Published Jan 10, 2024
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models • arXiv:2401.04658 • Published Jan 9, 2024
Masked Audio Generation using a Single Non-Autoregressive Transformer • arXiv:2401.04577 • Published Jan 9, 2024
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts • arXiv:2401.04081 • Published Jan 8, 2024