Florian Zimmermeister's picture

Florian Zimmermeister

flozi00

·

AI & ML interests

ASR, German LLM

Organizations

$A\\Ware's profile picture$

flozi00's activity

upvoted a paper about 6 hours ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 1 day ago • 72

upvoted a paper 2 days ago

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Paper • 2407.08296 • Published Jul 11 • 31

upvoted a collection 11 days ago

INT4 LLMs for vLLM

Accurate INT4 quantized models by Neural Magic, ready for use with vLLM! • 18 items • Updated 21 days ago • 5

upvoted a collection 25 days ago

FP8 LLMs for vLLM

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 37 items • Updated 24 days ago • 51

upvoted a collection about 2 months ago

Research projects on top of vLLM

Papers cited in https://blog.vllm.ai/2024/07/25/lfai-perf.html • 6 items • Updated Jul 29 • 12

upvoted an article 2 months ago

Article

The Rise of Agentic Data Generation

By

•

Jul 15

• 74

upvoted an article 4 months ago

Article

Benchmarking Text Generation Inference

May 29

• 26

upvoted a collection 5 months ago

AQLM

AQLM quantized LLMs • 20 items • Updated May 3 • 41

upvoted an article 5 months ago

Article

Inference for PROs

Sep 22, 2023

• 39

upvoted a collection 5 months ago

DPO datasets for DE

A collection of DPO datasets for the DE language. • 6 items • Updated Apr 15 • 1

upvoted a collection 7 months ago

Tower

Model weights and SFT data for Tower. • 10 items • Updated 16 days ago • 23

upvoted a paper 7 months ago

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27 • 23

upvoted a paper 8 months ago

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens

Paper • 2401.17377 • Published Jan 30 • 34

upvoted a paper 11 months ago

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 56

upvoted 4 papers about 1 year ago

ModuleFormer: Learning Modular Large Language Models From Uncurated Data

Paper • 2306.04640 • Published Jun 7, 2023 • 7

Self-Alignment with Instruction Backtranslation

Paper • 2308.06259 • Published Aug 11, 2023 • 40

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 170

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper • 2307.01952 • Published Jul 4, 2023 • 80

upvoted a paper over 1 year ago

TART: A plug-and-play Transformer module for task-agnostic reasoning

Paper • 2306.07536 • Published Jun 13, 2023 • 11