diwank (Diwank Tomer)

upvoted a paper 42 minutes ago

StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

Paper • 2409.12576 • Published about 21 hours ago • 5

upvoted a paper 43 minutes ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published about 13 hours ago • 3

upvoted a collection 1 day ago

DataGemma Release

Collection

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 8 days ago • 53

upvoted an article 2 days ago

Article

Introducing Community Tools

4 days ago

• 18

upvoted a paper 3 days ago

Agent Workflow Memory

Paper • 2409.07429 • Published 9 days ago • 25

upvoted 8 collections 4 days ago

upvoted a paper 4 days ago

FACT: Learning Governing Abstractions Behind Integer Sequences

Paper • 2209.09543 • Published Sep 20, 2022 • 2

upvoted a collection 4 days ago

Recommendation

Collection

11 items • Updated Jan 4 • 2

upvoted a paper 9 days ago

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published 10 days ago • 51

upvoted a paper 15 days ago

Kvasir-VQA: A Text-Image Pair GI Tract Dataset

Paper • 2409.01437 • Published 17 days ago • 70

upvoted 2 papers 16 days ago

ContextCite: Attributing Model Generation to Context

Paper • 2409.00729 • Published 19 days ago • 13

FLUX that Plays Music

Paper • 2409.00587 • Published 19 days ago • 31

upvoted 3 papers 20 days ago

LLaVaOLMoBitnet1B: Ternary LLM goes Multimodal!

Paper • 2408.13402 • Published 27 days ago • 17

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published 24 days ago • 119

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published 24 days ago • 137

upvoted a paper 22 days ago

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published 23 days ago • 41

upvoted a paper 24 days ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published 29 days ago • 84

upvoted a paper 29 days ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6 • 30

upvoted a paper about 1 month ago

Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data

Paper • 2408.10119 • Published Aug 19 • 15

upvoted a collection about 1 month ago

🧠 Abliteration

Collection

Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 7 items • Updated Aug 17 • 12

upvoted 3 papers about 1 month ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 114

Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

Paper • 2408.05147 • Published Aug 9 • 36

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7 • 4

upvoted an article about 1 month ago

Article

How I train a LoRA: m3lt style training overview

By

•

Jul 1

• 45

upvoted a paper about 1 month ago

TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models

Paper • 2408.00735 • Published Aug 1 • 15

upvoted 3 papers about 2 months ago

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25 • 30

Very Large-Scale Multi-Agent Simulation in AgentScope

Paper • 2407.17789 • Published Jul 25 • 30

Adaptive Retrieval-Augmented Generation for Conversational Systems

Paper • 2407.21712 • Published Jul 31 • 2

upvoted 5 collections about 2 months ago

SigLIP

Collection

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 8 items • Updated Jul 31 • 32

Gemma 2 2B Release

Collection

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 31 • 76

IndustryCorpus

Collection

19 items • Updated about 19 hours ago • 3

Bad Data Toolbox

Collection

PleIAs collection of models for the data processing of challenging document and data sources. • 5 items • Updated Jul 18 • 10

Research projects on top of vLLM

Collection

Papers cited in https://blog.vllm.ai/2024/07/25/lfai-perf.html • 6 items • Updated Jul 29 • 12

upvoted 2 papers about 2 months ago

Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models

Paper • 2406.11736 • Published Jun 17 • 4

Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models

Paper • 2311.09278 • Published Nov 15, 2023 • 7

upvoted 2 collections about 2 months ago

🍃 MINT-1T

Collection

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 49

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Meta Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Aug 2 • 570

upvoted 12 papers 2 months ago

Stateful Memory-Augmented Transformers for Dialogue Modeling

Paper • 2209.07634 • Published Sep 15, 2022 • 1

To Compress or Not to Compress- Self-Supervised Learning and Information Theory: A Review

Paper • 2304.09355 • Published Apr 19, 2023 • 5

A Cookbook of Self-Supervised Learning

Paper • 2304.12210 • Published Apr 24, 2023 • 3

Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Paper • 2303.15256 • Published Mar 27, 2023 • 1

Augmented Language Models: a Survey

Paper • 2302.07842 • Published Feb 15, 2023 • 3

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Paper • 2206.07643 • Published Jun 15, 2022 • 1

Human-like Episodic Memory for Infinite Context LLMs

Paper • 2407.09450 • Published Jul 12 • 56

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Paper • 2407.08296 • Published Jul 11 • 31

Scaling Diffusion Transformers to 16 Billion Parameters

Paper • 2407.11633 • Published Jul 16 • 25

E5-V: Universal Embeddings with Multimodal Large Language Models

Paper • 2407.12580 • Published Jul 17 • 38

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15 • 52

Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets

Paper • 2405.18952 • Published May 29 • 10

upvoted 2 articles 2 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 242

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By

•

Jul 5

• 85

upvoted a paper 2 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 153

upvoted an article 2 months ago

Article

Vision Language Models Explained

Apr 11

• 176

Diwank Tomer PRO

AI & ML interests

Articles

CryptGPT: Privacy-Preserving Language Models Using Vigenere Cipher (Part 1)

Organizations

diwank's activity

Introducing Community Tools

How I train a LoRA: m3lt style training overview

SmolLM - blazingly fast and remarkably powerful

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Vision Language Models Explained