"mlbench — ML benchmarking utils" by Amir Ziai. Simple utilities and visualizations to help with doing the right thing when you’re running ML experiments. (Aug 19, 2024)
"Tuning-Free Longer Context Lengths For LLMs — A Review of Self-Extend (LLM Maybe LongLM)" by Bhavin Jawade in TDS Archive. A simple strategy to enable LLMs to consume longer context length inputs during inference without the need for finetuning. (Jan 4, 2024)
"Temporal Graph Learning in 2024" by Shenyang (Andy) Huang in TDS Archive. Continuing the journey for evolving networks. (Jan 18, 2024)
"Supervised Fine-Tuning (SFT) with Large Language Models" by Cameron R. Wolfe, Ph.D. in TDS Archive. Understanding how SFT works from idea to a working implementation… (Jan 16, 2024)
"Building, Evaluating and Tracking a Local Advanced RAG System | Mistral 7b + LlamaIndex + W&B" by Nikita Kiselov in TDS Archive. Learn how to build, evaluate, and track an advanced RAG system using a local Mistral-7b, LlamaIndex, and W&B. A step-by-step guide with code. (Jan 19, 2024)
"Weekly AI and NLP News — January 15th 2024" by Fabio Chiusano in Generative AI. OpenAI releases the GPT Store, OpenAI vs. The New York Times, and the Mixtral-of-Experts paper. (Jan 15, 2024)
"Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference" by Wenqi Glantz in TDS Archive. A deep dive into model quantization with GGUF and llama.cpp, and model evaluation with LlamaIndex. (Jan 15, 2024)
"Run Mixtral-8x7B on Consumer Hardware with Expert Offloading" by Benjamin Marie in TDS Archive. Finding the right trade-off between memory usage and inference speed. (Jan 11, 2024)
"A Detailed Explanation of Mixtral 8x7B Model" by Florian June in Towards AI. Including principles, diagrams, and code analysis. (Jan 9, 2024)
"Mamba: Can it replace Transformers?" by Vishal Rajput in AIGuys. Solving the quadratic scaling problem of Self-Attention. (Jan 8, 2024)
"Tiny-Vicuna-1B is the lightweight champion of the Tiny Models" by Fabio Matricardi in Stackademic. Command and Conquer: the smallest Vicuna flavor is the Tiny Master of Instruction and answers your every call (flawlessly!). (Jan 11, 2024)
"FireAttention — Serving Open Source Models 4x faster than vLLM by quantizing with ~no tradeoffs" by Fireworks.ai. (Jan 8, 2024)
"How Large Language Models Work" by Andreas Stöffelbauer in Data Science at Microsoft. From zero to ChatGPT. (Oct 24, 2023)
"Tabula rasa: Could We Have a Transformer for Tabular Data?" by Salvatore Raieli in Level Up Coding. We are using large language models for everything, so why not for tabular data? (Jan 9, 2024)
"Can LLMs Replace Data Analysts? Learning to Collaborate" by Mariya Mansurova in TDS Archive. Part 3: Teaching the LLM agent to pose and address clarifying questions. (Jan 9, 2024)
"LLM Training: A Simple 3-Step Guide You Won’t Find Anywhere Else!" by Mastering LLM (Large Language Model). Discover How Language Models are Trained in 3 Easy Steps. (Oct 1, 2023)
"ExLlamaV2: The Fastest Library to Run LLMs" by Maxime Labonne in TDS Archive. Quantize and run EXL2 models. (Nov 20, 2023)
"Fine-tune a Mistral-7b model with Direct Preference Optimization" by Maxime Labonne in TDS Archive. Boost the performance of your supervised fine-tuned models. (Jan 1, 2024)