LLM News and Articles

195 of 100
Sunday, 2024-07-14
13:37CURLoRA: Stable LLM Fine-Tuning and Catastrophic Forgetting Mitigation
13:14Deciphering AI: Leveraging Sparse Autoencoders for Enhanced Model Interpretability
13:07ChatGPT Effective promting techniques
13:01Craft Assistant on Commercialized Religion
12:25The Impact of Generative AI on Human Creativity: A Writer’s Perspective
12:24Retrieval-Augmented Generation (RAG) nedir? Nerelerde kullanılır?
12:20A beginner's guide to LLM quantization and testing
12:14GraphRAG(Graphs + Retrieval Augmented Generation): Unlocking LLM Discovery on Narrative Private…
11:49Create Markdown from a text prompt using Anthropic’s API
11:38Sizing Large Language Models: A T-Shirt Size Approach
11:20Three layers of context for useful AI
11:15Arena Learning: Transforming Post-Training of Large Language Models with AI-Powered Simulated Battles for Enhanced Efficiency and Performance in Natural Language Processing
11:05Learn Custom LLMs: Tutorial to Develop an LLM for Translating English to Punjabi
11:00Metron: A Holistic AI Framework for Evaluating User-Facing Performance in LLM Inference Systems
10:53AI paper this in this week!
09:22OpenAI whistleblowers ask SEC to investigate alleged restrictive NDAs
09:15Optimizing Large Language Models (LLMs) on CPUs: Techniques for Enhanced Inference and Efficiency
09:02Retrieval augmented Agents RAA // Advanced RAG + Agents == Better Agents
08:38Understanding LLM — Large Language Models
08:24Why Meta-Llama-3–8B Runs Faster on GPU vs. CPU: A Deep Dive into Gaianet Node Performance
08:21Dialogue with Claude 3
07:42Practical GenAI
07:39Advanced RAG: Embedded Tables
07:29The Transformative Impact of Large Language Models on DevOps
07:15FBI-LLM (Fully BInarized Large Language Model): An AI Framework Using Autoregressive Distillation for 1-bit Weight Binarization of LLMs from Scratch
07:00Unlocking the Power of Large Language Models: Parameter-Efficient Fine-Tuning Advance Techniques…
06:47RAG: Prototype to Production
06:24Enhancing LLM Reliability: The Lookback Lens Approach to Hallucination Detection
05:59A study on Attention mechanism
05:19Let’s explore ScrapeGraphAI
05:15Korvus: An All-in-One Open-Source RAG (Retrieval-Augmented Generation) Pipeline Built for Postgres
04:22Mooncake Paper on LLM Serving
03:51Q-GaLore Released: A Memory-Efficient Training Approach for Pre-Training and Fine-Tuning Machine Learning Models
03:06Speculative RAG: enhancing RAG with multiple drafts generation and verification
01:50The Illusion of Transparency: Why Big AI Companies Will Never Offer Uncensored AI Models
01:49The lean machine: crafting production-grade user intent detection and content moderation AI with…
01:305 Levels in AI by OpenAI: A Roadmap to Human-Level Problem Solving Capabilities
01:08Coffee Time Papers: Mixture of a Million Experts
01:05Effective Practices for Mocking LLM Responses During the Software Development Lifecycle
01:04The Dawn of a New Era in AI: NVIDIA’s Megatron-Turing NLG Redefines Language Processing
Saturday, 2024-07-13
22:24Natural Language Processing Glossary (Part I)
22:05It's an open secret that OpenAI is trying to IPO soon
21:37[DE]Vergleich der bedeutendsten Large Language Models (LLMs) im Juli 2024
21:36How Have Pre-Training Datasets for Large Language Models Evolved?
21:13THE LLM SHOWDOWN IN MOUNTAIN VIEW
20:56Let’s Build a Sample Chat Agent with Python and LangChain Part One 1 (Data to JSON)
20:34To Code or Not To Code
19:49AI tools for Design & Verification
19:46OpenAI Researcher Says He Quit When He Realized the Upsetting Truth
19:22How to use Mixture-of-Agents in your favorite Application
18:55Running LLM Models Locally: A Secure and Private Option for AI
18:45Three Practical Challenges of RAG and Their Mitigation Ideas
18:42NER, identificando nomes em dados textuais: Meus estudos em spaCy e NLP — Parte 5
18:28What is an LLM?
18:21Large Language Model: from pretrained to instructed one.
18:15Understanding and Mitigating Hallucinations in Large Language Models (LLMs)
17:51Whistleblowers accuse OpenAI of 'illegally restrictive' NDAs
17:51QuickRead Mixture of Agents: Achieving State-of-the-Art Performance with Collaborative LLMs
17:42Exploring DoRA: Improving on LoRA’s Parameter-Efficient Fine-Tuning
17:38✨QuickRead✨ Enhancing Retrieval-Augmented Generation: Exploring Modular RAG Innovations
17:20Latest Types of RAG
17:02OpenAI anticipates decrease in AI model costs amid adoption surge
16:58Inside Prompt Engineering: Demystifying Technical Intricacies
16:50Running LLMs Locally in Salesforce Experience Cloud using picoLLM Inference Engine SDK
16:35Breaking News: Meta Unveils MobileLLM, a Sub-Billion Parameter Language Model Transforming…
15:35Enhancing SQL Generation in Large Language Models with Graph Neural Networks
14:38RAG: Key Aspects of Performance: Metrics and Measurement
14:10Caching Out with Gemini: Making AI Chat Less Taxing (on Your Wallet)
14:07My Attempt at a Tree-View Hierarchical Summarizer to Read with AI
14:01Top Important LLMs Papers for the Week from 01/07 to 07/07
13:59Whose fault is it? Measuring Incoherence of Large Language Models
13:25Why you should outsource your agentic infrastructure, but own your cognitive architecture
13:13The Evolution of Large Language Models on OpenAI models' example
12:34Building blocks of Gen AI Applications in LLM/SLM
12:26CSV Analysis Visualization with LLMs
12:24Classifying Wikipedia articles using GPT 3.5 Turbo
11:29MHA vs MQA vs GQA vs MLA
11:20Linear Rope vs NTK vs YaRN vs CoPE
10:32The Ultimate Guide to Getting Started with Bloom LLM
10:06Show HN: Math.bot – Free, instant math problem solver powered by GPT-4
10:02Comparative Analysis of Fine-Tuning LLaMA 2 and LLaMA 3 Models
09:45Unveiling the Magic: How Large Language Models Work
09:33Yapay Zeka : Büyük Umutlar Bağladık ama Beklentiler Gerçekçi mi?
09:32What is Einstein Trust Layer?
09:15Researchers at Stanford Introduces In-Context Vectors (ICV): A Scalable and Efficient AI Approach for Fine-Tuning Large Language Models
09:10Ex-OpenAI staff call for "right to warn" about AI risks without retaliation
08:43Direct Documentation I: A Look Inside a Source Transmission
08:22Understanding LLM Routers: A Magical Mail Sorting System for Robots
07:41Beyond Chatbots: How LLMs Are Reshaping Industrial
07:31Use agents to write release note in Agent ChatRoom
07:15Can LLMs Help Accelerate the Discovery of Data-Driven Scientific Hypotheses? Meet DiscoveryBench: A Comprehensive LLM Benchmark that Formalizes the Multi-Step Process of Data-Driven Discovery
07:14Outlines: Make LLM structured outputs controllable and improve the stability of LLM applications
07:01The Concern of Privacy with LLMs
06:49OpenAI Scale Ranks Progress Toward 'Human-Level' Problem Solving
05:24Analyzing Trump - Biden debate using AI — Claude Sonnet 3.5
05:20Thoughts on LangChain
05:00Building AI Applications with ChatGPT APIs by Martin Yanev
04:36The Agentic Concept in LLM-based Application Development
04:09Azure OpenAI down in multiple regions
03:23Visualizing Low-Rank Adaptation (LoRA)
195 of 100
Was this helpful?
Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072803