LLM News and Articles

15 of 100
Monday, 2024-10-14
18:06From Feature Flags to Prompt Flags
17:59Building a ReAct Agent from Scratch: A Beginner’s Guide:
16:52Chain of Thought and Tree of Thoughts: Solving the Shortest Path Problem Using Tree of Thoughts.
16:02Reducing Hallucinations by 95% with Memory Tuning
15:46Show HN: I made a macOS app to support Anthropic Claude
15:39How to Achieve Artificial Superintelligence
15:36Fine-Tuning LLM with LoRA for Effective Tool Selection in AI Agents
15:31Agents Routines & Hand offs, How to build them Intuitively
15:21Differential Transformer Explained: What is it and How Does It Work?
15:20AlphaCodium outperforms direct prompting of OpenAI's o1 on coding problems
15:07Text Splitting in LangChain: A Component of the RAG System
15:01What is RAG and RIG? Key Concepts of Recent Generative AI
14:50Deploying your LLM Project with GPU Support Using Docker and Docker Compose
14:49How To Run Ollama Models in Colab
14:39Demystifying Generative AI: How Does It Actually Work?
14:32OpenAI DevDay 2024: What ChatGPT Users Want
14:09ToolGen: framework that unifies tool retrieval and execution in LLMs for scalable and efficient AI…
14:02Building ElevateCV: A Dynamic Resume Builder with React and Flask
13:57Areas of Research in the LLM Field
13:55An Introduction to LLM Research
13:53Understanding TF-IDF and c-TF-IDF in Topic Modeling
13:39Entropix: Sampling Techniques for Maximizing Inference Performance
13:32Authorship Attribution: Why Identifying Who Wrote What is More Important Than Ever in the Age of…
13:31Building Multi AI Agent Systems: A Comprehensive Guide!
13:13Show HN: Microagent, a fork of OpenAI Swarm that supports Groq and Anthropic
12:50Language Model Categorisation
12:43How Do Businesses Successfully Scale LLM Solutions from Development to Deployment?
12:25Introduction to Power BI Front-End and Back-End: A Deep Dive
11:43Chat GPT is Bad at Math | Philip Okoampah Kwaning
11:21Building Production-Ready AI Agents with LangGraph: A Real-Life Use Case
11:06LightRAG the Cross breed of NavieRag and GraghRAG
10:11How to Create an Agriculture Chatbot Using Gemini API
09:57Top Open-Source AI Chatbot Tools for 2024–2025
09:47Simple RAG with Langchain, Google Gemini, and FAISS Vector Database
09:35How Do Customized Large Language Models Enhance Business Performance?
09:29How to Test the Phi-3.5 Model from Hugging Face on Google Colab
08:44How to Improve Search with LLMs
08:41How Google Missed the AI Boom and Let OpenAI Dominate
08:39Building a Multi-Agent AI System with Temporal.io:
08:39Unleashing LLM’s Self-Awareness: How SEAKR Enhances Knowledge Retrieval in RA
08:26Implementing a Retrieval-Augmented Generation (RAG) Model with OpenAI LLM
08:20Talk @ AWS Telco hackathon, Dallas, TX (09/2024)
08:12Fast Llama inference in pure, modern Java
08:11Attention Mechanism in LLMs: An Intuitive Explanation
08:01Build Your Own Private PDF Search Tool
08:015 Machine Learning Myths
07:56How to Run Your Own Local LLM: Updated for 2024 — Version 2
07:55Proof of current (LLMs) SOTA models fails to do general reasoning which isn’t on the internet.
07:51Multi-Headed Attention in BERT
07:40How Transformers Work: A Detailed Exploration of Transformer Architecture
07:30Fine Tuning Google Gemma: Enhancing LLMs with Customized Instructions
07:25The Road to AGI: Why Abstraction, Not Just Scaling Models, Is the Key
07:16AGI — homosapienslərin əlçat(an?)maz arzusu
07:13Fine-Tuning SAM 2 on a Custom Dataset
07:05Speculative RAG Implementation With Transformers
06:50Phi-3 Tutorial: Hands-On With Microsoft’s Smallest AI Model
06:35Fine-Tuning Phi-3.5 on E-Commerce Classification Dataset
06:30Exploring Chat Models with LangChain
06:16NVIDIA se lance dans les LLMs
04:32OpenR: An Open-Source AI Framework Enhancing Reasoning in Large Language Models
04:12Power BI: The Gateway to Advanced Analytics and Machine Learning
04:03NVIDIA AI Researchers Explore Upcycling Large Language Models into Sparse Mixture-of-Experts
04:02Beyond “Hello World” — A Race to The Future of Generative AI
03:56How LangChain and LlamaIndex Maintain Context
03:50Don’t Ever Drop the First Token. Here’s Why.
03:31What will happen if a big tech-based company hires a senior software developer using LLM and AI in…
03:04Unleashing the Power of Large Language Models: My Journey with LLMs
02:13“Large language models (LLMs) are beginning to revolutionize the way we work.”
01:54Liquid Foundation Models (LFMs): A Simple Explanation
01:27Trio: A browser-based LLM that runs locally to create a 3-step task workflow
01:21Llama 405B 506 tokens/second on an H200
00:51A Comprehensive Guide to Effective Methods for Fine-Tuning Large Language Models
00:37Satisfaction Scores of Generative AI Apps Based on Real-Life Questions
00:32Unlocking the Power of Retrieval-Augmented Generation (RAG) with Large Language Models (LLMs)
00:32The Multimodal Generative AI Revolution
00:041000 Days of Learning AI & ML Challenge
Sunday, 2024-10-13
23:39Understanding the Limitations of Mathematical Reasoning in Large Language Models
23:26OpenAI's AI-adjusted earnings numbers have echoes of Groupon and WeWork
23:10Can Editing LLMs Inject Harm? A Deep Dive into New Safety Threats
22:15Generative AI On Android — Gemini Nano | Part I
22:13Generative AI On Android — Gemini Nano | Part II
22:13Beauty, the Last Bastion
21:36EligereAI — Technical Breakdown, Background
21:22Use Prolog to improve LLM's reasoning
21:05Building, Customizing, Training, and Deploying LLMs with Ollama
20:50[Weekend Read] KnowPhish: LLMs Meet Multimodal KGs for Enhancing RBPDs
20:43Building Next-Gen Apps with LLMs: A Practical Guide with LangChain
20:36Mathematical Foundations of Large Language Models
20:16Understanding Causal Model Induction in Neural Networks for Interpretability
20:04A Note on Supercharging Your RAG System
19:58OpenAI Swarm: A Lightweight Framework for Multi-Agent Orchestration
19:58AgentKit, A Lightweight Multi-Agent Framework for Creating Complex Apps
19:57AI-Agent Consensus Framework: Reducing Bias and Improving Accuracy in Generative AI
19:47An Introduction to NLP and LLMs in the Age of AI
19:20Small Language Models: Innovations, Applications, and Challenges
19:17An LLM TDD Loop
19:11Understanding Causal and Masked Language Models: How Scaling Laws Impact Their Power
18:48Building an AI-Powered Retrieval System for Alphabet’s Earnings Reports and Conference Call…
18:40Fine Tuning Llama 3.2 11B for Question Answering
18:33Breaking Down AI Agentic Patterns in AutoGen
15 of 100
Was this helpful?
Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072803