LLM News and Articles

155 of 100
Wednesday, 2024-08-28
11:25Long context prompting tips by Anthropic
11:20Show HN: PromptMage – Simplify and Manage Your LLM Workflows
11:16Automating Table of Content Extraction and Filtering in Papers with LlamaIndex (Part 2)
11:08SarcasmBench: A Comprehensive Evaluation Framework Revealing the Challenges and Performance Gaps of Large Language Models in Understanding Subtle Sarcastic Expressions
11:07PHP and LLMs — Web Scraping and Building an Events Database
11:06OpenAI Aims to Release New AI Model, 'Strawberry,' in Fall
10:45Building a Q&A Bot Using Your Website and Files — Part 1 (Extracting the Content)
09:47Leveraging AI Tools to Enhance Education: Top 10 AI Solutions
09:45Is Your SQL Database Safe? How LLM Integration Could Put Everything At Risk
09:26Harnessing LLMs for Automated Timeline Generation
09:17Threat Intelligence Report Summarization
09:07Harnessing AI for Intelligent Q&A Generation: A Deep Dive into CrewAI
08:54Optimizing RAG for Production: Leveraging Filtering and Reranking for Better Performance
08:50AI Powered Document Summarization
08:43Using an LLM for sorting a list with linear time
08:34OpenAI shows 'Strawberry' to feds, races to launch it
08:14From Context to Cognition: Advanced Prompting Strategies for Agentic AI
07:56Options to Deploy Your LLM (in increasing complexity):
07:52Karpathy on Software 2.0 (2017)
07:43The Great AI Debate: LLM Company Cloned Trump and Harris and Made Them Fight
07:29Revolutionizing Retail with AI Magic
07:04Retrieval Augmented Generation (RAG) Systems
07:04Retrieval Augmented Generation (RAG) Systems
06:51How to build an LLM Agent for Sales?
06:38What makes document search hard?
06:27A Comprehensive Guide to Fine-tuning LLMs using RLHF (Part-2)
06:27Building a Database-Driven Chatbot with LangChain and OpenAI: A Practical Approach (Part 2…
06:22The 6 Best LLM Tools To Run Models Locally
06:17An Ultimate Guide to Embeddings and Vectors: Code Included
06:11How to Train Your Language Model (In a single day with one GPU): Cramming
06:07Layman’s guide to getting what you want with LLMs: INTERS (Part-1)
06:01A Comprehensive Guide to fine-tuning LLMs using RLHF (Part-1)
05:44Enforcing JSON outputs in commercial LLMs
05:32Why Creating Your Large Language Model (LLM) Using PHP is a Bad Idea and How PHP Could Be…
05:32RAG II: Query Transformations
05:24A Comprehensive Guide to Types of Neural Network Architectures
05:04Will Function(RoutingAgent) Calling architecture replace Intent Classification?
05:01Are You Afraid of Artificial Intelligence?
04:38How to Build AI Agents from Scratch with Python
04:34AI: Reality Check
04:32Spamming "Hi" at Every LLM
04:32Humans, Large Language Models, and Lucky Number 7
04:12Creating Dynamic Mindmap of information from collaborative discourse among LLM Agents with…
03:59Enhancing Data Ingestion Testing with Generative AI: Leveraging LLM Models(part 2-Practice)
03:08Unlocking the Power of PDFs
02:54Mappify: Because Our Thoughts Don’t Travel in Straight Lines
02:53LLMs for Operational Efficiency
02:25I build this modern Tic-Tac-Toe with AI in 5 minutes
02:24LongWriter: The Next Frontier in LLMs — Unleashing 10,000+ Word Generation!
02:01Want to Chat with Documents Locally? Try This Open Source RAG Tool (Kotaemon)
01:57Altman-backed startup to test AV mass transit system in Atlanta
01:53Google’s Gemini-1.5-Pro 0827 Is Better Than GPT-4o and Claude 3.5 Sonnet Now
01:33Gemini 1.5 Flash-8B: Google’s Answer to GPT-4o-Mini Alternative?
01:31AI Coming Soon in 2025: An In-Depth Report
00:35Cerebras Launches the Fastest AI Inference
00:27Understanding Vulnerabilities in LLM
Tuesday, 2024-08-27
23:56Beyond Basic RAG: Similarity ≠ Relevance
23:53Boosting LLM Inference Speed Using Speculative Decoding
23:33Turning Heated Debates into LOLs: The Making of WhatsApp Sentinel
23:31What *is* ChatGPT if it's not a chatbot?
23:25Introduction to LLMs and the generative AI : Part 4 — Parameter efficient fine-tuning (PEFT)
22:15OpenAI Shows 'Strawberry' AI to the Feds and Uses It to Develop 'Orion'
22:03Enhancing AI performance using RAG architectures: The Art of Chunking
21:19Law Meets AI: A Deep Dive into LawLLM’s Potential in Legal Research
21:00Unit Economics of LLM APIs
20:46Create a x.com search chrome extension without writing code (yourself)
20:42Researchers from Oxford University are introducing a novel graph-based Retrieval-Augmented…
19:13Unlocking Knowledge: How AI Revolutionized My Learning Process
19:12The Evolution of Transformers: A Journey Through the Milestones of Language Models
19:10Meta Llama 3 : Your 5-Step Guide to Google Colab Setup
19:03A PHP dev’s dream: An AI home that really gets you
19:00LMStudio 0.3 chat with documents new feature. Use case with the new Hermes 3 — Llama-3.1 8B
18:53Retrieval-Augmented Generation (RAG): Components and Workflow Explained
18:35Can You Train AI Models on CPUs? How Many Would You Need?
18:05AI 3 body problem — number one
18:02Stateful and Responsible AI Agents
17:43LLM APIs for Document Data Extraction
17:41The Truth About LLMs: Why They’re Not as Smart as You Think
17:36Chunking Strategies — How to Choose the Right Chunking Method for My RAG Pipeline
17:34MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Models (MLLMs)
17:12Azure ML — End to end Building and Deploying ML Models
16:58Building “Vishay”: Reference Enabled Chatbot using Langchain and Gemini
16:55Mastering Celery-Redis Queue Setup: Insights from a UC Berkeley LLM Project
16:53How Anthropic built Artifacts
16:49Cerebras Inference: AI at Instant Speed
16:48Cerebras Inference
16:42Cerebras launches inference for Llama 3.1; benchmarked at 1846 tokens/s on 8B
16:40Agentic AI: Part 3
16:36Influencing a Large Language Model response with in-context learning.
16:34Half of AGI safety researchers have left OpenAI
16:33Cerebras Launches the Fastest AI Inference
16:31LLM Study Diary: Decoding LangChain’s Official Multimodal RAG Sample
16:24Counting Tokens in OpenAI API Requests Using tiktoken
16:15Case Study: How Salomatic Used Langtrace to Build a Reliable Medical Report Generation System
16:10cerebras: 450 tokens/sec llama 3.1 70B
16:10Hyperparameter Tuning:
16:09AI against AI ?
16:01RAG app with uploads on Cloudflare workers
16:01Anthropic: Artifacts are now generally available
16:01Generative AI Certification Test: Our New Launch With Activeloop
155 of 100
Was this helpful?
Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072803