LLM News and Articles

163 of 100
Monday, 2024-08-19
06:00Understanding the 27 Unique Challenges in Large Language Model Development: An Empirical Study of Over 29,000 Developer Forum Posts and 54% Unresolved Issues
05:22LLMs Still Can’t Plan And Reason
05:12Knowledge graph extraction using LLMs // Advanced RAG
05:04Building a Versatile AI Chatbot with LangGraph: A Step-by-Step Guide
05:00The Challenges of Implementing Retrieval Augmented Generation (RAG) in Production
04:55Paper Review: Winning Amazon KDD Cup24
04:39LLM-Powered Summaries: Unlock Business Insights with Chain of Density
04:17Enhancing Civil Engineering Calculations in Chatbots with CalcTree API Integration
03:51End to end LLMOps Pipeline — Part 7- Validating Kubernetes Manifests with kube-score
02:52How to Glean Insights from Support Requests Using Prompt Engineering
02:52The Schema Connection: Unlocking Human-like Adaptability in Large Language Models.
01:02My conversation with Perplexity AI Pro
00:00Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI
Sunday, 2024-08-18
23:1530 days of learning LLM!
23:08Is Llama 3.1 a baby step towards intelligence explosion (AI improving AI)
22:09Decoding the Transformer Model: Architecture, Loss Function, and Inference from the ‘Attention is…
22:09The Rise of Arabic LLMs and leveraging it to cater for Illiterates
21:54Building Personal Chatbot — Part 2
21:32In the end it really is just text completion (mostly)
21:27Show HN: Dive into Transformers and LLM World – Llama 3.1 in Go, Step by Step
21:17Maximizing Efficiency in Deep Learning: From Quantization to Multi-GPU Scaling
21:17USC Researchers Present Safer-Instruct: A Novel Pipeline for Automatically Constructing Large-Scale Preference Data
21:04Building a Generative AI Model with Markov Chains: Understanding HMMs
20:29Fine-tuning Language Models for Triple Extraction with Data Augmentation
20:27I Trained a 2D Game Animation Generation Model to Create Complex, Cool Game Actions (Fully…
20:20Review : OneLLM: One Framework to Align All Modalities with Language
19:14Introduction to Language Model Merging
19:10Serving Large models (part one): VLLM, LLAMA CPP Server, and SGLang
19:08Linux Foundation Backs Open Source LLM Initiative
19:07Digitalizando tus Finanzas. De imágenes (o PDFs) a datos estructurados: OCR vs LLMs
19:07Anthropic AI Claude’s SEO Optimization Impact
19:04LLM Architectures Explained: Word Embeddings (Part 2)
18:49How ChatGPT Works: A Complete Explanation
18:48How I Built a Language Translator Using LangChain and Few-Shot Learning
18:24Mastering RAG: A Deep Dive into Retriever
18:10Top 8 LLM+RAG Use Cases in Finance Domain
18:03Learn Prompt Engineering in 5 Minutes
17:33LLM Pre-training Vs. Fine-Tuning
17:21My AI Chatbot Is Better Than Yours! At Least in One Way Yours Can’t Even Do
17:16Sam Altman Is Battling with Governments over Your Eyes
17:15Zero-Shot Fact-Checking with Semantic Triples and Knowledge Graphs
16:59Subverting the System: The Danger of Prompt Injection
16:54Build Free, Build Reliably, Build Proud: Road to LLM Observability
16:50Top Open-Source Large Language Models Shaping AI Today
16:50xAI: Grok-2 shocks the world of AI
16:15Create Sentiment Analyzer using LLM Model
15:55An LLM Approach to Continuous Learning (Part 1 of 4)
15:53Behind HuggingFace Transformer Pipeline
15:51This Projects made Me kick Start into Machine learning;
15:50Building LLM functionality made easy with AWS Bedrock (Part II)
15:16Elevating Search with Retrieval-Augmented Generation (RAG) Systems: Techniques and Best Practices
15:10Show HN: AdalFlow: The library to build and auto-optimize any LLM task pipeline
14:40LLM server for inference
14:34Multi-Niveau Response Structuring (MNRS) Prompting
14:21LLM can be your startup partner
14:19Building Local RAG:From Start to Finish
14:14RAG - Reworking Reranking
14:03The Evolution of SQL
13:52Run Self hosted nvidia NIM
13:50Which Metrics Should You Monitor for Large Language Model Performance?
13:49STAGE: Simplified Text-Attributed Graph Embeddings Using Pre-trained LLMs
13:365 Best Practices in RAG
12:40LLM ops : GPU VRAM Requirements for Large Language Models LLM
12:29The Dunning-Kruger Effect and LLMs: Confidence vs. Competence in AI
12:29The Dunning-Kruger Effect and LLMs: Confidence vs. Competence in AI
12:24Mastering RAG: A Deep Dive into Embeddings
12:23A Domain-Specific LLM Project. How to Make AI Generate Text.
12:01Streamline Your LLM Evaluation: A Step-by-Step Guide to RAG Metrics with Streamlit
11:16What is Prompt Caching : Reduce LLM cost by 90%!
11:08LLM Model Composition from scratch
10:48Observations on the State of Affairs in Legal Tech
10:37Rootless mode
10:31Jacob: Three bubble tops from AI, restoring a buildup all consumed the fastest
10:24An Introduction to Legal AI
10:14Demystifying Large Language Models
09:59GraphRAG: The idea and the implication
09:49Accelerating Large Language Models with TensorRT-LLM
09:41Streamlining Your AI Development with a Unified LLM Factory in Python
09:26LLM Architectures — the building blocks
09:11Mastering RAG: A Deep Dive into Text Splitting
09:02Gen AI @ Work
08:53Hugging Words: How the NLP Pipeline Transforms Text into Insights
08:31Llama3.1 405B Modelini Ücretsiz Kullanalım!
08:14Beyond Model Size: The Future of LLM Optimisation
08:04Aquila2: Advanced Bilingual Language Models Ranging from 7 to 70 Billion Parameters
07:58This AI Paper from John Hopkins Introduces Continual Pre-training and Fine-Tuning for Enhanced LLM Performance
07:57Prompt Injection Defense: Fortifying AI app at the Application Level
07:49Understanding Implicit Chain of Thought Reasoning via Knowledge Distillation
07:43Mastering RAG Fusion in Simple Steps: A Deep Dive into Retrieval-Augmented Generation”
07:35DSPy Guide: Running your Project on Local Models (Part 1)
06:55Enhancing domain-specific RAG systems
06:01How to Get a Batching API Like OpenAI for Open-Source Models
05:50Summary Blog: Self-Taught Evaluators
05:41RAGChecker: A Fine-Grained Evaluation Framework for Diagnosing Retrieval and Generation Modules in RAG
05:09Leveraging Langchain and AWS Bedrock for Seamless Multi-LLM Integration
04:20RLHF vs. DPO: Choosing the Method for LLMs Alignment Tuning
03:00Massive Energy for Massive GPU Empowering AI
02:43Maximizing LLM Accuracy for Production: A Practical Guide
02:41Let’s debate AGI: Can a Turing machine simulate the human brain?
01:31A Beginner’s Guide to LLMs — What’s a Large-Language Model and How Does it Work?
163 of 100
Was this helpful?
Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072803