LLM News and Articles
Sunday, 2024-08-25 | ||||
04:28 | Stop AI from Bluffing: How RAG Delivers Accurate, Up-to-Date Facts https://medium.com/gptalk/stop-ai-from-bluffing-how-rag-delivers-accurate-up-to-date-facts-7d0bbc1a456d | |||
03:44 | Implement better AI models using RAGs | Improve your LLM by 300% https://medium.com/@thcookieh/implement-better-ai-models-using-rags-improve-your-llm-by-300-1a537332d60e | |||
03:44 | From Basics to Production: Mastering Retrieval-Augmented Generation (RAG) with Large Language… https://maddymaster.medium.com/implementing-retrieval-augmented-generation-rag-with-large-language-models-llms-35d9930d5592 | |||
03:44 | Heterogeneous Mixture of Experts (HMoE): Enhancing Model Efficiency and Performance with Diverse Expert Capacities https://www.marktechpost.com/2024/08/24/heterogeneous-mixture-of-experts-hmoe-enhancing-model-efficiency-and-performance-with-diverse-expert-capacities/ | |||
03:34 | MagicDec: Unlocking Up to 2x Speedup in LLaMA Models for Long-Context Applications https://www.marktechpost.com/2024/08/24/magicdec-unlocking-up-to-2x-speedup-in-llama-models-for-long-context-applications/ | |||
03:30 | Implementa mejores modelos de IA usando Rags | Mejora un 300% tu LLM https://medium.com/@thcookieh/implementa-mejores-modelos-de-ia-usando-rags-mejora-un-300-tu-llm-eb06a7e6097d | |||
03:26 | Chat-Vox https://medium.com/@aliayubi167/chat-vox-da34d3efd8df | |||
02:30 | Building a SaaS Website from Scratch https://medium.com/@modelfusion/building-a-saas-website-from-scratch-8628bf266136 | |||
02:27 | Anthropic Claude 3.5 can create icalendar files, so I did this https://gregsramblings.com/stupid-but-useful-ai-tricks-creating-calendar-entries-from-an-image-using-anthropic-claude-35 | |||
01:25 | Using Gen AI to help with coding https://medium.com/@prule70/using-gen-ai-to-help-with-coding-99cecb3b091a | |||
01:00 | Jamba 1.5 AI Model: Crunching huge documents fast on single GPU https://medium.com/gptalk/jamba-1-5-crunching-huge-documents-fast-on-single-gpu-7be82b2e27f9 | |||
00:02 | Large Language Model = Enigma Machine https://generativeai.pub/large-language-model-enigma-machine-8e06fe1228c2 | |||
Saturday, 2024-08-24 | ||||
23:39 | A light-weight no-cost implementation of web based Retrieval-Augmented Generation https://medium.com/@anthony.demeusy/a-light-weight-no-cost-implementation-of-web-based-retrieval-augmented-generation-548a898ed313 | |||
23:08 | You can train a flux lora on your own handwriting and it works https://old.reddit.com/r/StableDiffusion/comments/1f0hs79/you_can_train_handwriting_loras_with_flux | |||
21:37 | Company songs, generative AI, and product life cycles https://jchyip.medium.com/company-songs-generative-ai-and-product-life-cycles-2f70f43b7a7f | |||
20:14 | Exécuter Phi-3-mini-4k-instruct localement avec llama.cpp : Un guide étape par étape https://medium.com/@_jeremy_/ex%C3%A9cuter-phi-3-mini-4k-instruct-localement-avec-llama-cpp-un-guide-%C3%A9tape-par-%C3%A9tape-9360720de591 | |||
20:11 | Running Phi-3-mini-4k-instruct Locally with llama.cpp: A Step-by-Step Guide https://medium.com/@_jeremy_/running-phi-3-mini-4k-instruct-locally-with-llama-cpp-a-step-by-step-guide-3e070763f697 | |||
20:07 | Setting Up an Ollama Server with GPU on AWS and Integrating with a .NET Blazor Server App https://medium.com/@justinking_2311/setting-up-an-ollama-server-with-gpu-on-aws-and-integrating-with-a-net-blazor-server-app-a2a8a0f41773 | |||
20:01 | A real-life use case with codes: build an AI-powered research assistant with Llama3, LlamaIndex… https://pub.towardsai.net/a-real-life-use-case-with-codes-build-an-ai-powered-research-assistant-with-llama3-llamaindex-ad105a3eda77 | |||
19:56 | Navigating the Future of AI: My Journey as a 3rd-Year BTech Student and What You Should Know About… https://medium.com/@Iammilansoni/navigating-the-future-of-ai-my-journey-as-a-3rd-year-btech-student-and-what-you-should-know-about-f3fcf5e309c7 | |||
19:47 | Tutorial: Using LLM and LangChain to Power Up AI for Database with Chatbot — Part 1 https://medium.com/@ygeszvain/tutorial-using-llm-and-langchain-to-power-up-ai-for-database-with-chatbot-part-1-82d84c0f403a | |||
19:29 | Improve RAG Systems Learning from Mistakes https://medium.com/@Lorenzo_Pozzi/improve-rag-systems-learning-from-mistakes-60f63d779dc2 | |||
19:28 | Karpathy on VS Code Cursor and Sonnet 3.5 vs. GitHub Copilot https://twitter.com/karpathy/status/1827143768459637073 | |||
19:19 | Harnessing LLMs for Knowledge Graph Construction on Indian Healthcare: Extracting Concepts and… https://kshitijkutumbe.medium.com/harnessing-llms-for-knowledge-graph-construction-on-indian-healthcare-extracting-concepts-and-a580de674171 | |||
19:01 | Bringing AI Home: Your Private LLM Chat Alternative to ChatGPT https://medium.com/@mohammedhs404/bringing-ai-home-your-private-llm-chat-alternative-to-chatgpt-755ae70b4989 | |||
18:45 | GROK 2 Is Here—Can It Improve Your Writing and Stories? https://medium.com/@pgadityasingh/grok-2-is-here-can-it-improve-your-writing-and-stories-c36f17aca97e | |||
18:27 | What Is an LLM Agent and How Can You Leverage It in Your Applications? https://medium.com/@mehar.chand.cloud/what-is-an-llm-agent-and-how-can-you-leverage-it-in-your-applications-42ae9f4f6faf | |||
18:05 | Exploring Perplexity: A Concise Guide with Python Code https://python.plainenglish.io/exploring-perplexity-a-concise-guide-with-python-code-cfc201ba6c1a | |||
18:01 | When (not) to Use GraphRAG https://pub.towardsai.net/when-not-to-use-graphrag-02a80d77fcbf | |||
18:01 | Automated Design of Agentic Systems (ADAS) https://www.llmwatch.com/p/automated-design-of-agentic-systems | |||
17:48 | A Brief History of Causality from Homo Sapiens to AGI https://cplu.medium.com/would-agi-pray-to-digital-angles-a-brief-history-of-causality-from-plato-and-aristotle-to-pearl-f12faf081b68 | |||
17:37 | How does an LLM sample a sentence? https://python.plainenglish.io/how-does-an-llm-sample-a-sentence-ac64b741414f | |||
17:21 | Exploring LoRA(Low-Rank Adaptation) https://medium.com/@hasfatauil12/exploring-lora-low-rank-adaptation-c58b7192e5ba | |||
17:17 | Exploring AI Innovations: Insights on Microsoft’s Phi 3.5 Mini and NVIDIA’s SLM https://medium.com/@saumil23/exploring-ai-innovations-insights-on-microsofts-phi-3-5-mini-and-nvidia-s-slm-33b4fedf409e | |||
16:53 | Biorecap: An R package for summarizing bioRxiv preprints with a local LLM https://blog.stephenturner.us/p/biorecap-r-package-for-summarizing-biorxiv-preprints-local-llm | |||
16:50 | Practical Implementation and Future Directions https://medium.com/@mpuig/practical-implementation-and-future-directions-0a3b8e87f229 | |||
16:40 | The top 30 books to expand the capabilities of AI: a biased reading list https://medium.com/@jmugan/the-top-30-books-to-expand-the-capabilities-of-ai-a-biased-reading-list-f521bed51dbc | |||
16:37 | Fine-Tuning Small Language Models for AI Agent Tool Utilization https://ai.gopubby.com/fine-tuning-small-language-models-for-ai-agent-tool-utilization-fcb10a1bbf51 | |||
16:19 | ColBERT and Beyond: Advancing Retrieval Techniques https://medium.com/@mpuig/colbert-and-beyond-advancing-retrieval-techniques-81df1b2324d6 | |||
16:17 | Microsoft GraphRAG with an RDF Knowledge Graph — Part 3 https://medium.com/@ianormy/microsoft-graphrag-with-an-rdf-knowledge-graph-part-3-328f85d7dab2 | |||
16:12 | I am https://kaleidoscopesharts.medium.com/i-am-0d2b573cc71f | |||
16:10 | Importing Your Unstructured Triples into WhyHow.AI — Notebook Demonstration https://medium.com/enterprise-rag/importing-your-unstructured-triples-into-whyhow-ai-notebook-demonstration-3f424a2b27a5 | |||
16:00 | Bi-Encoders and Cross-Encoders: Two Sides of the Retrieval Coin https://medium.com/@mpuig/bi-encoders-and-cross-encoders-two-sides-of-the-retrieval-coin-06a95fe18619 | |||
15:56 | Navigating the GenAI Revolution: Uber’s Strategic Approach to AI Transformation https://medium.com/@praveengovi/navigating-the-genai-revolution-ubers-strategic-approach-to-ai-transformation-2cbf9861586a | |||
14:50 | Embedding Models in RAG Systems: The Cornerstone of Effective Retrieval https://medium.com/@mpuig/embedding-models-in-rag-systems-the-cornerstone-of-effective-retrieval-f2c888d69744 | |||
14:48 | RAG Systems vs. Traditional Language Models: A New Era of AI-Powered Information Retrieval https://medium.com/@mpuig/rag-systems-vs-traditional-language-models-a-new-era-of-ai-powered-information-retrieval-887ec31c15a0 | |||
14:44 | Building a Small Retrieval-Augmented Generation (RAG) Application: An End-to-End Guide https://medium.com/@kts.ramamoorthy07/building-a-small-retrieval-augmented-generation-rag-application-an-end-to-end-guide-9c7fab887ee2 | |||
14:38 | Inference Optimizations #1 — Continuous Batching https://donmoon.medium.com/inference-optimizations-1-continuous-batching-03408c673098 | |||
14:28 | Are LLMs and In-Context Learning Enough for NLP? https://josecamachocollados.medium.com/are-llms-and-in-context-learning-enough-for-nlp-b05221144bd8 | |||
14:21 | Processing Unstructured Data with Snowflake Cortex AI https://medium.com/snowflake/processing-unstructured-data-with-snowflake-cortex-ai-807f880b9c4e | |||
14:09 | Navigating the Ethical and Practical Challenges of Large Language Models https://medium.com/@armaansinghbhau8/navigating-the-ethical-and-practical-challenges-of-large-language-models-084aa0784a20 | |||
13:54 | How OpenAI or DeepMind calculates cost of training a transformer based models? https://masteringllm.medium.com/how-openai-or-deepmind-calculates-cost-of-training-a-transformer-based-models-b0b629f0942b | |||
13:53 | Direct Preference Optimization (DPO) of LLMs: A Paradigm Shift https://medium.com/@edmond.po/direct-preference-optimization-a-novel-approach-to-language-model-alignment-1f829d4ac306 | |||
13:46 | A guide to run LLM on Akash Network with KoboldCPP https://medium.com/@txartblock/a-guide-to-run-llm-on-akash-network-with-koboldcpp-8edee116afcd | |||
13:45 | A Game Changer! You Can Now Create a Chatbot for Any GitHub Repo Using Llama 3.1 405B or 70B! https://medium.com/@abdulvahapmutlu/a-game-changer-you-can-now-create-a-chatbot-for-any-github-repo-using-llama-3-1-405b-or-70b-63e936b44e9e | |||
13:28 | ReAnnotated Transformer https://medium.com/@kiangyeow/reannotated-transformer-450633432baa | |||
12:51 | My Internship Experiences as a Machine Learning Engineer https://medium.com/@williechu1125/my-internship-experiences-as-a-machine-learning-engineer-94ae6f54c95e | |||
12:15 | How to Train a FLUX.1 LoRA https://notes.dsebastien.net/30+Areas/33+Permanent+notes/33.02+Content/How+to+train+a+FLUX.1+LoRA | |||
11:36 | Grok-2: Elon Musk’s AI Lovechild That’s Making GPT-4 Sweat https://medium.com/@cognidownunder/grok-2-elon-musks-ai-lovechild-that-s-making-gpt-4-sweat-f4fb1d8198b7 | |||
11:25 | Introduction To Generative AI And LLM In Depth https://medium.com/@fraidoonomarzai99/introduction-to-generative-ai-and-llm-in-depth-aaf4bb5546ff | |||
11:24 | LLM-RAG pt.1 / Vector DB, What is the difference? https://medium.com/dev-ai/rag-pt-1-vector-db-what-is-the-difference-f4a9702f26da | |||
11:18 | Understanding SpreadsheetLLM: A Novel Approach to Spreadsheet Data Processing https://medium.com/@jain.ajanuj/understanding-spreadsheetllm-a-novel-approach-to-spreadsheet-data-processing-f7788553bdc5 | |||
09:50 | Building a production-ready Chatbot: Knowledge, RAG, and Search (1) https://medium.com/@tsunhanchiang/building-a-production-ready-chatbot-knowledge-rag-and-search-1-4492ab00028e | |||
08:45 | Decentralized Uncensored LLM Model: Is Decentralized AI the Key to True Freedom? https://medium.com/ailogic/decentralized-uncensored-llm-model-is-decentralized-ai-the-key-to-true-freedom-41a6fecc6c08 | |||
08:38 | Introduction to Basics of Quantization in Large Language Models https://ojasvinsood.medium.com/introduction-to-basics-of-quantization-in-large-language-models-649d419c3c0e | |||
07:31 | Fine-tuning FLUX.1: Customizing Your AI Image Generator https://medium.com/@naman1011/fine-tuning-flux-1-customizing-your-ai-image-generator-58f75e7ffe6d | |||
07:04 | KPAI — A new way to look at business metrics https://medium.com/firebird-technologies/kpai-a-new-way-to-look-at-business-metrics-75eaf0da8dbd | |||
06:54 | Implementing Retrieval-Augmented Generation (RAG) into a local Large Language Model https://medium.com/@jakob.rohrhirsch/implementing-retrieval-augmented-generation-rag-into-a-local-large-language-model-95395477de7d | |||
06:37 | Building a LLM comment summarization component https://medium.com/codex/building-a-llm-comment-summarization-component-d0456cc23792 | |||
06:05 | The AI Revolution in Quantitative Finance: How LLMs are Reshaping Trading Strategies https://medium.com/@pta.forwork/the-ai-revolution-in-quantitative-finance-how-llms-are-reshaping-trading-strategies-828ea572364b | |||
04:46 | Understanding CONDENSE: A New Approach to Optimizing Large Language Models https://bobrupakroy.medium.com/understanding-condense-a-new-approach-to-optimizing-large-language-models-99397a5da011 | |||
04:27 | Building Efficient Applications with Few shot Classification https://generativeai.pub/building-efficient-applications-with-few-shot-classification-6afee7a4db43 | |||
04:16 | What causes LLM Hallucinations? https://generativeai.pub/what-causes-llm-hallucinations-26c10bbb03ae | |||
04:15 | Generative AI in Business: Great Potential, Greater Challenges https://generativeai.pub/generative-ai-in-business-great-potential-greater-challenges-52af2ab9dd67 | |||
04:08 | Retrieval Augmented Generation (RAG) https://medium.com/@abdulwasiueunk/retrieval-augmented-generation-rag-6bb8f7ab2f3c | |||
03:08 | Human-AI Interaction Playbook https://medium.com/@yujinisgr8/human-ai-interaction-playbook-9b9effd049c7 | |||
02:11 | Advanced RAG Techniques — The LLMCompiler Approach https://gabrielgomes61320.medium.com/advanced-rag-techniques-the-llmcompiler-approach-ed3ecf1ddb43 | |||
01:24 | Unleash the Full Potential of AI with ModelFusion: Your Ultimate AI Companion https://medium.com/@modelfusion/unleash-the-full-potential-of-ai-with-modelfusion-your-ultimate-ai-companion-38422c9eaeeb | |||
01:17 | LLMs missing Language https://medium.com/pat-inc/llms-missing-language-5a13bbde4719 | |||
00:56 | From Emotionless Machines to Intuitive Intelligences: Rethinking AI Cognition https://medium.com/@mbonsign/from-emotionless-machines-to-intuitive-intelligences-rethinking-ai-cognition-9bf7e1e59233 | |||
Friday, 2024-08-23 | ||||
22:38 | The Mathematical Essence of Loss Function Design in Deep Neural Networks https://medium.com/autonomous-agents/the-mathematical-essence-of-loss-function-design-in-deep-neural-networks-9dc76b08406c | |||
21:23 | Fine-Tuning OpenAI GPT Models: A Comprehensive Step-by-Step Guide https://medium.com/@amosmaru10/fine-tuning-openai-gpt-models-a-comprehensive-step-by-step-guide-0f98623789fb | |||
20:52 | Beating OpenAI's structured outputs on cost, accuracy and speed https://www.boundaryml.com/blog/sota-function-calling | |||
20:23 | Databricks Generative AI Engineer Associate Certification: Study Guide Part 1 https://medium.com/@chandadipendu/databricks-generative-ai-engineer-associate-certification-study-guide-part-1-70cf3c483085 | |||
20:18 | Liger-Kernel: Efficient Triton kernels for LLM training https://github.com/linkedin/Liger-Kernel | |||
20:12 | AI21 Labs Released Jamba 1.5 Family of Open Models: Jamba 1.5 Mini and Jamba 1.5 Large Redefining Long-Context AI with Unmatched Speed, Quality, and Multilingual Capabilities for Global Enterprises https://www.marktechpost.com/2024/08/23/ai21-labs-released-jamba-1-5-family-of-open-models-jamba-1-5-mini-and-jamba-1-5-large-redefining-long-context-ai-with-unmatched-speed-quality-and-multilingual-capabilities-for-global-enterprises/ | |||
20:08 | Getting Started with Local AI/LLMs in Three Easy Steps https://runeberg.medium.com/getting-started-with-local-ai-llms-in-three-easy-steps-bddebcf26570 | |||
20:01 | Simplifying LLM Development: Treat It Like Regular ML https://pub.towardsai.net/simplifying-llm-development-treat-it-like-regular-ml-5b0648c9c938 | |||
19:46 | Semantic Search Battle: Vertex AI vs. Table Search for shops https://medium.com/@nastiya.levchenko/semantic-search-battle-vertex-ai-vs-table-search-for-shops-df6136f21d66 | |||
19:14 | Knowledge Diaries: I launched an Open Source Dev Tool! https://medium.com/@seifmamdouh7878/knowledge-diaries-i-launched-an-open-source-dev-tool-3544eefd2595 | |||
19:08 | Chat with your personal pdf free using Cohere https://medium.com/@vaibhavdahiya/chat-with-your-personal-pdf-free-using-cohere-4c9e7b2561af | |||
19:07 | Intent Detection Using LLMs https://blog.gopenai.com/intent-detection-using-llms-ba857f673260 | |||
19:01 | Integrating AI models in Next.js https://medium.com/@iqrahamzaworks/integrating-ai-models-in-next-js-5423c27aa30e | |||
19:00 | Lab #6: Chat with Cloud Diagram Agent (ChatGPT, Langchain & Streamlit) https://ai.gopubby.com/lab-6-chat-with-cloud-diagram-agent-chatgpt-langchain-streamlit-67ac3e211933 | |||
18:55 | Prompt Caching with Claude 3.5 Sonnet https://medium.com/dair-ai/prompt-caching-with-claude-3-5-sonnet-e59c91eeda9c | |||
18:44 | Navigating the Cost Landscape of LLMs in Production: Strategies for Optimization and Informed… https://medium.com/@shivam_23581/navigating-the-cost-landscape-of-llms-in-production-strategies-for-optimization-and-informed-e252d8307209 | |||
18:34 | This AI Paper by National University of Singapore Introduces A Comprehensive Survey of Language Models for Tabular Data Analysis https://www.marktechpost.com/2024/08/23/this-ai-paper-by-national-university-of-singapore-introduces-a-comprehensive-survey-of-language-models-for-tabular-data-analysis/ | |||
18:10 | Automating Table of Content Extraction and Filtering in Papers with LlamaIndex https://medium.com/@hborobia/automating-table-of-content-extraction-and-filtering-in-papers-with-llamaindex-7fc6a7cd3aae | |||
18:01 | Perplexity AI plans to start running ads in 4th quarter https://www.cnbc.com/2024/08/22/perplexity-ai-plans-to-start-running-search-ads-in-fourth-quarter.html | |||
18:01 | 👁️ Agent-ception: When Agents Are Creating Agents https://www.llmwatch.com/p/agent-ception-when-agents-are-creating |
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072803