LLM News and Articles

1-58 of 100

Sunday, 2024-08-25
04:28		Stop AI from Bluffing: How RAG Delivers Accurate, Up-to-Date Facts https://medium.com/gptalk/stop-ai-from-bluffing-how-rag-delivers-accurate-up-to-date-facts-7d0bbc1a456d
03:44		Implement better AI models using RAGs | Improve your LLM by 300% https://medium.com/@thcookieh/implement-better-ai-models-using-rags-improve-your-llm-by-300-1a537332d60e
03:44		From Basics to Production: Mastering Retrieval-Augmented Generation (RAG) with Large Language… https://maddymaster.medium.com/implementing-retrieval-augmented-generation-rag-with-large-language-models-llms-35d9930d5592
03:44		Heterogeneous Mixture of Experts (HMoE): Enhancing Model Efficiency and Performance with Diverse Expert Capacities https://www.marktechpost.com/2024/08/24/heterogeneous-mixture-of-experts-hmoe-enhancing-model-efficiency-and-performance-with-diverse-expert-capacities/
03:34		MagicDec: Unlocking Up to 2x Speedup in LLaMA Models for Long-Context Applications https://www.marktechpost.com/2024/08/24/magicdec-unlocking-up-to-2x-speedup-in-llama-models-for-long-context-applications/
03:30		Implement better AI models using RAGs | Improve your LLM by 300% (in Spanish) https://medium.com/@thcookieh/implementa-mejores-modelos-de-ia-usando-rags-mejora-un-300-tu-llm-eb06a7e6097d
03:26		Chat-Vox https://medium.com/@aliayubi167/chat-vox-da34d3efd8df
02:30		Building a SaaS Website from Scratch https://medium.com/@modelfusion/building-a-saas-website-from-scratch-8628bf266136
02:27		Anthropic Claude 3.5 can create icalendar files, so I did this https://gregsramblings.com/stupid-but-useful-ai-tricks-creating-calendar-entries-from-an-image-using-anthropic-claude-35
01:25		Using Gen AI to help with coding https://medium.com/@prule70/using-gen-ai-to-help-with-coding-99cecb3b091a
01:00		Jamba 1.5 AI Model: Crunching huge documents fast on single GPU https://medium.com/gptalk/jamba-1-5-crunching-huge-documents-fast-on-single-gpu-7be82b2e27f9
00:02		Large Language Model = Enigma Machine https://generativeai.pub/large-language-model-enigma-machine-8e06fe1228c2
Saturday, 2024-08-24
23:39		A light-weight no-cost implementation of web based Retrieval-Augmented Generation https://medium.com/@anthony.demeusy/a-light-weight-no-cost-implementation-of-web-based-retrieval-augmented-generation-548a898ed313
23:08		You can train a flux lora on your own handwriting and it works https://old.reddit.com/r/StableDiffusion/comments/1f0hs79/you_can_train_handwriting_loras_with_flux
21:37		Company songs, generative AI, and product life cycles https://jchyip.medium.com/company-songs-generative-ai-and-product-life-cycles-2f70f43b7a7f
20:14		Running Phi-3-mini-4k-instruct Locally with llama.cpp: A Step-by-Step Guide (in French) https://medium.com/@_jeremy_/ex%C3%A9cuter-phi-3-mini-4k-instruct-localement-avec-llama-cpp-un-guide-%C3%A9tape-par-%C3%A9tape-9360720de591
20:11		Running Phi-3-mini-4k-instruct Locally with llama.cpp: A Step-by-Step Guide https://medium.com/@_jeremy_/running-phi-3-mini-4k-instruct-locally-with-llama-cpp-a-step-by-step-guide-3e070763f697
20:07		Setting Up an Ollama Server with GPU on AWS and Integrating with a .NET Blazor Server App https://medium.com/@justinking_2311/setting-up-an-ollama-server-with-gpu-on-aws-and-integrating-with-a-net-blazor-server-app-a2a8a0f41773
20:01		A real-life use case with codes: build an AI-powered research assistant with Llama3, LlamaIndex… https://pub.towardsai.net/a-real-life-use-case-with-codes-build-an-ai-powered-research-assistant-with-llama3-llamaindex-ad105a3eda77
19:56		Navigating the Future of AI: My Journey as a 3rd-Year BTech Student and What You Should Know About… https://medium.com/@Iammilansoni/navigating-the-future-of-ai-my-journey-as-a-3rd-year-btech-student-and-what-you-should-know-about-f3fcf5e309c7
19:47		Tutorial: Using LLM and LangChain to Power Up AI for Database with Chatbot — Part 1 https://medium.com/@ygeszvain/tutorial-using-llm-and-langchain-to-power-up-ai-for-database-with-chatbot-part-1-82d84c0f403a
19:29		Improve RAG Systems Learning from Mistakes https://medium.com/@Lorenzo_Pozzi/improve-rag-systems-learning-from-mistakes-60f63d779dc2
19:28		Karpathy on VS Code Cursor and Sonnet 3.5 vs. GitHub Copilot https://twitter.com/karpathy/status/1827143768459637073
19:19		Harnessing LLMs for Knowledge Graph Construction on Indian Healthcare: Extracting Concepts and… https://kshitijkutumbe.medium.com/harnessing-llms-for-knowledge-graph-construction-on-indian-healthcare-extracting-concepts-and-a580de674171
19:01		Bringing AI Home: Your Private LLM Chat Alternative to ChatGPT https://medium.com/@mohammedhs404/bringing-ai-home-your-private-llm-chat-alternative-to-chatgpt-755ae70b4989
18:45		GROK 2 Is Here—Can It Improve Your Writing and Stories? https://medium.com/@pgadityasingh/grok-2-is-here-can-it-improve-your-writing-and-stories-c36f17aca97e
18:27		What Is an LLM Agent and How Can You Leverage It in Your Applications? https://medium.com/@mehar.chand.cloud/what-is-an-llm-agent-and-how-can-you-leverage-it-in-your-applications-42ae9f4f6faf
18:05		Exploring Perplexity: A Concise Guide with Python Code https://python.plainenglish.io/exploring-perplexity-a-concise-guide-with-python-code-cfc201ba6c1a
18:01		When (not) to Use GraphRAG https://pub.towardsai.net/when-not-to-use-graphrag-02a80d77fcbf
18:01		Automated Design of Agentic Systems (ADAS) https://www.llmwatch.com/p/automated-design-of-agentic-systems
17:48		A Brief History of Causality from Homo Sapiens to AGI https://cplu.medium.com/would-agi-pray-to-digital-angles-a-brief-history-of-causality-from-plato-and-aristotle-to-pearl-f12faf081b68
17:37		How does an LLM sample a sentence? https://python.plainenglish.io/how-does-an-llm-sample-a-sentence-ac64b741414f
17:21		Exploring LoRA(Low-Rank Adaptation) https://medium.com/@hasfatauil12/exploring-lora-low-rank-adaptation-c58b7192e5ba
17:17		Exploring AI Innovations: Insights on Microsoft’s Phi 3.5 Mini and NVIDIA’s SLM https://medium.com/@saumil23/exploring-ai-innovations-insights-on-microsofts-phi-3-5-mini-and-nvidia-s-slm-33b4fedf409e
16:53		Biorecap: An R package for summarizing bioRxiv preprints with a local LLM https://blog.stephenturner.us/p/biorecap-r-package-for-summarizing-biorxiv-preprints-local-llm
16:50		Practical Implementation and Future Directions https://medium.com/@mpuig/practical-implementation-and-future-directions-0a3b8e87f229
16:40		The top 30 books to expand the capabilities of AI: a biased reading list https://medium.com/@jmugan/the-top-30-books-to-expand-the-capabilities-of-ai-a-biased-reading-list-f521bed51dbc
16:37		Fine-Tuning Small Language Models for AI Agent Tool Utilization https://ai.gopubby.com/fine-tuning-small-language-models-for-ai-agent-tool-utilization-fcb10a1bbf51
16:19		ColBERT and Beyond: Advancing Retrieval Techniques https://medium.com/@mpuig/colbert-and-beyond-advancing-retrieval-techniques-81df1b2324d6
16:17		Microsoft GraphRAG with an RDF Knowledge Graph — Part 3 https://medium.com/@ianormy/microsoft-graphrag-with-an-rdf-knowledge-graph-part-3-328f85d7dab2
16:12		I am https://kaleidoscopesharts.medium.com/i-am-0d2b573cc71f
16:10		Importing Your Unstructured Triples into WhyHow.AI — Notebook Demonstration https://medium.com/enterprise-rag/importing-your-unstructured-triples-into-whyhow-ai-notebook-demonstration-3f424a2b27a5
16:00		Bi-Encoders and Cross-Encoders: Two Sides of the Retrieval Coin https://medium.com/@mpuig/bi-encoders-and-cross-encoders-two-sides-of-the-retrieval-coin-06a95fe18619
15:56		Navigating the GenAI Revolution: Uber’s Strategic Approach to AI Transformation https://medium.com/@praveengovi/navigating-the-genai-revolution-ubers-strategic-approach-to-ai-transformation-2cbf9861586a
14:50		Embedding Models in RAG Systems: The Cornerstone of Effective Retrieval https://medium.com/@mpuig/embedding-models-in-rag-systems-the-cornerstone-of-effective-retrieval-f2c888d69744
14:48		RAG Systems vs. Traditional Language Models: A New Era of AI-Powered Information Retrieval https://medium.com/@mpuig/rag-systems-vs-traditional-language-models-a-new-era-of-ai-powered-information-retrieval-887ec31c15a0
14:44		Building a Small Retrieval-Augmented Generation (RAG) Application: An End-to-End Guide https://medium.com/@kts.ramamoorthy07/building-a-small-retrieval-augmented-generation-rag-application-an-end-to-end-guide-9c7fab887ee2
14:38		Inference Optimizations #1 — Continuous Batching https://donmoon.medium.com/inference-optimizations-1-continuous-batching-03408c673098
14:28		Are LLMs and In-Context Learning Enough for NLP? https://josecamachocollados.medium.com/are-llms-and-in-context-learning-enough-for-nlp-b05221144bd8
14:21		Processing Unstructured Data with Snowflake Cortex AI https://medium.com/snowflake/processing-unstructured-data-with-snowflake-cortex-ai-807f880b9c4e
14:09		Navigating the Ethical and Practical Challenges of Large Language Models https://medium.com/@armaansinghbhau8/navigating-the-ethical-and-practical-challenges-of-large-language-models-084aa0784a20
13:54		How OpenAI or DeepMind calculates cost of training a transformer based models? https://masteringllm.medium.com/how-openai-or-deepmind-calculates-cost-of-training-a-transformer-based-models-b0b629f0942b
13:53		Direct Preference Optimization (DPO) of LLMs: A Paradigm Shift https://medium.com/@edmond.po/direct-preference-optimization-a-novel-approach-to-language-model-alignment-1f829d4ac306
13:46		A guide to run LLM on Akash Network with KoboldCPP https://medium.com/@txartblock/a-guide-to-run-llm-on-akash-network-with-koboldcpp-8edee116afcd
13:45		A Game Changer! You Can Now Create a Chatbot for Any GitHub Repo Using Llama 3.1 405B or 70B! https://medium.com/@abdulvahapmutlu/a-game-changer-you-can-now-create-a-chatbot-for-any-github-repo-using-llama-3-1-405b-or-70b-63e936b44e9e
13:28		ReAnnotated Transformer https://medium.com/@kiangyeow/reannotated-transformer-450633432baa
12:51		My Internship Experiences as a Machine Learning Engineer https://medium.com/@williechu1125/my-internship-experiences-as-a-machine-learning-engineer-94ae6f54c95e
12:15		How to Train a FLUX.1 LoRA https://notes.dsebastien.net/30+Areas/33+Permanent+notes/33.02+Content/How+to+train+a+FLUX.1+LoRA
11:36		Grok-2: Elon Musk’s AI Lovechild That’s Making GPT-4 Sweat https://medium.com/@cognidownunder/grok-2-elon-musks-ai-lovechild-that-s-making-gpt-4-sweat-f4fb1d8198b7
11:25		Introduction To Generative AI And LLM In Depth https://medium.com/@fraidoonomarzai99/introduction-to-generative-ai-and-llm-in-depth-aaf4bb5546ff
11:24		LLM-RAG pt.1 / Vector DB, What is the difference? https://medium.com/dev-ai/rag-pt-1-vector-db-what-is-the-difference-f4a9702f26da
11:18		Understanding SpreadsheetLLM: A Novel Approach to Spreadsheet Data Processing https://medium.com/@jain.ajanuj/understanding-spreadsheetllm-a-novel-approach-to-spreadsheet-data-processing-f7788553bdc5
09:50		Building a production-ready Chatbot: Knowledge, RAG, and Search (1) https://medium.com/@tsunhanchiang/building-a-production-ready-chatbot-knowledge-rag-and-search-1-4492ab00028e
08:45		Decentralized Uncensored LLM Model: Is Decentralized AI the Key to True Freedom? https://medium.com/ailogic/decentralized-uncensored-llm-model-is-decentralized-ai-the-key-to-true-freedom-41a6fecc6c08
08:38		Introduction to Basics of Quantization in Large Language Models https://ojasvinsood.medium.com/introduction-to-basics-of-quantization-in-large-language-models-649d419c3c0e
07:31		Fine-tuning FLUX.1: Customizing Your AI Image Generator https://medium.com/@naman1011/fine-tuning-flux-1-customizing-your-ai-image-generator-58f75e7ffe6d
07:04		KPAI — A new way to look at business metrics https://medium.com/firebird-technologies/kpai-a-new-way-to-look-at-business-metrics-75eaf0da8dbd
06:54		Implementing Retrieval-Augmented Generation (RAG) into a local Large Language Model https://medium.com/@jakob.rohrhirsch/implementing-retrieval-augmented-generation-rag-into-a-local-large-language-model-95395477de7d
06:37		Building a LLM comment summarization component https://medium.com/codex/building-a-llm-comment-summarization-component-d0456cc23792
06:05		The AI Revolution in Quantitative Finance: How LLMs are Reshaping Trading Strategies https://medium.com/@pta.forwork/the-ai-revolution-in-quantitative-finance-how-llms-are-reshaping-trading-strategies-828ea572364b
04:46		Understanding CONDENSE: A New Approach to Optimizing Large Language Models https://bobrupakroy.medium.com/understanding-condense-a-new-approach-to-optimizing-large-language-models-99397a5da011
04:27		Building Efficient Applications with Few shot Classification https://generativeai.pub/building-efficient-applications-with-few-shot-classification-6afee7a4db43
04:16		What causes LLM Hallucinations? https://generativeai.pub/what-causes-llm-hallucinations-26c10bbb03ae
04:15		Generative AI in Business: Great Potential, Greater Challenges https://generativeai.pub/generative-ai-in-business-great-potential-greater-challenges-52af2ab9dd67
04:08		Retrieval Augmented Generation (RAG) https://medium.com/@abdulwasiueunk/retrieval-augmented-generation-rag-6bb8f7ab2f3c
03:08		Human-AI Interaction Playbook https://medium.com/@yujinisgr8/human-ai-interaction-playbook-9b9effd049c7
02:11		Advanced RAG Techniques — The LLMCompiler Approach https://gabrielgomes61320.medium.com/advanced-rag-techniques-the-llmcompiler-approach-ed3ecf1ddb43
01:24		Unleash the Full Potential of AI with ModelFusion: Your Ultimate AI Companion https://medium.com/@modelfusion/unleash-the-full-potential-of-ai-with-modelfusion-your-ultimate-ai-companion-38422c9eaeeb
01:17		LLMs missing Language https://medium.com/pat-inc/llms-missing-language-5a13bbde4719
00:56		From Emotionless Machines to Intuitive Intelligences: Rethinking AI Cognition https://medium.com/@mbonsign/from-emotionless-machines-to-intuitive-intelligences-rethinking-ai-cognition-9bf7e1e59233
Friday, 2024-08-23
22:38		The Mathematical Essence of Loss Function Design in Deep Neural Networks https://medium.com/autonomous-agents/the-mathematical-essence-of-loss-function-design-in-deep-neural-networks-9dc76b08406c
21:23		Fine-Tuning OpenAI GPT Models: A Comprehensive Step-by-Step Guide https://medium.com/@amosmaru10/fine-tuning-openai-gpt-models-a-comprehensive-step-by-step-guide-0f98623789fb
20:52		Beating OpenAI's structured outputs on cost, accuracy and speed https://www.boundaryml.com/blog/sota-function-calling
20:23		Databricks Generative AI Engineer Associate Certification: Study Guide Part 1 https://medium.com/@chandadipendu/databricks-generative-ai-engineer-associate-certification-study-guide-part-1-70cf3c483085
20:18		Liger-Kernel: Efficient Triton kernels for LLM training https://github.com/linkedin/Liger-Kernel
20:12		AI21 Labs Released Jamba 1.5 Family of Open Models: Jamba 1.5 Mini and Jamba 1.5 Large Redefining Long-Context AI with Unmatched Speed, Quality, and Multilingual Capabilities for Global Enterprises https://www.marktechpost.com/2024/08/23/ai21-labs-released-jamba-1-5-family-of-open-models-jamba-1-5-mini-and-jamba-1-5-large-redefining-long-context-ai-with-unmatched-speed-quality-and-multilingual-capabilities-for-global-enterprises/
20:08		Getting Started with Local AI/LLMs in Three Easy Steps https://runeberg.medium.com/getting-started-with-local-ai-llms-in-three-easy-steps-bddebcf26570
20:01		Simplifying LLM Development: Treat It Like Regular ML https://pub.towardsai.net/simplifying-llm-development-treat-it-like-regular-ml-5b0648c9c938
19:46		Semantic Search Battle: Vertex AI vs. Table Search for shops https://medium.com/@nastiya.levchenko/semantic-search-battle-vertex-ai-vs-table-search-for-shops-df6136f21d66
19:14		Knowledge Diaries: I launched an Open Source Dev Tool! https://medium.com/@seifmamdouh7878/knowledge-diaries-i-launched-an-open-source-dev-tool-3544eefd2595
19:08		Chat with your personal pdf free using Cohere https://medium.com/@vaibhavdahiya/chat-with-your-personal-pdf-free-using-cohere-4c9e7b2561af
19:07		Intent Detection Using LLMs https://blog.gopenai.com/intent-detection-using-llms-ba857f673260
19:01		Integrating AI models in Next.js https://medium.com/@iqrahamzaworks/integrating-ai-models-in-next-js-5423c27aa30e
19:00		Lab #6: Chat with Cloud Diagram Agent (ChatGPT, Langchain & Streamlit) https://ai.gopubby.com/lab-6-chat-with-cloud-diagram-agent-chatgpt-langchain-streamlit-67ac3e211933
18:55		Prompt Caching with Claude 3.5 Sonnet https://medium.com/dair-ai/prompt-caching-with-claude-3-5-sonnet-e59c91eeda9c
18:44		Navigating the Cost Landscape of LLMs in Production: Strategies for Optimization and Informed… https://medium.com/@shivam_23581/navigating-the-cost-landscape-of-llms-in-production-strategies-for-optimization-and-informed-e252d8307209
18:34		This AI Paper by National University of Singapore Introduces A Comprehensive Survey of Language Models for Tabular Data Analysis https://www.marktechpost.com/2024/08/23/this-ai-paper-by-national-university-of-singapore-introduces-a-comprehensive-survey-of-language-models-for-tabular-data-analysis/
18:10		Automating Table of Content Extraction and Filtering in Papers with LlamaIndex https://medium.com/@hborobia/automating-table-of-content-extraction-and-filtering-in-papers-with-llamaindex-7fc6a7cd3aae
18:01		Perplexity AI plans to start running ads in 4th quarter https://www.cnbc.com/2024/08/22/perplexity-ai-plans-to-start-running-search-ads-in-fourth-quarter.html
18:01		👁️ Agent-ception: When Agents Are Creating Agents https://www.llmwatch.com/p/agent-ception-when-agents-are-creating

Original data from HuggingFace, OpenCompass and various public git repos.