LLM News and Articles
Sunday, 2024-07-14 | ||||
13:37 | CURLoRA: Stable LLM Fine-Tuning and Catastrophic Forgetting Mitigation https://zenodo.org/records/12740116 | |||
13:14 | Deciphering AI: Leveraging Sparse Autoencoders for Enhanced Model Interpretability https://medium.com/@amisha.p/deciphering-ai-leveraging-sparse-autoencoders-for-enhanced-model-interpretability-b813c5b47a1c | |||
13:07 | ChatGPT Effective promting techniques https://medium.com/@muhammad.a0625/chatgpt-effective-promting-techniques-f60e10275240 | |||
13:01 | Craft Assistant on Commercialized Religion https://medium.com/extratone/craft-assistant-protestant-7e2b53e354ce | |||
12:25 | The Impact of Generative AI on Human Creativity: A Writer’s Perspective https://tristwolff.medium.com/the-impact-of-generative-ai-on-human-creativity-a-writers-perspective-a68396d4f613 | |||
12:24 | What is Retrieval-Augmented Generation (RAG)? Where is it used? https://medium.com/@enesarslan./retrieval-augmented-generation-rag-nedir-nerelerde-kullan%C4%B1l%C4%B1r-ad7fe20438d6 | |||
12:20 | A beginner's guide to LLM quantization and testing https://www.theregister.com/2024/07/14/quantization_llm_feature/ | |||
12:14 | GraphRAG(Graphs + Retrieval Augmented Generation): Unlocking LLM Discovery on Narrative Private… https://medium.com/@vinodkumargr/graphrag-graphs-retreival-augmented-generation-unlocking-llm-discovery-on-narrative-private-1bf977dadcdd | |||
11:49 | Create Markdown from a text prompt using Anthropic’s API https://ai.gopubby.com/create-markdown-from-a-text-prompt-using-anthropics-api-ed81691a2e41 | |||
11:38 | Sizing Large Language Models: A T-Shirt Size Approach https://medium.com/@amir36/sizing-large-language-models-a-t-shirt-size-approach-efb25a3ff343 | |||
11:20 | Three layers of context for useful AI https://medium.com/@pcbje/three-layers-of-context-for-useful-ai-f533276ca50a | |||
11:15 | Arena Learning: Transforming Post-Training of Large Language Models with AI-Powered Simulated Battles for Enhanced Efficiency and Performance in Natural Language Processing https://www.marktechpost.com/2024/07/14/arena-learning-transforming-post-training-of-large-language-models-with-ai-powered-simulated-battles-for-enhanced-efficiency-and-performance-in-natural-language-processing/ | |||
11:05 | Learn Custom LLMs: Tutorial to Develop an LLM for Translating English to Punjabi https://medium.com/@sandha.iitr/learn-custom-llms-tutorial-to-develop-an-llm-for-translating-english-to-punjabi-24da62b296d7 | |||
11:00 | Metron: A Holistic AI Framework for Evaluating User-Facing Performance in LLM Inference Systems https://www.marktechpost.com/2024/07/14/metron-a-holistic-ai-framework-for-evaluating-user-facing-performance-in-llm-inference-systems/ | |||
10:53 | AI paper this in this week! https://medium.com/@teerapong.ha62/ai-paper-this-in-this-week-5fa1ecd2c381 | |||
09:22 | OpenAI whistleblowers ask SEC to investigate alleged restrictive NDAs https://www.reuters.com/technology/openai-whistleblowers-ask-sec-investigate-restrictive-non-disclosure-agreements-2024-07-13/ | |||
09:15 | Optimizing Large Language Models (LLMs) on CPUs: Techniques for Enhanced Inference and Efficiency https://www.marktechpost.com/2024/07/14/optimizing-large-language-models-llms-on-cpus-techniques-for-enhanced-inference-and-efficiency/ | |||
09:02 | Retrieval augmented Agents RAA // Advanced RAG + Agents == Better Agents https://sbagency.medium.com/retrieval-augmented-agents-raa-advanced-rag-agents-better-agents-922ecde75373 | |||
08:38 | Understanding LLM — Large Language Models https://just-merwan.medium.com/understanding-llm-large-language-models-fed8b5a40301 | |||
08:24 | Why Meta-Llama-3–8B Runs Faster on GPU vs. CPU: A Deep Dive into Gaianet Node Performance https://medium.com/@zulfanbaswedan/why-the-gaianet-node-llm-mode-meta-llama-3-8b-runs-faster-on-the-gpu-compared-to-running-on-the-cpu-baed3da64379 | |||
08:21 | Dialogue with Claude 3 https://medium.com/@chatc3po/dialogue-with-claude-3-ab74d2913b49 | |||
07:42 | Practical GenAI https://medium.com/@sudhanshu.bhargav/practical-genai-47decc717e0e | |||
07:39 | Advanced RAG: Embedded Tables https://medium.com/@sudhanshu.bhargav/advanced-rag-embedded-tables-c29ab5e3bd5b | |||
07:29 | The Transformative Impact of Large Language Models on DevOps https://medium.com/@naseefcse/the-transformative-impact-of-large-language-models-on-devops-ca2157c698d2 | |||
07:15 | FBI-LLM (Fully BInarized Large Language Model): An AI Framework Using Autoregressive Distillation for 1-bit Weight Binarization of LLMs from Scratch https://www.marktechpost.com/2024/07/14/fbi-llm-fully-binarized-large-language-model-an-ai-framework-using-autoregressive-distillation-for-1-bit-weight-binarization-of-llms-from-scratch/ | |||
07:00 | Unlocking the Power of Large Language Models: Parameter-Efficient Fine-Tuning Advance Techniques… https://medium.com/@vkmauryavk/unlocking-the-power-of-large-language-models-parameter-efficient-fine-tuning-advance-techniques-4815d0e98b9c | |||
06:47 | RAG: Prototype to Production https://medium.com/@sudhanshu.bhargav/rag-prototype-to-production-14ef4f4dab90 | |||
06:24 | Enhancing LLM Reliability: The Lookback Lens Approach to Hallucination Detection https://www.marktechpost.com/2024/07/13/enhancing-llm-reliability-the-lookback-lens-approach-to-hallucination-detection/ | |||
05:59 | A study on Attention mechanism https://medium.com/perceptronai/a-study-on-attention-mechanism-7d199cf783b6 | |||
05:19 | Let’s explore ScrapeGraphAI https://medium.com/@minhle_0210/lets-explore-scrapegraphai-cf697640fe1b | |||
05:15 | Korvus: An All-in-One Open-Source RAG (Retrieval-Augmented Generation) Pipeline Built for Postgres https://www.marktechpost.com/2024/07/13/korvus-an-all-in-one-open-source-rag-retrieval-augmented-generation-pipeline-built-for-postgres/ | |||
04:22 | Mooncake Paper on LLM Serving https://medium.com/@zagfox/mooncake-paper-on-llm-serving-27a1385ec420 | |||
03:51 | Q-GaLore Released: A Memory-Efficient Training Approach for Pre-Training and Fine-Tuning Machine Learning Models https://www.marktechpost.com/2024/07/13/q-galore-released-a-memory-efficient-training-approach-for-pre-training-and-fine-tuning-machine-learning-models/ | |||
03:06 | Speculative RAG: enhancing RAG with multiple drafts generation and verification https://medium.com/@techsachin/speculative-rag-enhancing-rag-with-multiple-drafts-generation-and-verification-8a1db886aa25 | |||
01:50 | The Illusion of Transparency: Why Big AI Companies Will Never Offer Uncensored AI Models https://medium.com/@monty.zumas1/the-illusion-of-transparency-why-big-ai-companies-will-never-offer-uncensored-ai-models-3cd9cc64fe97 | |||
01:49 | The lean machine: crafting production-grade user intent detection and content moderation AI with… https://blog.cubed.run/the-lean-machine-crafting-production-grade-user-intent-detection-and-content-moderation-ai-with-d261d9f28f2a | |||
01:30 | 5 Levels in AI by OpenAI: A Roadmap to Human-Level Problem Solving Capabilities https://www.marktechpost.com/2024/07/13/5-levels-in-ai-by-openai-a-roadmap-to-human-level-problem-solving-capabilities/ | |||
01:08 | Coffee Time Papers: Mixture of a Million Experts https://medium.com/@weidagang/coffee-time-papers-mixture-of-a-million-experts-fec662e9d115 | |||
01:05 | Effective Practices for Mocking LLM Responses During the Software Development Lifecycle https://medium.com/@vuongngo/effective-practices-for-mocking-llm-responses-during-the-software-development-lifecycle-73f726c3f994 | |||
01:04 | The Dawn of a New Era in AI: NVIDIA’s Megatron-Turing NLG Redefines Language Processing https://medium.com/@vdwayne/the-dawn-of-a-new-era-in-ai-nvidias-megatron-turing-nlg-redefines-language-processing-5fd0adce4550 | |||
Saturday, 2024-07-13 | ||||
22:24 | Natural Language Processing Glossary (Part I) https://lzhangstat.medium.com/natural-language-processing-glossary-part-i-8ddb0cff08ff | |||
22:05 | It's an open secret that OpenAI is trying to IPO soon https://twitter.com/deliprao/status/1811817326599102592 | |||
21:37 | [DE] Comparison of the Most Significant Large Language Models (LLMs) in July 2024 https://medium.com/@TheAIQueenDC/de-vergleich-der-bedeutendsten-large-language-models-llms-im-juli-2024-e8e40695d102 | |||
21:36 | How Have Pre-Training Datasets for Large Language Models Evolved? https://medium.com/@jelkhoury880/how-have-pre-training-datasets-for-large-language-models-evolved-13d74c01f8e8 | |||
21:13 | THE LLM SHOWDOWN IN MOUNTAIN VIEW https://medium.com/@mikec.chrabaszcz/the-llm-showdown-in-mountain-view-cfa53106de49 | |||
20:56 | Let’s Build a Sample Chat Agent with Python and LangChain Part One 1 (Data to JSON) https://medium.com/@ahmedtammaa101_24052/lets-build-a-sample-chat-agent-with-python-and-langchain-part-one-1-data-to-json-e2e8b6017873 | |||
20:34 | To Code or Not To Code https://medium.com/@naveen.xavier/to-code-or-not-to-code-08cc024f67e2 | |||
19:49 | AI tools for Design & Verification https://medium.com/@shivamkatiyar274/ai-tools-for-design-verification-cc3507253544 | |||
19:46 | OpenAI Researcher Says He Quit When He Realized the Upsetting Truth https://futurism.com/openai-researcher-quit-realized-upsetting-truth | |||
19:22 | How to use Mixture-of-Agents in your favorite Application https://medium.com/silicon-and-synapses/mixture-of-agents-supercharging-open-source-language-models-behind-a-familiar-api-825e4f8aa4d9 | |||
18:55 | Running LLM Models Locally: A Secure and Private Option for AI https://medium.com/@goofylucilo/running-llm-models-locally-a-secure-and-private-option-for-ai-e8971e27e835 | |||
18:45 | Three Practical Challenges of RAG and Their Mitigation Ideas https://ai.gopubby.com/three-practical-challenges-of-rag-and-their-mitigation-ideas-5cc8e6dd7e30 | |||
18:42 | NER, identifying names in textual data: My studies in spaCy and NLP, Part 5 https://medium.com/@surreauxpp/ner-identificando-nomes-em-dados-textuais-meus-estudos-em-spacy-e-nlp-parte-5-5bc0c1f73180 | |||
18:28 | What is an LLM? https://medium.com/illumination/what-is-an-llm-be1c2150bbae | |||
18:21 | Large Language Model: from pretrained to instructed one. https://ivan-sur.medium.com/large-language-model-from-pretrained-to-instructed-one-efb141d55284 | |||
18:15 | Understanding and Mitigating Hallucinations in Large Language Models (LLMs) https://medium.com/@asimsultan2/understanding-and-mitigating-hallucinations-in-large-language-models-llms-30d23852aae6 | |||
17:51 | Whistleblowers accuse OpenAI of 'illegally restrictive' NDAs https://techcrunch.com/2024/07/13/whistleblowers-accuse-openai-of-illegally-restrictive-ndas/ | |||
17:51 | QuickRead Mixture of Agents: Achieving State-of-the-Art Performance with Collaborative LLMs https://vishwanathkamath.medium.com/quickread-mixture-of-agents-achieving-state-of-the-art-performance-with-collaborative-llms-8556545f76f2 | |||
17:42 | Exploring DoRA: Improving on LoRA’s Parameter-Efficient Fine-Tuning https://medium.com/@edmond.po/exploring-dora-improving-on-loras-parameter-efficient-fine-tuning-d72edc045f64 | |||
17:38 | ✨QuickRead✨ Enhancing Retrieval-Augmented Generation: Exploring Modular RAG Innovations https://vishwanathkamath.medium.com/quickread-enhancing-retrieval-augmented-generation-exploring-modular-rag-innovations-201f6c1f1c98 | |||
17:20 | Latest Types of RAG https://medium.com/@alaa.sayed.engineer/latest-types-of-rag-ccd5e12fbeff | |||
17:02 | OpenAI anticipates decrease in AI model costs amid adoption surge https://venturebeat.com/ai/openai-anticipates-decrease-in-ai-model-costs-amid-adoption-surge/ | |||
16:58 | Inside Prompt Engineering: Demystifying Technical Intricacies https://medium.com/@sreenith.r/inside-prompt-engineering-demystifying-technical-intricacies-36296d0dfad3 | |||
16:50 | Running LLMs Locally in Salesforce Experience Cloud using picoLLM Inference Engine SDK https://akutishevsky.medium.com/running-llms-locally-in-salesforce-experience-cloud-using-picollm-inference-engine-sdk-762d0e11450e | |||
16:35 | Breaking News: Meta Unveils MobileLLM, a Sub-Billion Parameter Language Model Transforming… https://blog.stackademic.com/breaking-news-meta-unveils-mobilellm-a-sub-billion-parameter-language-model-transforming-220a21cd0c1d | |||
15:35 | Enhancing SQL Generation in Large Language Models with Graph Neural Networks https://medium.com/@frankmorales_91352/enhancing-sql-generation-in-large-language-models-with-graph-neural-networks-fa4958e9a312 | |||
14:38 | RAG: Key Aspects of Performance: Metrics and Measurement https://sunila-gollapudi.medium.com/rag-key-aspects-for-performance-metrics-and-measurement-c41b1aa18499 | |||
14:10 | Caching Out with Gemini: Making AI Chat Less Taxing (on Your Wallet) https://medium.com/@wasimmajidmalik/caching-out-with-gemini-making-ai-chat-less-taxing-on-your-wallet-212f40bb1a46 | |||
14:07 | My Attempt at a Tree-View Hierarchical Summarizer to Read with AI https://medium.com/@BitsOfChris/my-attempt-at-a-tree-view-hierarchical-summarizer-to-read-with-ai-2ae2423d7140 | |||
14:01 | Top Important LLMs Papers for the Week from 01/07 to 07/07 https://pub.towardsai.net/top-important-llms-papers-for-the-week-from-01-07-to-07-07-59f6732fab8e | |||
13:59 | Whose fault is it? Measuring Incoherence of Large Language Models https://medium.com/@federicoerrica/whose-fault-is-it-measuring-incoherence-of-large-language-models-9da21b8f2459 | |||
13:25 | Why you should outsource your agentic infrastructure, but own your cognitive architecture https://blog.langchain.dev/why-you-should-outsource-your-agentic-infrastructure-but-own-your-cognitive-architecture/ | |||
13:13 | The Evolution of Large Language Models on OpenAI models' example https://medium.com/@rusanger/the-evolution-of-large-language-models-on-openai-models-example-cf4930c76142 | |||
12:34 | Building blocks of Gen AI Applications in LLM/SLM https://towardsdev.com/building-blocks-of-gen-ai-applications-in-llm-slm-78ca1bfca2c7 | |||
12:26 | CSV Analysis Visualization with LLMs https://medium.com/@omjishukla/csv-analysis-visualization-with-llms-d9acf5431dc3 | |||
12:24 | Classifying Wikipedia articles using GPT 3.5 Turbo https://medium.com/@spriya2809/classifying-wikipedia-articles-using-gpt-3-5-turbo-7ec85a2f1d52 | |||
11:29 | MHA vs MQA vs GQA vs MLA https://medium.com/@zaiinn440/mha-vs-mqa-vs-gqa-vs-mla-c6cf8285bbec | |||
11:20 | Linear Rope vs NTK vs YaRN vs CoPE https://medium.com/@zaiinn440/linear-rope-vs-ntk-vs-yarn-vs-cope-d33587ddfd35 | |||
10:32 | The Ultimate Guide to Getting Started with Bloom LLM https://medium.com/@krishani_70219/the-ultimate-guide-to-getting-started-with-bloom-llm-067c4ed57857 | |||
10:06 | Show HN: Math.bot – Free, instant math problem solver powered by GPT-4 https://math.bot | |||
10:02 | Comparative Analysis of Fine-Tuning LLaMA 2 and LLaMA 3 Models https://pub.towardsai.net/comparative-analysis-of-fine-tuning-llama-2-and-llama-3-models-b476a06c7879 | |||
09:45 | Unveiling the Magic: How Large Language Models Work https://medium.com/@mr_haseeb/unveiling-the-magic-how-large-language-models-work-300ea11b73b9 | |||
09:33 | Artificial Intelligence: We Pinned Great Hopes on It, but Are the Expectations Realistic? https://medium.com/@seliskacmaz1/yapay-zeka-b%C3%BCy%C3%BCk-umutlar-ba%C4%9Flad%C4%B1k-ama-beklentiler-ger%C3%A7ek%C3%A7i-mi-dfb2d35cfe44 | |||
09:32 | What is Einstein Trust Layer? https://medium.com/@khushis287/what-is-einstein-trust-layer-ba49bb9d0836 | |||
09:15 | Researchers at Stanford Introduces In-Context Vectors (ICV): A Scalable and Efficient AI Approach for Fine-Tuning Large Language Models https://www.marktechpost.com/2024/07/13/researchers-at-stanford-introduces-in-context-vectors-icv-a-scalable-and-efficient-ai-approach-for-fine-tuning-large-language-models/ | |||
09:10 | Ex-OpenAI staff call for "right to warn" about AI risks without retaliation https://arstechnica.com/information-technology/2024/06/ex-openai-staff-call-for-right-to-warn-about-ai-risks-without-retaliation/ | |||
08:43 | Direct Documentation I: A Look Inside a Source Transmission https://mindtripblog.medium.com/direct-documentation-i-a-look-inside-a-source-transmission-c52a73d9a1a9 | |||
08:22 | Understanding LLM Routers: A Magical Mail Sorting System for Robots https://medium.com/@trinad536/understanding-llm-routers-a-magical-mail-sorting-system-for-robots-034fba878adf | |||
07:41 | Beyond Chatbots: How LLMs Are Reshaping Industrial https://klaothongchan.medium.com/beyond-chatbots-how-llms-are-reshaping-industrial-dc47f446e19f | |||
07:31 | Use agents to write release note in Agent ChatRoom https://medium.com/@g2260578356/write-release-note-with-agents-in-agent-chatroom-1e80521f603a | |||
07:15 | Can LLMs Help Accelerate the Discovery of Data-Driven Scientific Hypotheses? Meet DiscoveryBench: A Comprehensive LLM Benchmark that Formalizes the Multi-Step Process of Data-Driven Discovery https://www.marktechpost.com/2024/07/13/can-llms-help-accelerate-the-discovery-of-data-driven-scientific-hypotheses-meet-discoverybench-a-comprehensive-llm-benchmark-that-formalizes-the-multi-step-process-of-data-driven-discovery/ | |||
07:14 | Outlines: Make LLM structured outputs controllable and improve the stability of LLM applications https://ullyer.medium.com/outlines-make-llm-structured-outputs-controllable-and-improve-the-stability-of-llm-applications-584ae9db3789 | |||
07:01 | The Concern of Privacy with LLMs https://pub.towardsai.net/the-concern-of-privacy-with-llms-2630828dda67 | |||
06:49 | OpenAI Scale Ranks Progress Toward 'Human-Level' Problem Solving https://www.bloomberg.com/news/articles/2024-07-11/openai-sets-levels-to-track-progress-toward-superintelligent-ai | |||
05:24 | Analyzing Trump - Biden debate using AI — Claude Sonnet 3.5 https://medium.com/@omkamal/analyzing-trump-biden-debate-using-ai-claude-sonnet-3-5-ee12a2a4e320 | |||
05:20 | Thoughts on LangChain https://seniorbrogrammer.medium.com/thoughts-on-langchain-67c2346139b5 | |||
05:00 | Building AI Applications with ChatGPT APIs by Martin Yanev https://medium.com/@varmabh183/building-ai-applications-with-chatgpt-apis-by-martin-yanev-c87c533d8c2d | |||
04:36 | The Agentic Concept in LLM-based Application Development https://medium.com/@pankaj_pandey/the-agentic-concept-in-llm-based-application-development-48beea5cc00d | |||
04:09 | Azure OpenAI down in multiple regions https://azure.status.microsoft/en-us/status | |||
03:23 | Visualizing Low-Rank Adaptation (LoRA) https://pub.towardsai.net/visualizing-low-rank-adaptation-lora-4526726279cb |