LLM News and Articles

1 90 of 100

Friday, 2024-07-19
16:21		Remember That? How Semantic Caching Supercharges AI Assistants https://medium.com/@ganeshkannappan/remember-that-how-semantic-caching-supercharges-ai-assistants-246cbeb14af1
16:14		Rapid protein evolution by few-shot learning with a protein language model https://www.biorxiv.org/content/10.1101/2024.07.17.604015v1
16:12		GPT-4o-mini is out! Let’s see how it performs https://medium.com/@tihomir.manushev/gpt-4o-mini-is-out-lets-see-how-it-performs-c459fbd016cb
16:06		Considerations for building an AI-driven for Document Search and Retrieval System https://medium.com/@paul.ekwere/considerations-for-building-an-ai-driven-for-document-search-and-retrieval-system-88d7b20e976e
15:56		LangChain in Chains #30: Document Chatbots https://medium.com/@okanyenigun/langchain-in-chains-30-document-chatbots-3918bac29a26
15:51		Choosing the Best Structured Output Parser Approach | 3 Ways To Generate Structured Output https://kaushikshakkari.medium.com/choosing-the-best-structured-output-parser-approach-3-ways-to-generate-structured-output-d9686482729c
15:51		Choosing the Best Structured Output Parser Approach | 3 Ways To Generate Structured Output https://blog.gopenai.com/choosing-the-best-structured-output-parser-approach-3-ways-to-generate-structured-output-d9686482729c
15:14		Understanding Large Language Models : Chapter 1- Transformers and Transformer block. https://viraajkadam.medium.com/understanding-large-language-models-chapter-1-transformers-and-transformer-block-a6d65462ddfe
15:12		Meta won't release its multimodal Llama AI model in the EU https://www.theverge.com/2024/7/18/24201041/meta-multimodal-llama-ai-model-launch-eu-regulations
15:04		Fine-tuning LLMs On Educational Datasets- An Overview https://medium.com/@balajiwork01/fine-tuning-llms-on-educational-datasets-an-overview-54d24b55bde7
15:02		Apple Open-Sources LLM DCLM-7B https://huggingface.co/apple/DCLM-7B
14:49		Enhancing Contextual Understanding with ReALM: A Novel Method for Conversational Agents https://vishwanathkamath.medium.com/enhancing-contextual-understanding-with-realm-a-novel-method-for-conversational-agents-75a3e05cf8a6
14:38		RAG-boosted with Knowledge Graph https://medium.com/totalenergies-digital-factory/rag-boosted-with-knowledge-graph-60b5f4c5afcb
14:34		Massive Windows Outage Causes Global Chaos https://autoblogs.medium.com/massive-windows-outage-causes-global-chaos-cf7eb351cc3d
14:31		Have you ever thought that Artificial Intelligence was some kind of black magic? https://andreabelvedere.medium.com/have-you-ever-thought-that-artificial-intelligence-was-some-kind-of-black-magic-1b7e05591b0a
14:18		Bridging Two Worlds: How to Unite Symbolic and Connectionist AI for the Future of LLM-Empowered… https://medium.com/@wwzzyy9982/bridging-two-worlds-how-to-unite-symbolic-and-connectionist-ai-for-the-future-of-llm-empowered-5d6ec95ceccd
14:02		AI Hallucinations https://pub.towardsai.net/ai-hallucinations-6d0091726b86
13:54		Worldwide IT Blackout https://autoblogs.medium.com/worldwide-it-blackout-e7d4643a1915
13:51		Multimodal Retrieval Augmented Generation for Sustainable Finance — With Code https://medium.com/artificial-corner/multimodal-retrieval-augmented-generation-for-sustainable-finance-with-code-5a910f3b666c
12:46		Optimizing LLMs with RLHF https://medium.com/feedback-intelligence/optimizing-llms-with-rlhf-c1ebc7d998ad
12:15		Fine-Tuning an LLM vs. RAG Approach https://medium.com/@way2mhemanth/fine-tuning-an-llm-vs-rag-approach-55e82b5e8b71
12:14		Transforming Robot Programming with Language AI https://medium.com/@haitham.bouammar71/transforming-robot-programming-with-language-ai-c8ed974c4df5
12:00		Exploring the Frontiers of Text Segmentation for Better RAG Systems https://medium.com/@hajar.mousannif/exploring-the-frontiers-of-text-segmentation-for-better-rag-systems-1b9f9af49258
11:34		Understand GPT Tokens and Models Comparison https://blog.gopenai.com/understand-gpt-tokens-and-models-comparison-16acc771a01c
11:26		Enhancing Mathematical Reasoning in AI: Integrating LLMs with Monte Carlo Tree Search https://medium.com/@chaudharysahil379/enhancing-mathematical-reasoning-in-ai-integrating-llms-with-monte-carlo-tree-search-b3ef188cba9a
11:16		Self-Attention Mechanism In Transformers https://medium.com/@govindarajpriyanthan/self-attention-mechanism-in-transformers-1e46af9e1afb
11:12		Wolfram LLM Benchmarking Project https://www.wolfram.com/llm-benchmarking-project/
11:12		Refining the Role of GPT-4 LLM in Virtual Internships https://medium.com/@rishitmayank/refining-the-role-of-gpt-4-llm-in-virtual-internships-f4bdb5798ea3
10:43		Transformers in Large Language Model https://medium.com/version-1/transformers-in-large-language-model-2f7d485f50b0
10:26		Literature Review Generation using Llama and Arxiv https://medium.com/@khaoujai/literature-review-generation-using-llama-and-arxiv-8a25aecb3f04
10:19		Simplify LLM Quantization Process for Success https://medium.com/@marketing_novita.ai/simplify-llm-quantization-process-for-success-7126c26633df
10:09		How Structure and Language Choices Impact Prompt Engineering for LLMs https://medium.com/@gprdino/how-structure-and-language-choices-impact-prompt-engineering-for-llms-8dd712421bd9
10:01		LLM as a Service: Your Partner in LM Model Development https://medium.com/@krishani_70219/llm-as-a-service-your-partner-in-lm-model-development-302e2c72c054
09:31		Master LLM Sentiment Analysis: A Simple Guide https://medium.com/@marketing_novita.ai/master-llm-sentiment-analysis-a-simple-guide-a9692001e780
09:15		DotaMath: Advancing LLMs’ Mathematical Reasoning Through Decomposition and Self-Correction https://www.marktechpost.com/2024/07/19/dotamath-advancing-llms-mathematical-reasoning-through-decomposition-and-self-correction/
09:00		OpenAI is releasing a cheaper, smarter model. ChatGPT 4o mini launches today https://www.theverge.com/2024/7/18/24200714/openai-new-cheaper-smarter-model-gpt-4o-mini
09:00		This Survey Paper Presents a Comprehensive Review of LLM-based Text-to-SQL https://www.marktechpost.com/2024/07/19/this-survey-paper-presents-a-comprehensive-review-of-llm-based-text-to-sql/
08:59		The Art of Prompt Engineering https://medium.com/@Datafied/the-art-of-prompt-engineering-4f175f21835a
08:40		How to Build a RAG-Powered Chatbot with Google Gemini and MyScaleDB https://medium.com/@myscale/how-to-build-a-rag-powered-chatbot-with-google-gemini-and-myscaledb-79c0024cd237
08:34		OpenAI Unveils Cost-Effective GPT-4o Mini: A Game-Changer for Developers and Startups https://medium.com/ai-news-nuggets/openai-unveils-cost-effective-gpt-4o-mini-a-game-changer-for-developers-and-startups-10a96bdbd1da
08:33		Dialogue with Claude 7 https://medium.com/@chatc3po/dialogue-with-claude-7-1d43c23e8d17
08:21		Crunching Nonsense: ChatGPT and Data Analysis https://medium.datasquirrel.ai/crunching-nonsense-chatgpt-and-data-analysis-3198f14b5338
08:17		Lessons Learned from Week 3 of the LLM Zoomcamp: Vector Search and Embeddings https://hunglethanh.medium.com/lessons-learned-from-week-3-of-the-llm-zoomcamp-vector-search-and-embeddings-32d3b31dd8b9
08:14		How to Run InternLM2 Locally: A Comprehensive Guide https://towards-agi.medium.com/how-to-run-internlm2-locally-a-comprehensive-guide-e1aad4044956
08:12		How to run LLMs on CPU-based systems https://medium.com/@simeon.emanuilov/how-to-run-llms-on-cpu-based-systems-1623e04a7da5
08:06		How to Run Mistral NeMo 12B Locally: A Comprehensive Guide https://towards-agi.medium.com/how-to-run-mistral-nemo-12b-locally-a-comprehensive-guide-75b74510dbff
07:34		Enhancing Model Capacity with Mixture-of-Experts: The Rise of Mixtral 8x7B https://vishwanathkamath.medium.com/enhancing-model-capacity-with-mixture-of-experts-the-rise-of-mixtral-8x7b-78dfc79cb64a
07:24		Mem0: The Missing Link in Long-Term AI Interactions https://medium.com/@omkamal/mem0-the-missing-link-in-long-term-ai-interactions-4e89e906d30c
07:17		Retrieval-Augmented Thoughts: Revolutionizing Long-Horizon Tasks with Retrieval-Augmented Thoughts https://vishwanathkamath.medium.com/retrieval-augmented-thoughts-revolutionizing-long-horizon-tasks-with-retrieval-augmented-thoughts-500a162b846d
07:10		Leveraging Large Language Models (LLMs) in AWS for Advanced Data Recommendation Systems https://blog.stackademic.com/leveraging-large-language-models-llms-in-aws-for-advanced-data-recommendation-systems-73fbb1db3b7a
07:06		The Rise of the AI LLMs https://medium.com/@Nilabh/the-rise-of-the-ai-llms-644b262e67f9
06:23		Can TTT models beat transformers? Unveiling Learning at testing for the next frontier in AI https://medium.com/@justinliu1205/can-ttt-models-beat-transformers-unveiling-learning-at-testing-for-the-next-frontier-in-ai-b7423065d1b1
06:18		Introducing ELM Turbo: Next-generation Efficient, Decomposable LLMs https://medium.com/sujith-ravi/introducing-elm-turbo-next-generation-efficient-decomposable-llms-a2347bd08676
06:17		A Comprehensive Analysis of LoRA Variants https://medium.com/@atharv6f_47401/a-comprehensive-analysis-of-lora-variants-b0eee98fc9e1
06:12		Build a scalable RAG ingestion pipeline using 74.3% less code https://medium.com/decodingml/build-a-scalable-rag-ingestion-pipeline-using-74-3-less-code-ac50095100d6
05:51		What is GPT-4o mini and what does it mean for Finance and FP&A https://christianmartinezfinancialfox.medium.com/what-is-gpt-4o-mini-and-what-does-it-mean-for-finance-and-fp-a-3576edc11272
05:45		Building an LLM Chatbot with SQL Integration https://medium.com/@raipragya256/building-an-llm-chatbot-with-sql-integration-9ee0c9d3df89
05:41		GPT-40 Mini: Advancing Cost-Efficient Intelligence https://medium.com/@manoranjan.rajguru/gpt-40-mini-advancing-cost-efficient-intelligence-aee5957ff95f
04:48		Understanding the Training of Large Language Models (LLMs) https://medium.com/@siddharthkharche/understanding-the-training-of-large-language-models-llms-6125cf801fdd
04:37		The Z Hypothesis: A Unified Framework for Human and AI Cognition https://medium.com/@greyboi/the-z-hypothesis-a-unified-framework-for-human-and-ai-cognition-f9d823982760
04:16		Mathstral: 7B LLM designed for math reasoning and scientific discovery https://mistral.ai/news/mathstral/
04:16		Deepset-Mxbai-Embed-de-Large-v1 Released: A New Open Source German/English Embedding Model https://www.marktechpost.com/2024/07/18/deepset-mxbai-embed-de-large-v1-released-a-new-open-source-german-english-embedding-model/
04:05		Show HN: ChatGPT Chrome Extension to Keep Temporary Chat Enabled https://github.com/EliseiNicolae/chatgpt-always-temporary-chat-on
02:37		OpenAI Releases GPT-4o Mini — A Cheap and Fast Small Language Model https://generativeai.pub/openai-releases-gpt-4o-mini-a-cheap-and-fast-small-language-model-eb9ecec8c15f
02:28		How is “GPT-4o mini” Game Changer in AI space (Milan’s Outlook) https://medium.com/@itsmybestview/how-is-gpt-4o-mini-game-changer-in-ai-space-milans-outlook-ededfd3a8831
02:25		[Paper Review/KR] MAVIS: Mathematical Visual Instruction Tuning https://medium.com/@halcyon0424/paper-review-kr-mavis-mathematical-visual-instruction-tuning-ea1d51e07867
02:02		GPT-4o mini is significantly smarter and cheaper than GPT-3.5 Turbo https://twitter.com/OpenAIDevs/status/1813990748406317221
01:42		How to Fine-Tune LLM’s for Summarization ?? https://medium.com/@khadkaujjwal47/how-to-fine-tune-llms-for-summarization-0f223a8bf15e
01:17		Challenges of Productionizing RAGs https://sivasathivel-kandasamy.medium.com/challenges-of-productionizing-rags-9545082b38b5
01:15		The Art of AI: Reimagining Artwork Analysis with RAG and LLMs https://medium.com/@alicejeanchoi/the-art-of-ai-reimagining-artwork-analysis-with-rag-and-llms-640e0225421f
00:01		OWASP Top 10 for Large Language Models https://medium.com/@sampratap/owasp-top-10-for-large-language-models-0d8c61ae31ae
Thursday, 2024-07-18
23:35		Multi-model Learning Models https://medium.com/@LiliMeng1/multi-model-learning-models-5c7d2d204c90
23:25		At 15c/million tokens, will GPT 4o Mini be the foundation of Agentic Workflows? https://chrisjanwust.medium.com/at-15c-million-tokens-will-gpt-4o-mini-be-the-foundation-of-agentic-workflows-7fd189138da4
23:21		cloning myself using LoRA https://medium.com/@avikmalladi/cloning-myself-using-lora-5bb69d241337
22:54		LLMs https://medium.com/@akshayhitendrashah/llms-1627909bf766
22:52		From Hype to Reality: How TAS Design’s LLMOps is Reinvigorating Generative AI https://medium.com/@TASDesignGroup/from-hype-to-reality-how-tas-designs-llmops-is-reinvigorating-generative-ai-0202f1bbb92d
22:38		Beyond the Gen AI Hype https://medium.com/@sandeep.bose_6501/beyond-the-gen-ai-hype-b83d5f69df2b
22:31		GPT-3.5 Turbo FINALLY Has A Successor https://medium.com/@impure/gpt-3-5-turbo-finally-has-a-successor-51cb1e2f3507
22:30		OpenAI Launches GPT-4o-Mini https://autoblogs.medium.com/openai-launches-gpt-4o-mini-d7266cb28305
22:15		GPT-4o Mini https://simonwillison.net/2024/Jul/18/gpt-4o-mini/
22:03		GPT-4o Mini — Thoughts, Pricing, and Independent Evaluation https://medium.com/@lars.chr.wiik/gpt-4o-mini-thoughts-pricing-and-independent-evaluation-140d5ab8aed1
21:37		Revolutionizing Fashion E-commerce: My Journey with Generative AI at Fashom https://medium.com/@rdesai2000/revolutionizing-fashion-e-commerce-my-journey-with-generative-ai-at-fashom-ae817b28933c
21:19		Do AI Models Actually Understand Language? https://aarnetalman.medium.com/do-ai-models-actually-understand-language-ce2f4e9a7fb9
21:15		Andrej Karpathy: "LLM model size competition is intensifying backwards" https://twitter.com/karpathy/status/1814038096218083497
20:46		Enhancing Performance with C/C++ Code Execution for Langchain Agents https://itnext.io/enhancing-performance-with-c-c-code-execution-for-langchain-agents-a8974c4000f5
19:43		Production Ready Advanced RAG Optimization with Llama-Index and Qdrant Vector Database https://medium.com/rahasak/production-ready-advanced-rag-optimization-with-llama-index-and-qdrant-vector-database-23ad6427b20a
19:38		How to Accurately Conduct Data Analysis with ChatGPT 4.0 https://jobmill.com.ng/data-analysis-with-chatgpt-4-0/
19:37		Mistral AI is on fire…AI innovation at its peak https://sandar-ali.medium.com/mistral-ai-is-on-fire-ai-innovation-at-its-peak-d78d1dbb86ff
19:14		How Large language Models work? https://ai.plainenglish.io/how-large-language-models-work-ae40b277ff5c
19:10		Large Language Models — Retrieval Augmented Generation (RAG), Part 7 https://medium.com/@linghuang_76674/large-language-models-retrieval-augmented-generation-rag-part-7-87a6e01d6e35
18:54		Mistral AI and NVIDIA Collaborate to Release Mistral NeMo: A 12B Open Language Model Featuring 128k Context Window, Multilingual Capabilities, and Tekken Tokenizer https://www.marktechpost.com/2024/07/18/mistral-ai-and-nvidia-collaborate-to-release-mistral-nemo-a-12b-open-llm-featuring-128k-context-window-multilingual-capabilities-and-tekken-tokenizer/
18:38		Efficiency vs Mediocrity: The Double-Edged Sword of Foundation Models https://medium.com/@andrei.ionut.damian/efficiency-vs-mediocrity-the-double-edged-sword-of-foundation-models-944461c9b036
18:29		RAGS : A bare bones introduction and When you’ll need them https://medium.com/@harshar613/rags-a-bare-bones-introduction-and-when-youll-need-them-b1c81182012b
18:26		Unveiling the Truth: Spotting Hallucinations in LLMs https://blog.stackademic.com/unveiling-the-truth-spotting-hallucinations-in-llms-3aeeffc38815
18:19		Exposing the “magic” of AI / LLMs https://medium.com/@martijn_moret/exposing-the-magic-of-ai-llms-1d35365a45ff
18:17		GPT-4o mini https://cobusgreyling.medium.com/gpt-4o-mini-5dc420aa3715
18:06		OpenAI is too cheap to beat https://generatingconversation.substack.com/p/openai-is-too-cheap-to-beat-redux
18:02		Anatomy of TGI, Text Generation Inference (II) https://medium.com/@martiniglesiasgo/anatomy-of-tgi-text-generation-inference-ii-6aace06c5efb
18:00		Anatomy of TGI for LLM Inference (I) https://medium.com/@martiniglesiasgo/anatomy-of-tgi-for-llm-inference-i-6ac8895d903d
17:55		Together Inference Engine 2.0 with new Turbo and Lite endpoints https://www.together.ai/blog/together-inference-engine-2

Original data from HuggingFace, OpenCompass and various public git repos.

Release v2024072803
