What open-source LLMs or SLMs are you in search of? 38199 in total.

LLM List Based on «GemmoeForCausalLM» LLM Architecture Was this list helpful?

Searching for new models built with the GemmoeForCausalLM architecture? Our directory features a diverse range of small and large language models (SLMs and LLMs) specifically designed using it.
Discover the latest in language model technology, with models ranging in size from 3b to 70b, all utilizing HuggingFace transformers with the ready-to-use GemmoeForCausalLM class. Compare them based on processing power, advanced features, and their unique capabilities tailored for various computational tasks.
Which of these models excel in specific areas and achieve the highest benchmarks? Our comprehensive directory answers these questions, presenting the newest models built with the GemmoeForCausalLM architecture in a clear and concise manner.
For the latest innovations in language modeling, particularly those leveraging the GemmoeForCausalLM architecture, our list is an essential resource. Explore our table below, which showcases both SLMs and LLMs, and find the perfect model that fits your specific needs and tasks. Unlock the potential of 'GemmoeForCausalLM' LLM architecture!
Model Size
Model VRAM
LLM List Based on «GemmoeForCausalLM» LLM Architecture
Loading a list of LLMs...
Here comes the list of the Small and Large Language Models
Model Name Maintainer Size Score VRAM (GB) Quantized License Context Len Likes Downloads Modified Languages Architectures
— Large Language Model
— Adapter
— Code-Generating Model
— Listed on LMSys Arena Bot ELO Rating
— Original Model
— Merged Model
— Instruction-Based Model
— Quantized Model
— Finetuned Model
— Mixture-Of-Experts

LLM Explorer "Score" is the dynamically calculated score depending on the various parameters. Read more...

Table Headers Explained  
  • Name — The title and maintainer account associated with the model.
  • Params — The number of parameters used in the model.
  • Score — The model's score depending on the selected rating (default is the LLM Explorer Score).
  • Likes — The number of "likes" given to the model by users.
  • VRAM — The rough estimate of the GB required for inference.
  • Downloads — The total number of downloads for the model.
  • Quantized — Specifies whether the model is quantized.
  • CodeGen — Specifies whether the model can recognize or infer source code.
  • License — The type of license associated with the model.
  • Languages — The list of languages supported by the model (where specified).
  • Maintainer — The author or maintainer of the model.
  • Architectures — The transformer architecture used in the model.
  • Context Len — The content length supported by the model.
  • Tags — The list of tags specified by the model's maintainer.

Choose another global filter

  All Large Language Models   LMSYS ChatBot Arena ELO   OpenLLM LeaderBoard v1   OpenLLM LeaderBoard v2   Original & Foundation LLMs   OpenCompass LeaderBoard   Recently Added Models   Code Generating Models   Instruction-Based LLMs   Uncensored LLMs   LLMs Fit in 4GB RAM   LLMs Fit in 8GB RAM   LLMs Fit in 12GB RAM   LLMs Fit in 24GB RAM   LLMs Fit in 32GB RAM   GGUF Quantized Models   GPTQ Quantized Models   EXL2 Quantized Models   Fine-Tuned Models   LLMs for Commercial Use   TheBloke's Models   Context Size >16K Tokens   Mixture-Of-Experts Models   Apple's MLX LLMs   Small Language Models
Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241110