Close Menu
    What's Hot

    Small language models

    April 16, 2024

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    April 1, 2024

    The developer’s guide to open source LLMs and generative AI

    March 19, 2024
    Facebook X (Twitter) Instagram
    Facebook Instagram LinkedIn
    AI VentunoAI Ventuno
    • Home
    • AI Giants
      1. Meta (Facebook)
      2. Google
      3. Amazon
      4. View All

      Llama 2: Open Foundation and Fine-Tuned Chat Models

      April 1, 2024

      Introducing Gemini: Google’s AI Gets a Fresh Identity!

      February 10, 2024

      Google’s Bard chatbot gets the Gemini Pro update globally

      February 2, 2024

      Google’s Lumiere brings AI video closer to real than unreal.

      January 28, 2024

      Google Introduces Gemini, a Cutting-Edge Language Model Set

      January 10, 2024
      8.9

      DJI Avata Review: Immersive FPV Flying For Drone Enthusiasts

      January 15, 2021
      8.9

      Bose QuietComfort Earbuds II: Noise-Cancellation Kings Reviewed

      January 15, 2021

      Thousands Of PC Games Discounted In New Black Friday Sale

      January 15, 2021

      Take Your Photography to The Next Level with This Drone

      January 14, 2021

      Will Using a VPN on Phone Helps Protect You from Ransomware?

      January 14, 2021

      Popular New Xbox Game Pass Game Being Review Bombed With “0s”

      January 14, 2021

      Google Says Surveillance Vendor Targeted Samsung Phones

      January 14, 2021

      Why Are iPhones More Expensive Than Android Phones?

      January 14, 2021
    • Papers
    • Tools
      • Prompts
    • About us
    AI VentunoAI Ventuno
    Home » The developer’s guide to open source LLMs and generative AI
    AI

    The developer’s guide to open source LLMs and generative AI

    ai_adminBy ai_adminMarch 19, 2024No Comments5 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    open source LLM
    Open source LLM
    Share
    Facebook Twitter LinkedIn Pinterest Email

    We all know that AI is changing the world. But what happens when you combine AI with the power of open source?

    Over the past year, there has been an explosion of open source generative AI projects on GitHub with more than 8,000. They range from commercially backed large language models (LLMs) like Meta’s LLaMA to experimental open source LLM applications.

    These projects offer many benefits to open source developers and the machine learning community and are a great way to start building new AI-powered features and applications.

    Open source vs. closed source LLMs

    We are familiar with LLMs: neural network-based language models trained on vast quantities of data to mimic human behavior by performing various downstream tasks, like question answering, translation, and summarization. LLMs have disrupted the world with the introduction of tools like ChatGPT and GitHub Copilot.

    Open source LLMs differ from their closed counterparts regarding the source code (and sometimes other components, as well). With closed LLMs, the source code—which explains how the model is structured and how the training algorithms work—isn’t published.

    “When you’re doing research, you want access to the source code so you can fine-tune some of the pieces of the algorithm itself,” says Alireza Goudarzi, a senior researcher of machine learning at GitHub. “With closed models, it’s harder to do that.”

    Open source LLMs help the industry at large: because so many people contribute, they can be developed faster than closed models. They can also be more effective for edge cases or specific applications , can contain bespoke security controls, and can run on local models.

    But closed models—often built by larger companies—have advantages, too. For one, they’re embedded in systems with filters for biased information, inappropriate language, and other questionable content. They also frequently have security measures baked in. Plus, they don’t need fine-tuning, a specialized skill set requiring dedicated people and teams.

    “Closed, off-the-shelf LLMs are high quality,” notes Eddie Aftandilian, a principal researcher at GitHub. “They’re often far more accessible to the average developer.”

    How to fine-tune open source LLMs

    Fine-tuning open source models is done on the large cloud provider hosted by the LLM, such as AWS, Google Cloud, or Microsoft Azure. Fine-tuning allows you to optimize the model by creating more advanced language interactions in applications like virtual assistants and chatbots. This can improve model accuracy anywhere from five to 10 percent.

    As for best practices? is recommended being careful about data sampling and being clear about the specific needs of the application you’re trying to build. The curated data should match your needs exactly since the models are pre-trained on anything you can find online.

    You need to emphasize certain things related to your objectives, let’s say you’re trying to create a model to process TV and smart home commands. You’d want to preselect your data to have more of a command form.

    Choosing your model

    Which open source model is best for you? Experts recommend focusing on models’ performance benchmarks against different scenarios, such as reasoning, domain-specific understanding of law or science, and linguistic comprehension.

    However, don’t assume that the benchmark results are correct or meaningful.

    Rather, ask yourself, how good is this model at a particular task? It’s pretty easy to let benchmarks seep into the training set due to lack of deep understanding, skewed performance, or limited generalization.

    When this happens, the model is trained on its own evaluation data. Which would make it look better than it should.

    You should also consider how much the model costs to run and its overall latency rates. A large model, for instance, might be exceptionally powerful. But if it takes minutes to generate responses versus seconds, there may be better options. 

    Open source LLMs available today

    There are several open source commercially licensed models available.

    OpenLLaMAAn open source reproduction of Meta’s LLaMA model, developed by Berkeley AI Research, this project provides permissively licensed models with 3B, 7B, and 13B parameters, and is trained on one trillion tokens. OpenLLaMA models have been evaluated on tasks using the lm-evaluation-harness and perform comparably to the original LLaMA and GPT-J across most tasks. But because of the tokenizer’s configuration, the models aren’t great for code generation tasks with empty spaces.
    Falcon-SeriesDeveloped by Abu Dhabi’s Technology Innovation Institute (TII), Falcon-Series consists of two models: Falcon-40B and Falcon-7B. The series has a unique training data pipeline that extracts content with deduplication and filtering from web data. The models also use multi-query attention, which improves the scalability of inference. Falcon can generate human-like text, translate languages, and answer questions.
    MPT-SeriesA set of decoder-only large language models, MPT-Series models have been trained on one trillion tokens spanning code, natural language text, and scientific text. Developed by MosaicML, these models come in two specific versions: MPT-Instruct, designed to be task-oriented, and MPT-Chat, which provides a conversational experience. It’s most suitable for virtual assistants, chatbots, and other interactive user engagement tools.
    FastChat-T5A large transformer model with three billion parameters, FastChat-T5 is a chatbot model developed by the FastChat team through fine-tuning the Flan-T5-XL model. Trained on 70,000 user-shared conversations, it generates responses to user inputs autoregressively and is primarily for commercial applications. It’s a strong fit for applications that need language understanding, like virtual assistants, customer support systems, and interactive platforms. 

    The future of open source LLMs

    There’s been a scurry of activity in the open source LLM world.

    Developers are very active on some of these open source models, they can optimize performance, explore new use cases, and push for new algorithms and more efficient data.

    And that’s just the start.

    Meta’s LLaMA model is now available for commercial use, allowing businesses to create their own AI solutions.

    Originally, the machine learning community thought that more advanced generative AI would require more advanced algorithms. But that hasn’t been the case.

    The simple algorithm actually stays the same, regardless of how much it can do. Scaling is the only change, which is completely mind-blowing.

    Who knows how open source LLMs will revolutionize the developer landscape.

    We are excited that we’re seeing so many open source LLMs now. When developers start building with these models, the possibilities are endless.

    LLM Open Source
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    LLM

    Small language models

    April 16, 2024
    Facebook

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    April 1, 2024
    Featured

    Google Introduces Gemini, a Cutting-Edge Language Model Set

    January 10, 2024
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

    January 23, 202468 Views

    Single-View 3D Human Digitalization with Large Reconstruction Models

    January 23, 202446 Views

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    April 1, 202436 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews
    85
    Featured

    Pico 4 Review: Should You Actually Buy One Instead Of Quest 2?

    ai_adminJanuary 15, 2021
    8.1
    Uncategorized

    A Review of the Venus Optics Argus 18mm f/0.95 MFT APO Lens

    ai_adminJanuary 15, 2021
    8.9
    Editor's Picks

    DJI Avata Review: Immersive FPV Flying For Drone Enthusiasts

    ai_adminJanuary 15, 2021
    Our Picks

    Small language models

    April 16, 2024

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    April 1, 2024

    The developer’s guide to open source LLMs and generative AI

    March 19, 2024
    Most Popular

    Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

    January 23, 202468 Views

    Single-View 3D Human Digitalization with Large Reconstruction Models

    January 23, 202446 Views

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    April 1, 202436 Views
    Latest Papers

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    April 1, 2024

    Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs

    January 23, 2024

    Single-View 3D Human Digitalization with Large Reconstruction Models

    January 23, 2024
    AI Ventuno
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • Technology
    • Language Models
    • Tools
    • About us
    © 2025 AI Ventuno. Designed by Ventuno Studio.

    Type above and press Enter to search. Press Esc to cancel.