The State of Open-Source Language Models

Cindy Khantushig

11 months ago

The State of Open-Source Language Models

Microsoft recently announced the release of its new Orca open-source language model. This blog discusses the release of Orca and the implications of open-source projects for the generative AI space overall.

The Open-Source Landscape

Recently there has been much uproar and debate amongst the machine learning community about the release of open-source models, such as Meta’s LLaMa model, which catalyzed a surge of open-source development akin to last year’s release of Stable Diffusion for image generation. These open-source models are usually smaller and more lightweight than proprietary behemoths like OpenAI’s GPT-4.

Open-source models have given rise to a widespread community of contributors that are developing novel applications for language models. In particular, the open-source community has been swift to pursue device optimizations for ML models, creating highly-efficient LLMs, that, while operating with reduced sophistication, can run on cheap devices. The growth of open-source has led to some concern among industry leaders that open-source models could be used for malicious purposes because they lack the same degree of moderation.

Enter Microsoft Orca

Among the concerns about misuse, prominent AI industry leaders have also sounded the alarm about the risks of potential competition from open-source models. Google was famously quoted as saying ‘we have no moat’ when comparing the progress of open-source to its own proprietary language models. Still, other researchers have suggested there is little reason to worry. One paper called “The False Promise of Imitating Proprietary LLMs” suggested that open-source will have difficulty catching up, and that smaller open-source models are essentially cheap imitations.

The release of Microsoft Orca, however, seems to disrupt some of these counterarguments against open-source. This new open-source language model developed by Microsoft excels on several significant benchmarks. More importantly, there is evidence that Orca can mimic the reasoning capabilities of more advanced models like GPT-4, potentially even learning the same underlying understanding. While it’s too early to tell what open-source developers will do with Orca, it’s clear that Microsoft sees value in the rising tide of open-source research in machine learning.

Bottom Line

The open-source machine learning movement is growing as independent contributors pursue the development of new, optimized approaches to developing and deploying language models. Microsoft’s Orca release suggests that industry leaders see the value of investing in these developments.

See Adam LIVE in our upcoming webinar!

Prompt Engineering for the Enterprise: Accelerating Workflows with Generative AI

Join us for a webinar where Aragon’s analyst, Adam Pease, will explore the process of prompt engineering for large language models within an enterprise context. This session will not only introduce you to the fundamental principles of prompt engineering but also showcase how forward-thinking businesses are already leveraging generative AI to revolutionize their operational efficiency.

On Thursday, July 13th, Adam’s talk will cover:

How businesses can deploy generative AI in a cost-effective way
The critical importance of prompt engineering for successfully leveraging generative AI
Actionable best practices for prompt engineering in different workplace use cases

The State of Open-Source Language Models

The State of Open-Source Language Models

The Open-Source Landscape

Enter Microsoft Orca

Bottom Line

See Adam LIVE in our upcoming webinar!

Prompt Engineering for the Enterprise: Accelerating Workflows with Generative AI

Prompt Engineering for the Enterprise: Accelerating Workflows with Generative AI

This blog on is part of the Content AI blog series by Aragon Research’s Analyst, Adam Pease.

Missed the previous installments? Catch up here:

Blog 1: RunwayML Foreshadows the Future of Content Creation

Blog 2: NVIDIA Enters the Text-to-Image Fray

Blog 3: Will OpenAI’s New Chatbot Challenge Legacy Search Engines?

Blog 4: Adobe Stock Accepts Generative Content and Meets Backlash

Blog 5: OpenAI Makes a Move for 3D Generative Content with Point-E

Blog 6: ChatGPT and the Problem of Detecting AI-Generated Content

Blog 7: Content AI: Voice AI Takes a Step Forward

Blog 8: AI in the Courtroom: Are Robot Lawyers the Future of Law?

Blog 9: GitHub Copilot and the Legality of Generative Content

Blog 10: Google Steps into the Chat AI Ring with Bard, Anthropic Investment

Blog 11: Exploring Google Bard’s Botched Demo

Blog 12: Meta AI Is Working at the Intersection of Robotics and Generative AI

Blog 13: Meta’s New AI Model Leaks

Blog 14: Students in China Use ChatGPT from Behind the Firewall

Blog 15: OpenAI’s ChatGPT API Will Transform Application Experiences

Blog 16: Microsoft Announces Copilot X, GPT-4 Integration

Blog 17: BloombergGPT Brings Generative AI to Finance

Blog 18: Stability AI Releases Its First Large Language Model: StableLM

Blog 19: OpenAI to Patent ‘GPT’

Blog 20: Pinecone and the Power of Vector Databases for AI

Blog 21: Alphabet Plans New Generative AI Announcements for Google I/O

Blog 22: Europe Moves to Regulate Generative AI

Blog 23: OpenAI Introduces Code Interpreter Plugin for ChatGPT

Blog 24: Generative AI and the Labor Market: Is It Causing Job Loss?

Blog 25: OpenAI Announces Function Calling for Its GPT-4 API