Artificial Intelligence Timeline

2022 - Present

2022
February
Midjourney v1
April
Midjourney v2
DALL-E 2 – announced for gradual release
July
Midjourney v3
August
Stable Diffusion 1.4
October
Stable Diffusion 1.5
November
Midjourney v4
Stable Diffusion 2.0
OpenAI releases ChatGPT, a chatbot based on its GPT-3.5 large language model, to the public, and it becomes a huge hit in a short time
December
Stable Diffusion 2.1
2023
February
Meta releases the LLaMA language model in a limited way, for research purposes only. Shortly afterwards, the model is leaked
Microsoft gradually releases Bing AI to a waiting list. It is an AI chat based on an upgraded GPT model that integrates internet search. The company later announces that it is based on GPT-4.
March
Midjourney v5
OpenAI's GPT-4 model is partially released. The model offers multimodal image analysis and significantly improved support for multiple languages
Google releases its AI chat, Bard, in a limited way. The chat is based on the LaMDA language model
April
Adobe releases the Firefly image generation model as a beta version to a waiting list. The model offers a variety of capabilities, including text effects
May
Midjourney v5.1
Google announces an upgrade to Bard, which will move to the upgraded PaLM 2 language model. In addition, it will be available in 180 countries and support many languages
June
Midjourney v5.2
July
Stable Diffusion XL 1.0
Anthropic announces Claude 2, a new version of its large language model
Meta releases the open-source LLaMA 2 language model to the general public in a variety of sizes
October
DALL-E 3
Adobe releases Firefly 2
November
Stable Diffusion XL Turbo, a fast model that generates an image in a single step, in real time
December
Midjourney v6
Google upgrades Bard in limited regions, moving it to the upgraded Gemini Pro language model.
X Corporation launches the Grok AI chatbot for paid subscribers, in English only
2024
February
Stability AI announces Stable Diffusion 3 (gradually released to a waiting list)
Google upgrades Bard to the new Gemini Pro model in all available languages, and renames "Bard" to "Gemini"
Google announces Gemini 1.5 Pro, a multimodal language model capable of processing up to a million tokens, as well as video and images. The model is gradually released to developers on a waiting list
OpenAI announces Sora, a model that generates videos up to a minute long. The model is not released to the public at this time
March
X Corporation announces the upcoming release of the open-source Grok 1.5 model
Anthropic announces Claude 3, a new version of its large language model. The version is released in three sizes, with the largest model reported to perform better than GPT-4
Suno AI, which develops a music generation model, releases Suno v3 to the general public
April
Stability AI releases Stable Audio 2.0, an update to its music generation model
X Corporation releases Grok-1.5V, an upgrade to its language model that adds high-level image recognition. In the test presented by the company, the model outperforms other models at identifying and analyzing images
Mistral releases its new model, Mixtral 8x22B, as open source. It is the most powerful of the open-source models at the time. It contains 141 billion parameters but uses a sparse mixture-of-experts design that activates only part of them for each token, which makes it more economical to run
Meta releases the LLaMA 3 model as open source in 8B and 70B parameter sizes. The larger model outperforms Claude 3 Sonnet and Gemini 1.5 Pro on several benchmarks. Meta is expected to release larger models, with 400 billion parameters or more, at a later date
Microsoft releases the Phi-3-mini model as open source. At 3.8B parameters, the model is small enough to run on mobile devices, and it shows capabilities similar to GPT-3.5
Adobe announces its new image generation model, Firefly 3
The startup Reka AI presents a series of multimodal language models in three sizes. The models can process video, audio and images. The largest model shows capabilities similar to GPT-4
Apple releases a series of small language models, named OpenELM, as fully open source. The models are available in four sizes, from 270 million to 3 billion parameters
May
OpenAI announces GPT-4o, a model with full multimodal capabilities: it accepts text, images, audio and video as input and generates audio, text and images, with all of these capabilities built into the model itself. The model shows twice the performance and speed of the GPT-4 Turbo model.