OpenAI’s Sora breaks the internet

What's brewing in AI #28

OpenAI announces Sora, Google releases Gemini 1.5, Eleven Labs announces AI generated sound effects, and more AI news this week you don't want to miss.

Creator of whatplugin.ai & the What's Brewing in AI newsletter
Feb 20, 2024

Welcome to the 158 new subscribers who joined in the last week.

Sit back, grab a coconut (or coffee), and check out some pretty mind-blowing advancements in AI.

What’s brewing in AI this week:

  • OpenAI’s Sora breaks the internet
  • Gemini 1.5 dwarfs ChatGPT’s context window
  • GPTs: Editor’s choice and new arrivals in the GPT store
  • The other top stories in AI this week

Dario’s Picks:

I. OpenAI introduces Sora

OpenAI introduced Sora last Thursday: a next-generation text-to-video model that, by the look of it, vastly outperforms anything we’ve seen so far.

An AI-generated video says more than a thousand words:

Sora is currently available only to early testers, with no public release date set.

Why it matters:

Sora understands and simulates the physical world in motion with unprecedented realism, all from a few simple text instructions. If it delivers what it promises, we’re officially at the point where, without close scrutiny, you can no longer tell whether a video has been AI-generated.

Our perception of reality and “what’s real” is going to be challenged. The opportunities for productive use and solving real-life problems are massive, as is the potential for abuse. The dynamics and costs of media production will change too, now that making video is as easy as typing on a keyboard.

Paraphrasing the philosopher Marshall McLuhan: technology is a means to extend our senses and, as soon as we’ve adopted it, societal norms, values, practices, and structures are bound to change. I’d say we’ve just taken a groundbreaking leap as far as extending and enlarging the visual human imagination goes. But don’t take it from me. Take it from Will Smith eating spaghetti.

II. Google launches Gemini 1.5 Pro

Google has announced a new model (again): Gemini 1.5.

The most notable thing is the massive context window: up to 1M tokens in a limited preview, with Google reporting research tests at up to 10M (GPT-4 Turbo has 128k). To get a glimpse of the opportunities it brings, I recommend Google’s own demos:

Source: Google

Key features:

  • Gemini 1.5 Pro is the first Gemini 1.5 model Google has made available for testing.
  • It’s mid-sized, but performs at a similar level to the larger Ultra 1.0 in benchmark tests.
  • Right now, the model has a 128k context window for most users, the same as GPT-4 Turbo. However, a limited group of developers has been given access to a context window of up to 1M tokens, and Google says it has tested up to 10M tokens in research. The larger window seems likely to be rolled out eventually, but first needs to be optimised for latency and computational requirements to improve the user experience (and, I’m presuming, to control Google’s costs).
  • It’s based on a new Mixture-of-Experts (MoE) architecture.
  • It’s multimodal: it understands images, video and audio. In other words, it’s not just big PDF files you’ll be able to feed the model, but also lengthy audio transcripts, image databases, entire movies and more.

Why it matters: The context window jump is huge compared to all other existing models. According to Google, performance stays at a very high level even as the context window increases. Judging by Google’s demo videos, we could soon be able to feed AI all our data and get high-quality responses back.

III. AI sound effects coming soon to Eleven Labs

Eleven Labs, a leader in text-to-speech, is getting a notable feature soon: AI-generated sound effects. That’s right, you will soon (no dates given yet) be able to simply describe a sound in a prompt and generate the audio for it. For their demo, the company cleverly chose to overlay AI-generated sounds on some of OpenAI’s Sora clips.

Why it matters: Not as revolutionary as the other news above, but nevertheless a cool new feature that seems inevitable and pairs well with the advances in AI video. Our AI-generated worlds will need sound, won’t they?

GPTs

Editor’s Choice

Weekly picks from me to you

Enterprise AI Use Case Advisor

💬 100+

This GPT will help you develop AI use cases for your company.

โž This GPT asks the right questions, and suggests tailored solution on how to best leverage AI in your organisation. Itโ€™s made by one of the biggest influencers in the AI sphere, Allie K. Miller.

A couple of useful SEO tools

Search Intent Optimization Tool

💬 900+ ↑100

Enhances content relevance based on Search Quality Evaluator Guidelines, utilizing methodologies from recent academic research.

Reviews the relevance of your site’s content based on Google’s official guidelines. I tested it on a page on my site and found a good optimisation opportunity.

Blog Post Title Generator

💬 600+ ↑100

Generates 25 SEO-focused blog titles.

A simple tool for creating optimised article titles (or getting ideas for new articles). Enter your target keyword and it gives you title suggestions categorised into lists, stories, opinions, questions and frameworks.

New arrivals in the GPT store

New GPTs featured in OpenAI’s official GPT store in the last 7 days

  1. Diagrams ⚡PRO BUILDER⚡ (Rank 7 in Programming)
  2. Website Generator (Rank 9 in Programming)
  3. AI Humanizer Pro (Rank 8 in Writing)
  4. Physics Oracle (Rank 7 in Education)
  5. Math Solver (Rank 12 in Education)
  6. math (Rank 7 in Lifestyle)

Bytes

  • Groq (not to be confused with Elon’s Grok) is an AI inference provider with really fast response times. It runs models on LPUs (Language Processing Units) instead of GPUs, unlocking faster speeds. Here’s a demo by Matt Shumer. BTW, the naming similarity to Elon Musk’s chatbot is incredible. As if this space isn’t already confusing enough when it comes to names 😄 Groq was actually first though, founded in 2016.
  • Fresh off the rumour mill: OpenAI is developing a web search product to challenge Google.
  • Reddit signs an AI content licensing deal ahead of its IPO.
  • Deutsche Telekom showcased an AI phone at MWC 2024. The phone, launched in collaboration with Qualcomm and Brain.ai, uses AI assistants in place of apps.
  • ChatGPT’s web traffic is down 11% since its peak in May last year.
  • Anthropic is testing Prompt Shield to avoid misinformation in the upcoming elections. It redirects questions on politics and voting to “authoritative” sources of voting information.
