This past week has been about OpenAI vs Google and who is the likely leader in the world of AI. The OpenAI announcements were also accompanied by a lot of drama, with key people leaving the team and news about how OpenAI treats exits from the company. Not a good look for OpenAI – but we also got a very good lesson in PR from Sam Altman. Add to that some interesting updates from Meta, from Based Hardware and its founder Nik Shevchenko (our favourite open source hardware dev), and a new humanoid robot! Let’s dive in:
OpenAI’s new GPT-4o Update
OpenAI announced GPT-4o, a new multimodal AI model in the GPT-4 series. The long and short of it is that this version of ChatGPT can now see and hear things natively. The live demos were very interesting, and this opens up use cases that can be built directly on top of foundation models rather than relying on wrappers and pipelines to fit real-world needs. We’ve done a detailed breakdown here.
Google I/O – and a host of updates from Google on AI
Google I/O had so many angles to it that it’s hard to describe.
- The AI meme lives on
- Google is now the real life ‘Hooli’ from ‘Silicon Valley’
- Almost everything Google showcased was an announcement of a demo
- Google copying SpeedRun? (See tweet below)
Gaming is the tip of the spear for tech
— Troy Kirwin (@tkexpress11) May 14, 2024
We’ve been copied! @SamiraBehrouzan @speedrun @marcrebillet https://t.co/iCYEMSOEOU pic.twitter.com/VjwdsdqdRd
AI was clearly the central theme of I/O this year. TechCrunch did a good job of summarizing everything.
In case you missed today's #GoogleIO keynote presentation, we summed it up for you pic.twitter.com/TdMDTSmc88
— TechCrunch (@TechCrunch) May 14, 2024
Some key highlights:
- Gemini gets a big upgrade. Gemini 1.5 Pro now has a 2 million token context window, the largest context window of any commercially available chatbot in the world. Wow, what a flex 💪.
- Gemini 1.5 Flash – their lighter, faster model – was announced.
- Imagen 3 and Veo – Google’s text-to-image and text-to-video tools – were announced. (Unofficially, we’ve heard that Imagen 3 is actually very good, but we haven’t been able to test it yet.)
- Google search will get step-by-step reasoning so that Google Search can do the Google Searching for you.
- Project Astra is the future of Multimodal AI. This was Google’s response to GPT-4o. The demo was amazing, but Google has a bad history with demos.
- Gemini will soon be integrated into all Google products, which means Gemini will be able to handle returns for you and a host of other personal tasks, since it’ll have access to your e-mails and data.
- Oh and a lot of mumbo jumbo about AI safety and privacy. 🤷 – We’re way past believing any of that. As long as the service is great, we’ve made our peace with our data being traded for convenience.
You can read a list of 100 things that were announced at Google I/O over at the Google blog.
Drama at OpenAI
Some key senior people at OpenAI resigned immediately after the spring announcement. The list includes OpenAI co-founder and Chief Scientist Ilya Sutskever, and Jan Leike, leader of the superalignment group. There’s a lot of speculation around why, ranging from wild allegations that OpenAI has been sitting on AGI all the way to some reasonably believable arguments that OpenAI was selling out and the purists didn’t like that. Our bet is on the latter. Yann LeCun is in agreement.
Lot of absurd takes like this on the superalignment team leaving OpenAI.
— Daniel Jeffries (@Dan_Jeffries1) May 18, 2024
The more likely reason they left is not because Ilya and Jan saw some super advanced AI emerging that they couldn't handle but that they didn't and as the cognitive dissonance hit, OpenAI and other… pic.twitter.com/TH36beJnTX
There was also news that OpenAI takes its NDAs very seriously, and that some Dr. Evil level agreements were keeping those who left recently from speaking out against OpenAI. Of course, Sam, being the master he is, taught us a good lesson in PR. See his response below. We’re impressed, but we also know that Sam is an extremely smart hustler. So who knows! 🤷
in regards to recent stuff about how openai handles equity:
— Sam Altman (@sama) May 18, 2024
we have never clawed back anyone's vested equity, nor will we do that if people do not sign a separation agreement (or don't agree to a non-disparagement agreement). vested equity is vested equity, full stop.
there was…
Open source hardware is progressing in leaps and bounds
You can now turn any glasses into AI smart glasses for just $20
— Nik Shevchenko (@kodjima33) May 12, 2024
At today's hackathon we built Open Glass AI
It will then record your life and remember people names, count calories, live translate, and much more
Ant it's fully open source. Link 👇 pic.twitter.com/oOCLauayH4
We are already big fans of Nik Shevchenko, founder of Based Hardware. They just won a Meta AI hackathon for ‘Open Glass AI’ – open source smart glasses. Open Glass is very similar to Meta’s Ray-Ban smart glasses. It’s not as trendy yet, but it has great promise. For $20, it’s really value for money! A couple of months ago, Based Hardware released ‘Friend’ – an open source AI wearable that outclasses the likes of Humane’s AI Pin. And for $35, one can’t argue with the value proposition.
Meta’s new Camerabuds – camera integrated, AI enabled earbuds
If you think that’s a little bit strange, you wouldn’t be the only one. Much like Rabbit, Humane and some of the others in the space, Big Tech is now trying to find new ways to integrate AI (and themselves) into our lives. There isn’t much news about the earbuds other than that Meta is working on them. We’re looking forward to seeing how these turn out, especially since the Meta Ray-Ban smart glasses are really cool.
ElevenLabs announced Audio Native – have your web pages automatically read out by AI
Introducing Audio Native. Our embeddable audio player that automatically narrates your blog or news site.
— ElevenLabs (@elevenlabsio) May 17, 2024
Head to https://t.co/9r1jfAPFfQ to customize your player. Check out https://t.co/MpwphX2SXO for CMS platform integration guides.
Your readers (and soon to be listeners)… pic.twitter.com/y6VLJSWGs2
Natively enabling your web pages to be auto-read by AI is really cool. It’s wonderful for accessibility, and especially convenient for those who like to consume their content while doing other things. What we like most are the setup instructions: log in, whitelist your domain, choose a voice, customize your player and just copy the embed code. No fiddling with code and trying to get it to work. This is how AI is supposed to work. You can find out more here.
There’s a new humanoid robot on the block: The G1
Unitree revealed a new humanoid robot, the G1, that looks kickass. The robot can do a host of things, including being a ninja. At USD 16k, this robot is a big step forward in accessible robotics and should usher in many new use cases. Check out the video below.
Clearly the G1 is packed with some cutting edge tech – and in a 35kg package, the use case as a personal robot assistant at home becomes more and more tangible.
That’s it for the week. Please do share this article with others who might need a catch-me-up on the AI News for the week. See you in the next one.