Last weeks Google IO conference was a showcase of all their new AI tech coming this year. I have extracted the best bits for you.
Google Astra
This is a project from Google Deepmind designed to showcase AI agents in everyday life. It responds to voice and video input and can remember details from the scene. For example, you could video you living room and ask it ‘where can I find my keys’.
Its understanding of the world and relationship of objects in the word is quite amazing.
Veo
We have all seen the videos produced by OpenAI’s SORA, well this is Google’s equivalent, and the results are outstanding. This will create so many opportunities for videographers, filmmakers, and more importantly content creators.
Imagen-3
This is now released and available to use. It is Google’s text-image generator, the results are far superior than DALLE-3. Sample images below.
Gemini Gems
Gemini Gems is the alternative to OpenAI's custom GPT'S. It allows users to Create your own personalised model for your specific use case
Gemini Flash
Gemini Flash is a new efficient, quick and cost effective alternative model to Gemini 1.5 pro available as an API.
Multi-Step Reasoning
This will bring new multi-step reasoning capabilities to Google Search. It breaks your bigger question down into parts and figures out which problems to solve and in what order, so research that might've taken you minutes or even hours can be done in seconds.
There is so much development going on in the AI space at the moment it is hard to keep up and understand what some of these things mean for you, and more importantly how you can use them to build an income.
I am going to be investigating some of these topics over the next couple of weeks because everything is changing so fast at the moment, what was useful or relevant a year ago no longer is.
Exciting and scary times all rolled in to one ..