
AI Weekly Roundup: OpenAI, Google, Meta, and More

By scribe 5 minute read


OpenAI Updates

OpenAI had several noteworthy announcements this week:

ChatGPT Search Feature

After months of testing, OpenAI has rolled out its ChatGPT search feature to paying subscribers (ChatGPT Plus and Team users). The feature lets users search the web directly within ChatGPT conversations.

To use the feature:

  1. Look for the new search icon in the ChatGPT interface
  2. Ask a question and select the search option
  3. ChatGPT will then provide search results based on current web information

Although it resembles existing tools like Perplexity, ChatGPT search currently returns less detailed results. This is likely to improve as OpenAI refines the feature.

Chat History Search

ChatGPT now includes a search function for past conversations. Users can easily find previous chats by searching for keywords or topics.

Voice Mode on Desktop

The advanced voice feature, previously available only on mobile, has been added to the ChatGPT desktop app for both Windows and Mac. Users can now have voice conversations with ChatGPT on their computers.

To access this feature:

  1. Ensure you have the latest version of the ChatGPT app
  2. Look for the "Use voice mode" option
  3. Click the button to activate voice interactions

Leadership AMA on Reddit

Several key figures from OpenAI, including CEO Sam Altman, participated in an "Ask Me Anything" (AMA) session on Reddit. Some notable points from the AMA:

  • A new text-to-image model update is in the works, though no release date was given
  • No plans for "GPT-5" in 2024, but other significant releases are expected
  • OpenAI believes Artificial General Intelligence (AGI) is achievable with current hardware
  • AI agents that can perform tasks autonomously are expected to be a major focus in 2025

Anthropic Developments

Anthropic, another leading AI company, had several updates:

Voice Dictation for Claude

Anthropic's Claude mobile app now supports voice dictation, allowing users to speak their queries to Claude. However, Claude still responds in text rather than voice.

Desktop Apps

Anthropic has released desktop applications for both Windows and Mac, providing a native experience similar to the web interface.

Claude Integration with GitHub Copilot

GitHub Copilot, owned by Microsoft, now lets users choose Claude alongside other AI models such as Google's Gemini. This is notable given Microsoft's investment in OpenAI, and it shows a willingness to incorporate competing AI technologies.

Google AI News

Google continues to expand its AI offerings:

Global Expansion of AI Overviews in Search

Google's AI-generated overviews in search results, previously limited to the US, have now been expanded to over 100 countries.

Gemini API Search Capability

Developers using the Gemini API can now incorporate web search functionality into their applications, similar to the AI overviews in Google Search.

AI-Generated Code at Google

Google CEO Sundar Pichai revealed that more than 25% of new code at Google is generated by AI and then reviewed by human engineers, significantly accelerating development processes.

Meta AI Initiatives

Meta (formerly Facebook) has been active in the AI space:

Open-Source Podcast Generator

Meta has released an open-source podcast generator similar to Google's NotebookLM. The tool can create a conversation between two AI hosts based on uploaded content.

Reuters Partnership

Meta has struck a deal with Reuters, marking its first partnership with a news content platform. This move is particularly significant given reports that Meta is developing its own AI-powered search engine.

AI-Powered Search Engine

While not officially confirmed, reports suggest Meta is working on an AI-powered search engine, potentially competing with similar offerings from OpenAI, Google, and Perplexity.

xAI Updates

xAI, the AI company behind the Grok model available on X (formerly Twitter), has added image understanding capabilities to Grok. Users can now upload images and ask Grok to describe or analyze them.

Apple's AI Integration

Apple is rolling out AI features across its ecosystem:

iOS 18.1 and 18.2

New iOS updates include AI-powered features like improved writing refinement, notification summarization, and enhanced Siri capabilities.

New Hardware with M4 Chips

Apple has released new iMac, MacBook Pro, and Mac mini models featuring M4 chips designed for "Apple Intelligence", the company's term for its AI capabilities.

App Store Review Summaries

Apple plans to introduce AI-generated summaries of user reviews in the App Store, similar to features seen on platforms like Amazon.

Emerging AI Tools and Research

Recraft V3 Image Generator

A mysterious AI image generator called "Red Panda" appeared on leaderboards, outperforming established models. It was later revealed to be Recraft V3, which is notable for rendering long passages of text within generated images.

ElevenLabs X to Voice

ElevenLabs introduced X to Voice, a tool that creates AI-generated voice clips based on a user's X (formerly Twitter) profile and posts.

Suno's Persona Feature

Suno, an AI music generation platform, now offers a "Personas" feature that allows users to save and apply specific vocal styles and vibes across multiple compositions.

D-ID's High-Quality Avatar

D-ID launched a new high-quality avatar feature capable of real-time conversations, potentially allowing users to create interactive digital versions of themselves.

Wonder Animation

Wonder Dynamics showcased Wonder Animation, a tool that can transform live-action footage into animated scenes, potentially revolutionizing animation production.

AI-Generated Minecraft

Decart and Etched collaborated to create a real-time, AI-generated Minecraft-like experience where each frame is newly generated as the player moves.

Coinbase's Based Agent

Coinbase introduced Based Agent, a tool for creating AI agents capable of handling various crypto-related tasks.

Meta's Embodied AI Research

Meta shared progress in embodied AI, particularly in touch perception for robots, enabling more sensitive and precise object manipulation.

Boston Dynamics' Autonomous Robot

Boston Dynamics demonstrated a fully autonomous robot capable of sorting and organizing items without human intervention.

Physical Intelligence's π0

Physical Intelligence unveiled π0 (pronounced "pi-zero"), a robot foundation model that enables robots to learn tasks by observing humans or other robots, with applications in household chores and industrial tasks.

Conclusion

The AI landscape continues to evolve rapidly, with major tech companies and startups pushing the boundaries of what's possible. From improvements in natural language processing and image generation to advancements in robotics and embodied AI, the field is progressing at an unprecedented pace. As we approach the end of the year, it will be interesting to see if the typical holiday slowdown occurs or if the momentum in AI development continues unabated.

For those looking to stay informed about the latest AI developments, resources like Future Tools offer daily updates on AI news and emerging technologies. Additionally, many of these new AI tools and features are available for public use or testing, providing hands-on experience with cutting-edge AI capabilities.

As AI becomes increasingly integrated into our daily lives and various industries, it's crucial to stay informed about these advancements and their potential impacts on society, work, and technology as a whole.

Article created from: https://youtu.be/2V5qINlecBA?feature=shared
