OpenAI Drops Advanced Voice AI

Perplexity to Acquire TikTok?!

In partnership with

AI PlanetX

Welcome to another edition of AI PlanetX.

OpenAI pushes Voice AI forward; Perplexity eyes TikTok deal, algorithm rollout; Neo Gamma robots begin large-scale testing.

Inside This Edition: 💎

  • Hottest AI News

  • Top AI & SaaS Tools

  • Top AI & Tech News

  • Interesting Uses of AI

  • AI Video Tutorial

  • Free AI Course of the Day: Learn to Program Alexa

Hottest AI News

OpenAI

OpenAI Unveils Next Generation of Voice AI Tools

OpenAI has launched three voice AI models (gpt-4o-transcribe, gpt-4o-mini-transcribe, and gpt-4o-mini-tts) that make it easier to add speech features to apps. The models are available through the API and, for free, on the OpenAI.fm demo site.

Details:

  • The models build on the GPT-4o base model from May 2024, with extra training for transcription and text-to-speech tasks. The transcription models reach a 2.46% word error rate in English, support 100+ languages, and outperform OpenAI's Whisper

  • Voice customization allows users to adjust accents, pitch, tone, and emotion via text prompts. OpenAI demonstrated how a voice could shift from a "mad scientist" to a "zen yoga teacher" with simple instructions

  • Integration is easy, requiring only nine lines of code using OpenAI's Agents SDK, making it ideal for customer service, meeting transcription, and AI assistants
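As a rough illustration of the voice customization described above, the sketch below builds the JSON body for a text-to-speech request to gpt-4o-mini-tts, steering the delivery with a plain-text instruction. The endpoint path, the `coral` voice preset, and the `instructions` field reflect our reading of OpenAI's public audio API and should be checked against the current docs. The helper name `build_speech_request` is our own, and nothing is sent over the network here.

```python
import json

# Assumed endpoint for OpenAI's text-to-speech API; verify against current docs.
SPEECH_ENDPOINT = "https://api.openai.com/v1/audio/speech"


def build_speech_request(text, style, voice="coral"):
    """Return the JSON-serializable body for a text-to-speech request.

    `style` is a free-text description of how the voice should sound,
    e.g. "zen yoga teacher". `voice` is an assumed preset name.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    return {
        "model": "gpt-4o-mini-tts",
        "voice": voice,
        "input": text,
        "instructions": f"Speak like a {style}.",
    }


scientist = build_speech_request("The experiment is alive!", "mad scientist")
yogi = build_speech_request("Breathe in, breathe out.", "zen yoga teacher")

# Same model and voice preset; only the instruction text changes the delivery.
print(f"POST {SPEECH_ENDPOINT}")
print(json.dumps(scientist, indent=2))
print(json.dumps(yogi, indent=2))
```

To actually produce audio, a body like this would be sent with an API key in the request headers, or passed through OpenAI's official client library.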

These models show OpenAI's progress in voice AI, addressing past controversies like the Scarlett Johansson voice issue by offering customizable voices instead of mimicking individuals.
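For readers unfamiliar with the metric behind the 2.46% figure cited above: word error rate (WER) is the number of word-level substitutions, deletions, and insertions needed to turn a transcript into the reference text, divided by the number of words in the reference. The sketch below is a generic textbook computation, not OpenAI's evaluation code, and the sample sentences are made up.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from transcript
            insertion = curr[j - 1] + 1  # extra word in transcript
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


reference = "schedule a meeting with the design team tomorrow"
hypothesis = "schedule a meeting with design team tomorrow morning"
print(f"WER: {word_error_rate(reference, hypothesis):.2%}")
```

By this definition, a 2.46% WER works out to roughly two or three word errors per 100 reference words.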

Perplexity AI

Perplexity Targets TikTok Acquisition, Plans Algorithm Release

Perplexity has shocked the tech industry by announcing plans to acquire TikTok and open-source its algorithm, amid uncertainty over TikTok's future in the U.S. ahead of a potential ban on April 5.

Details:

  • Perplexity plans to rebuild TikTok's algorithm in U.S. data centers with American oversight, making it transparent and open-source, using Nvidia Dynamo tech for AI upgrades

  • The acquisition would combine Perplexity's search features with TikTok's video library, adding citations and improved personalization across both platforms

  • Competition is strong, with Oracle, Microsoft, and investor groups interested. ByteDance has been hesitant to sell, and TikTok’s U.S. operations are valued at $30-50 billion, far more than Perplexity’s $18 billion valuation

Perplexity is known for attention-grabbing moves like mocking Google’s AI, sponsoring an F1 team, and naming Jimmy O. Yang as CSO, which raises the question of whether its TikTok bid is serious or another publicity stunt.

1X

Neo Gamma Robot Testing Coming to Thousands of Homes

1X plans to test its humanoid robot, Neo Gamma, in hundreds to thousands of homes by the end of 2025, a key step toward AI-powered humanoid assistants, though challenges remain.

Details:

  • CEO Bernt Børnich announced that Neo Gamma will be tested in homes this year. Because the robot is not fully autonomous, teleoperators will control it remotely. To address privacy concerns, customers can decide who gets access to the robot's sensors

  • Neo Gamma features improved AI and a nylon bodysuit for safer human-robot interaction. During an Nvidia GTC demo, it performed basic tasks but faced issues like Wi-Fi problems and low battery

  • Competition is rising in the humanoid robot market, with Figure planning similar tests and seeking $1.5 billion in funding, while OpenAI is rumored to be developing its own robots

While a few people will get to try Neo Gamma through 1X's waitlist, consumer-ready humanoid robots remain years away. Data from these early tests will be used to refine the AI and shape the future of domestic robotics.

Top AI & SaaS Tools

  • Bika (Lifetime Deal): Automate with business AI agents on a no-code platform that combines a billion-row database, forms, wikis, and automation for marketing, sales, and projects [Price increases in 24 hours]

  • FLORA: Creative canvas that integrates text-to-image, text-to-video, and Google Gemini 2.0's advanced image editing, among other features (Watch demo) [Free]

  • Hunyuan-T1: Ultra-large-scale reasoning model with complex instruction following, fast responses, and excellent long-text processing; claimed to surpass DeepSeek R1 [Free]

  • SkyReels: First text-to-film AI agent that creates film scripts, storyboards, characters, videos, voiceovers, lip sync, and can edit films autonomously [Free]

  • ElevenReader by ElevenLabs: Use this app to read text aloud, listen to audiobooks, and read PDFs, eBooks, and Kindle with top-quality voice AI [Free]

Top AI & Tech News

  • Anthropic has added web search to Claude, giving it access to real-time information and narrowing the feature gap with competitors like ChatGPT and Gemini

  • METR researchers found that AI task length has doubled every 7 months since 2019, indicating a "Moore's Law" for AI capabilities

  • OpenAI launched its o1-pro model via API, charging developers $150 per million input tokens and $600 per million output tokens, 10 times the price of the standard o1

  • LG introduced South Korea's first reasoning model, EXAONE Deep-32B, which excels in complex math, programming challenges, and tasks in Korean and English

  • Cloudflare launched AI Labyrinth, a tool that redirects unauthorized AI crawlers to irrelevant content, wasting their resources while protecting data and identifying threats

  • A roundup of the 20 top-trending open-source startups around the world, more than half of which are closely aligned with AI
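Two of the items above come down to simple arithmetic: what a 7-month doubling time implies over longer periods, and what an o1-pro request costs at $150 and $600 per million input and output tokens. The sketch below works through both. The function names and the sample token counts are our own, and the doubling function simply extrapolates the stated trend rather than reproducing METR's analysis.

```python
def capability_multiplier(months_elapsed, doubling_months=7.0):
    """Growth factor implied by a fixed doubling time (pure extrapolation)."""
    return 2.0 ** (months_elapsed / doubling_months)


# USD per million tokens, as quoted in the o1-pro item above.
O1_PRO_INPUT_PER_M = 150.0
O1_PRO_OUTPUT_PER_M = 600.0


def o1_pro_cost(input_tokens, output_tokens):
    """Estimated USD cost of one request at the quoted o1-pro prices."""
    total = (input_tokens * O1_PRO_INPUT_PER_M
             + output_tokens * O1_PRO_OUTPUT_PER_M)
    return total / 1_000_000


# Task length after 14 months (two doublings) and after 6 years.
print(f"14 months: {capability_multiplier(14):.0f}x")
print(f"72 months: {capability_multiplier(72):.0f}x")

# A hypothetical request with 10,000 input tokens and 2,000 output tokens.
print(f"Request cost: ${o1_pro_cost(10_000, 2_000):.2f}")
```

The point of the second calculation is that output tokens dominate quickly: at these prices, 2,000 output tokens cost almost as much as 10,000 input tokens.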

Interesting Uses of AI

AI Art Spotlight

Model: Midjourney

Prompt:

A black and white photograph of an Oriental Shorthair cat cuddling with another Oriental Shorthair, both eyes closed in peaceful slumber, with soft lighting highlighting their features against the stark background. The composition focuses on capturing moments between these two cats, creating a sense of warmth within their bond, in the style of Black and White photography. --ar 101:128 --v 6.1

Top AI Video Tutorial

Google’s NotebookLM Launched Some INSANE Features 🤯 (NEW USE CASES)

Complimentary AI Course of the Day

Learn to Program Alexa

Learn to create custom Alexa skills with hands-on projects. Explore interaction models and Amazon Lambda functions to design personalized voice responses. This course equips you with the tools to develop, publish, and enhance your skills for voice user interfaces (VUIs).
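As a taste of what such a course covers, here is a minimal sketch of an AWS Lambda handler for a custom Alexa skill, written without the Alexa Skills Kit SDK. The intent name `HelloIntent` and the spoken phrases are hypothetical, and the request and response shapes follow our understanding of the Alexa custom-skill JSON interface, so treat it as a starting point rather than course material.

```python
def build_response(speech_text, end_session=True):
    """Wrap plain text in the response envelope Alexa expects."""
    return {
        "version": "1.0",
        "response": {
            "outputSpeech": {"type": "PlainText", "text": speech_text},
            "shouldEndSession": end_session,
        },
    }


def lambda_handler(event, context):
    """Route an incoming Alexa request to a spoken reply."""
    request = event.get("request", {})
    request_type = request.get("type")

    if request_type == "LaunchRequest":
        # The user opened the skill without asking for anything specific.
        return build_response("Welcome. Ask me to say hello.", end_session=False)

    if request_type == "IntentRequest":
        intent_name = request.get("intent", {}).get("name")
        if intent_name == "HelloIntent":  # hypothetical custom intent
            return build_response("Hello from your custom skill.")

    return build_response("Sorry, I did not understand that.")


# Simulate the event Alexa would send for the hypothetical HelloIntent.
sample_event = {
    "request": {"type": "IntentRequest", "intent": {"name": "HelloIntent"}}
}
print(lambda_handler(sample_event, None))
```

In a real skill, the interaction model defined in the Alexa developer console maps spoken phrases to intent names like the one handled here.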