- Superintelligence
- Posts
- Meet OpenAI's 01 Pro
Meet OpenAI's 01 Pro
Hey friends! Welcome to the development of the AI world. Today's top AI news highlights OpenAI’s 01 Pro version, Microsoft unveils copilot vision, and Google’s PaliGemma 2 Model. Additionally, meet Avalue's advanced AI edge platform. Let’s dive in—enjoy this AI ride in just 3 minutes!
The AI World Today
OpenAI Advances with o1
Microsoft Unveils Copilot Vision
Google Unveils PaliGemma 2 Model
+
Heads Up
AI Solution
OpenAI's Big Moves: New Milestones and AGI Predictions

Screenshot: Andrew Sorkin/X
At the NYT Dealbook Summit, Sam Altman shared remarkable stats: ChatGPT now has 300M weekly active users, 1B daily messages, and 1.3M US developers. Altman predicts glimpses of AGI by 2025, with initial subtle impacts but intense long-term shifts. Altman acknowledged "some tension" with Microsoft but reaffirmed aligned priorities. On Elon Musk, he expressed sadness over legal disputes, calling Musk a "mega hero" of his youth, and doubted Musk would harm competitors with political influence. As part of 12 Days of OpenAI, the company also launched the ChatGPT Pro tier ($200/month), offering exclusive access to o1—its powerful new reasoning model with superior coding, math, and image analysis capabilities. Pro users also gain Advanced Voice mode and unlimited GPT-4o access. Vision and file upload features are next in line.
Microsoft Revolutionizing Real-Time Web Assistance

Image: Microsoft
Microsoft has introduced Copilot Vision, an AI feature embedded in the Edge browser, designed to monitor real-time online behavior and provide context-based assistance. Acting as a "second pair of eyes," it synchronizes with web browsing to analyze and simplify content, offering personalized suggestions and guidance. The tool can extract key details or hidden information from web pages, simplifying complex content for users. For example, when browsing product pages, it suggests suitable items based on user needs. Copilot Vision delivers real-time analysis and interaction, enhancing productivity and decision-making. This innovative feature transforms web browsing by combining content understanding and AI-driven insights, ensuring tailored assistance across various tasks and scenarios.
Revolutionizing Visual Language Understanding

Image: Google
Google has open-sourced PaliGemma 2, a cutting-edge visual language model offering sizes of 3B, 10B, and 28B parameters and resolutions of 224px, 448px, and 896px to address diverse tasks. It excels at generating detailed image captions that narrate actions, emotions, and scene contexts, making it ideal for applications like image search and generation. The model showcases superior performance in recognizing chemical formulas, musical notation, spatial reasoning, and chest X-ray report generation, expanding its utility across fields. As an evolution of the Gemma family, PaliGemma 2 boosts visual understanding, providing enhanced efficiency and scalability. Its contextual narrative capabilities promise breakthroughs in AI's interpretation of images, bridging the gap between vision and language tasks.
Heads Up
Elon Musk's xAI raises $6 billion from 97 investors, including major firms and Qatar Investment Authority, with minimum contributions of $77,593, per SEC filing.
Perplexity expands its publisher program, partnering with LA Times, Adweek, and others. Publishers share ad revenue and access performance metrics, boosting collaboration in AI-powered search.
President-elect Trump appoints David Sacks as "White House AI & Crypto Czar," placing a prominent Silicon Valley figure and podcaster in charge of emerging-tech policy.
Clone Robotics unveils Clone Alpha, a humanoid robot with synthetic organs and water-powered muscles, with 279 units available for preorder starting in 2025.
Google launches Gemini 1.5 updates on Android, featuring AI photo descriptions in Lookout, Spotify integration for Gemini Assistant, and enhanced phone controls and communication tools.
AI Solution
Avalue's Advanced AI Edge Platform Unveiled
Avalue Technology introduces a robust AI Edge platform powered by NVIDIA Jetson, featuring models like AIB-NINX-S and AIB-NIAO-S. Designed for machine vision applications, it supports object detection, facial recognition, and image segmentation, ideal for AI edge computing, defect inspection, medical imaging, and traffic monitoring. With performance levels ranging from 20 to 275 TOPS, these devices cater to varying AI workloads, ensuring scalability. The included AI SDK accelerates proof-of-concept development for retail, manufacturing, and transportation. Compatible with diverse modules and capture cards, the platform simplifies integration and customization. Avalue also offers Linux/BSP customization and certification support, speeding deployment and market readiness. This solution empowers businesses with efficient, adaptable tools for cutting-edge AI applications.