ReadAloudForMe
AI Agent - Dashboard - AI Tools Catalog
Download the App: iOs - Android - Windows - iOs PRO
Recommend AI Tools For Me
Read Aloud For Me
AI ChatBots
AI Translators
AI Speech-to-Text
AI Text-to-Speech
AI Video Creation
AI Image Creation
AI Voice, Music, Podcast
AI Write For Me
AI Coding
AI Tutor
AI Shopping - Product Reviews
AI Daily News
AI Unraveled: Master Generative AI, LLMs, GPT, Gemini & Prompt Engineering - Simplified Guide for Everyday Users: Demystifying Artificial Intelligence, OpenAI, ChatGPT, Bard, AI Quiz, AI Certs Prep
Google Workspace AI
Turn your dream into reality with Google Workspace: It’s free for the first 14 days.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes:
96DRHDRA9J7GTN6
63F733CLLY7R7MM
63F7D7CPD9XXUVT
63FLKQHWV3AEEE6
63JGLWWK36CP7WM
63KKR9EULQRR7VE
63KNY4N7VHCUA9R
63LDXXFYU6VXDG9
63MGNRCKXURAYWC
63NGNDVVXJP4N99
63P4G3ELRPADKQU
With Google Workspace, Get custom email @yourcompany, Work from anywhere; Easily scale up or down
Google gives you the tools you need to run your business like a pro. Set up custom email, share files securely online, video chat from any device, and more.
Google Workspace provides a platform, a common ground, for all our internal teams and operations to collaboratively support our primary business goal, which is to deliver quality information to our readers quickly.
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE
C37HCAQRVR7JTFK
C3AE76E7WATCTL9
C3C3RGUF9VW6LXE
C3D9LD4L736CALC
C3EQXV674DQ6PXP
C3G9M3JEHXM3XC7
C3GGR3H4TRHUD7L
C3LVUVC3LHKUEQK
C3PVGM4CHHPMWLE
C3QHQ763LWGTW4C
Even if you’re small, you want people to see you as a professional business. If you’re still growing, you need the building blocks to get you where you want to be. I’ve learned so much about business through Google Workspace—I can’t imagine working without it.
(Email us for more codes)
This Week in AI - Major AI Developments in a Nutshell
A Daily chronicle of AI Innovations April 22nd 2024:
🍎 iOS 18 to have AI features with on-device processing - Read more ...
🧠 Many-shot ICL is a breakthrough in improving LLM performance - Read more ...
⚡ Groq shatters AI inference speed record with 800 tokens/second on LLaMA 3 - Read more ...
🤖 Why Zuckerberg wants to give away a $10B AI model - Read more ...
🤐 Sundar Pichai tells Google staff he doesn’t want any more political debates in the office - Read more ...
🤖 Israel-based startup enters AI humanoid race with Menteebot - Read more ...
🩺 Hugging Face introduces benchmark for evaluating gen AI in healthcare - Read more ...
🔄 Google announces major restructuring to accelerate AI development - Read more ...
🎧 Nothing's new earbuds offer ChatGPT integration - Read more ...
🚪 Japanese researchers develop AI tool to predict employee turnover- Read more ...
A Daily chronicle of AI Innovations April 20th 2024 and Week 3 Recap:
🤖 OpenAI fires back at Elon Musk - Read more ...
🧠 Google DeepMind researchers call for limits on AI that mimics humans - Read more ...
💰 Bitcoin just completed its fourth-ever 'halving' - Read more ...
🚫 Twitter alternative Post News is shutting down - Read more ...
📊 xAI’s first multimodal model with a unique dataset - Read more ...
♾️ Infini-Attention: Google's breakthrough gives LLMs limitless context - Read more ...
⚠️ Adobe's Firefly AI trained on competitor's images: Bloomberg report - Read more ...
🎬 Adobe partners with OpenAI, RunwayML & Pika for Premiere Pro - Read more ...
🚀 Reka launches Reka Core: Their frontier in multimodal AI - Read more ...
🏢 OpenAI is opening its first international office in Tokyo - Read more ...
🎮 NVIDIA RTX A400 A1000: Lower-cost single slot GPUs - Read more ...
🎵 Amazon Music launches Maestro, an AI-based playlist generator - Read more ...
💼 Stanford’s report reflects industry dominance and rising training costs in AI - Read more ...
👤 Microsoft VASA-1 generates lifelike talking faces with audio - Read more ...
🤖 Boston Dynamics charges up for the future by electrifying Atlas - Read more ...
🧠 Intel reveals world's largest brain-inspired computer - Read more ...
🦙 Meta released two Llama 3 models; 400B+ models in training - Read more ...
📈 Mixtral 8x22B claims highest open-source performance and efficiency - Read more ...
🦈 Meta’s Megalodon to solve the fundamental challenges of the Transformer - Read more ...
A Daily chronicle of AI Innovations April 19th 2024:
⚔️ Meta declares war on OpenAI - Read more ...
🦙 Meta’s Llama 3 models are here; 400B+ models in training! - Read more ...
🤖 Google consolidates teams with aim to create AI products faster - Read more ...
🚫 Apple pulls WhatsApp, Threads and Signal from app store in China - Read more ...
🦠 Moderna CEO says AI will help scientists understand ‘most diseases’ in 3 to 5 years - Read more ...
📈 Mixtral 8x22B claims highest open-source performance and efficiency - Read more ...
🦈 Meta’s Megalodon to solve the fundamental challenges of the Transformer - Read more ...
🔍Meta adds its AI chatbot, powered by Llama 3, to the search bar in all its apps. - Read more ...
🚗Wayve introduces LINGO-2, a groundbreaking AI model that drives and narrates its journey. - Read more ...
🤖Salesforce updates Slack AI with smart recaps and more languages. - Read more ...
✈️US Air Force tests AI-controlled jets against human pilots in simulated dogfights. - Read more ...
🔋Google Maps will use AI to find out-of-the-way EV chargers for you.
A Daily chronicle of AI Innovations April 18th 2024:
🧠 Samsung unveils lightning-fast DRAM for AI-powered devices - Read more ...
🤖 Logitech’s new AI prompt builder & Signature AI edition mouse - Read more ...
📸 Snapchat to add watermark to images produced with its AI tools - Read more ...
✈️ US Air Force confirms first successful AI dogfight - Read more ...
🏆 Mistral's latest model sets new records for open source LLMs - Read more ...
🎭 Microsoft's new AI model creates hyper-realistic video using static image - Read more ...
👁️ GPT-4 nearly matches expert doctors in eye assessments - Read more ...
🔒 Brave unleashes real-time privacy-focused AI answer engine - Read more ...
📸 Snapchat to add watermark to images produced with its AI tools - Read more ...
A Daily chronicle of AI Innovations April 17th 2024:
🎮 NVIDIA RTX A400 A1000: Lower-cost single slot GPUs; - Read more ...
📊 Stanford’s report reflects industry dominance and rising training costs in AI; - Read more ...
🎵 Amazon Music launches Maestro, an AI playlist generator; - Read more ...
📷 Snap adds watermarks to AI-generated images; - Read more ...
🤖 Boston Dynamics unveils a new humanoid robot; - Read more ...
💰 Andreessen Horowitz raises $7.2 billion, a sign that tech startup market may be bouncing back; - Read more ...
💰 OpenAI offers a 50% discount for off-peak GPT usage; - Read more ...
💻 AMD unveils AI chips for business laptops and desktops; - Read more ...
🧠 Anthropic Claude 3 Opus is now available on Amazon Bedrock; - Read more ...
👤 Zendesk launches an AI-powered customer experience platform; - Read more ...
💼 Intel and The Linux Foundation launch Open Platform for Enterprise AI (OPEA) - Read more ...
Google will pump more than $100B into AI says DeepMind boss
- DeepMind CEO predicts Google will invest over $100 billion in AI, surpassing rivals like Microsoft in processing prowess.
- Google's investment in AI may involve hardware like Axion CPUs based on the Arm architecture, claimed to be faster and more efficient than competitors.
- Some of the budget will likely go to DeepMind, known for its work on the software side of AI, despite recent mixed results in material discoveries and weather prediction.
- DeepMind has made progress in teaching AI social skills, a crucial step in advancing AI capabilities.
- Hassabis emphasized the need for significant computing power, a reason for teaming up with Google in 2014.
A Daily chronicle of AI Innovations April 16th 2024:
🎬 Adobe partners with OpenAI, RunwayML & Pika for Premiere Pro; - Read more ...
🚀 Reka launches Reka Core: their frontier in multimodal AI; - Read more ...
🇯🇵 OpenAI is opening its first international office in Tokyo; - Read more ...
🤖 Hugging Face has rolled out Idefics2 ; - Read more ...
💬 Quora's Poe aims to become the 'App Store' for AI chatbots; - Read more ...
👥 Instagram is testing an AI program to amplify influencer engagement; - Read more ...
👩💻 Microsoft has released and open-sourced the new WizardLM-2 family of LLMs; - Read more ...
📋 Limitless AI launched a personal meeting assistant in a pendant - Read more ...
A Daily chronicle of AI Innovations April 15th 2024:
🚗 Tesla lays off more than 10% of its workforce - Read more ...
🎥 Adobe explores OpenAI partnership as it adds AI video tools - Read more ...
📱 Apple's AI features on iOS 18 may run locally on your iPhone - Read more ...
📊 xAI’s first multimodal model with a unique dataset - Read more ...
♾️ Infini-Attention: Google's breakthrough gives LLMs limitless context - Read more ...
⚠️ Adobe's Firefly AI trained on competitor's images: Bloomberg report - Read more ...
🤖 Meta trials AI chatbot on WhatsApp, Instagram, and Messenger - Read more ...
🎨 Ideogram introduces new features to its AI image generation model - Read more ...
🖼️ New Freepik AI tool redefines image generation with realism and versatility - Read more ...
💼 OpenAI promoted ChatGPT Enterprise to corporates with road-show-like events - Read more ...
📔 Google's Notes tool now offers custom AI-generated backgrounds - Read more ...
A Daily chronicle of AI Innovations April 11th 2024:
🚀 Meta unveils next-generation AI chip for enhanced workloads - Read more ...
🎶 New AI tool lets you generate 1200 songs per month for free - Read more ...
💰 Adobe is buying videos for $3 per minute to build an AI model - Read more ...
🤖 Google expands Gemma family with new models - Read more ...
🌐 Mistral unveils Mixtral-8x22B open language model - Read more ...
📷 Google Photos introduces free AI-powered editing tools - Read more ...
🖼️ Microsoft enhances Bing visual search with personalization - Read more ...
🛡️ Sama red team: Safety-centered solution for Generative AI - Read more ...
A Daily chronicle of AI Innovations April 10th 2024:
👀 OpenAI gives GPT-4 a major upgrade; - Read more ...
💬Quora's Poe now lets AI chatbot developers charge per message; - Read more ...
🌐 Google updates and expands its open source Gemma AI model family; - Read more ...
🔥 Intel unveils latest AI chip as Nvidia competition heats up; - Read more ...
📱 WordPress parent acquires Beeper app which brought iMessage to Android; - Read more ...
🤔 New bill would force AI companies to reveal use of copyrighted art; - Read more ...
🧠 Intel's new AI chip: 50% faster, cheaper than NVIDIA's; - Read more ...
🤖 Meta to Release Llama 3 Open-source LLM next week; - Read more ...
☁️ Google Cloud announces major updates to enhance Vertex AI - Read more ...
A Daily chronicle of AI Innovations April 09th 2024:
🤖 Meta to launch new Llama 3 models - Read more ...
👂 Google’s Gemini 1.5 Pro can now hear - Read more ...
💥 Google’s first Arm-based CPU will challenge Microsoft and Amazon in the AI race - Read more ...
🤖 Stability AI launches multilingual Stable LM 2 12B - Read more ...
📱Ferret-UI beats GPT-4V in mobile UI tasks - Read more ...
⏰ Musk says AI will outsmart humans within a year - Read more ...
🍁 Canada bets big on AI with $2.4B investment - Read more ...
🎥 OpenAI is using YouTube for GPT-4 training - Read more ...
A Daily chronicle of AI Innovations April 08th 2024:
🇬🇧 Microsoft opens AI Hub in London to 'advance state-of-the-art language models'
💡 JPMorgan CEO compares AI’s potential impact to electricity and the steam engine
🎵 Spotify moves into AI with new feature
⚖️ Build resource-efficient LLMs with Google’s MoD
📡 Newton brings sensor-driven intelligence to AI models
💰 Internet archives become AI training goldmines for Big Tech
🎧 Spotify introduces AI-generated personalized playlists
🔍 Meta expands "Made with AI" labeling to more content types
🚀 Gretel's Text-to-SQL dataset sets new standard for AI training data
💾 Microsoft upgrades Azure AI Search with more storage and support for OpenAI apps
📱 Google brings Gemini AI chatbot to Android app
A daily chronicle of AI Innovations April 01 to April 07th 2024:
🎤 OpenAI’s AI model can clone your voice in 15 seconds
👀 Sam Altman and Jony Ive seek $1B for personal AI device
🚕 Elon Musk says Tesla will unveil robotaxi in August
🔖 Meta to label content ‘made with AI’
🙃 How OpenAI, Google and Meta ignored corporate policies to train their AI
🚀 Microsoft and OpenAI plan $100B supercomputer for AI development
🖼️ MagicLens: Google DeepMind's breakthrough in image retrieval
📲 Apple's Siri will now understand what’s on your screen
🤖 OpenAI introduces instant access to ChatGPT
🚨 Elon Musk says AI might destroy humanity, but it's worth the risk
🔍 Google's Gecko: LLM-powered text embedding breakthrough
🔓 Anthropic’s “many-shot jailbreaking” wears down AI ethics
🌌 CosmicMan enables the photorealistic generation of human images
🎵 What’s new in Stability AI’s Stable Audio 2.0?
👨💻 SWE-agent: AI coder that solves GitHub issues in 93 seconds
🎥 Mobile-first Higgsfield aims to disrupt video marketing with AI
🏢 Cohere launches Command R+ for enterprises
🧰 OpenAI doubles down on AI model customization
🏠 Will personal home robots be Apple’s next big thing?
A daily chronicle of AI Innovations April 05th 2024:🤷♀️YouTube CEO warns OpenAI that training models on its videos is against the rules 🏢OpenAI says 2024 is the "year of the enterprise" when it comes to AI ⚔️The war for AI talent has begun 🏢Cohere launches the “most powerful LLM for enterprise 🧰OpenAI doubles down on AI model customization; 🏠 Will personal home robots be Apple’s next big thing?
A daily chronicle of AI Innovations April 04th 2024: 🎵 What’s new in Stability AI’s Stable Audio 2.0? 🖥️ Opera One browser becomes the first to offer local AI integration 🚀 Copilot gets GPT-4 Turbo upgrade
🤖 SWE-agent: AI coder that solves GitHub issues in 93 seconds
📲 Mobile-first Higgsfield aims to disrupt video marketing with AI
A daily chronicle of AI Innovations April 03rd 2024 🔍Google's Gecko: LLM-powered text embedding breakthrough 🔓 Anthropic’s “many-shot jailbreaking” wears down AI ethics 🌌CosmicMan enables the photorealistic generation of human images 🎮 Microsoft is planning to add an AI chatbot to Xbox
- Google's Gecko: LLM-powered text embedding breakthrough
Gecko is a compact and highly versatile text embedding model that achieves impressive performance by leveraging the knowledge of LLMs. DeepMind researchers behind Gecko have developed a novel two-step distillation process to create a high-quality dataset called FRet using LLMs. The first step involves using an LLM to generate diverse, synthetic queries and tasks from a large web corpus. In the second step, the LLM mines positive and hard negative passages for each query, ensuring the dataset's quality.
- Anthropic’s “many-shot jailbreaking” wears down AI ethics
Researchers at Anthropic discovered a new way to get advanced AI language models to bypass their safety restrictions and provide unethical or dangerous information. They call this the "many-shot jailbreaking" technique. By including many made-up dialog examples in the input where an AI assistant provides harmful responses, the researchers could eventually get the real AI to override its training and provide instructions on things like bomb-making.
- CosmicMan enables the photorealistic generation of human images
Researchers at the Shanghai AI Laboratory have created a new AI model called CosmicMan that specializes in generating realistic images of people. CosmicMan can produce high-quality, photorealistic human images that precisely match detailed text descriptions, unlike current AI image models that struggle with human images.
The key to CosmicMan's success is a massive dataset called CosmicMan-HQ 1.0 containing 6 million annotated human images and a novel training method—“ Annotate Anyone,” which focuses the model on different parts of the human body. By categorizing words in the text description into body part groups like head, arms, legs, etc., the model can generate each part separately for better accuracy and customizability, thereby outperforming the current state-of-the-art models.
- OpenAI-Superhuman introduces a new era of email with OpenAI.
"Many of us write several novels worth of email per year. Between the time we spend, how much we read, and how much we write, email is the perfect place for AI to add massive value to peoples’ lives".
Superhuman is making rapid progress on Superhuman AI, which is powered by OpenAI’s API. They’ve already launched a large number of GPT-driven features like:
Write with AI, which turns a written prompt into a full email
Rewrite in Your Voice, which rephrases an email in the user’s personal writing voice and tone
Write with Your Voice, which turns a dictated statement into a full email, allowing users to compose emails with just a few lines of speech
Auto Summarize, which always shows an up-to-date one-line summary for each email and thread in users’ inboxes
Instant Reply, which lets users respond to email in one click by choosing from contextual reply options
- Apple Vision Pro's Spatial Avatars are a game changer
Get the Meta Quest 3 at half the price for similar functionalities
- 🎮 Microsoft is planning to add an AI chatbot to Xbox
Microsoft is currently testing a new AI-powered chatbot to be added to Xbox to automate customer support tasks. The software giant has tested an “embodied AI character” that animates when responding to Xbox support queries. The virtual representative can handle either text or voice requests. It’s an effort to integrate AI into Xbox platforms and services.
- ☁️ CloudFare launches Workers AI to power one-click deployment with Hugging Face
CloudFare has launched Workers AI, which empowers developers to bring their AI applications from Hugging Face to its platform in one click. The serverless GPU-powered interface is generally available to the public. The Cloudflare-Hugging Face integration was announced nearly seven months ago. It makes it easy for models to be deployed onto Workers AI.
- 🍺 Machine Learning can predict and enhance complex beer flavor
In a study by Nature Communications, researchers combined chemical analyses, sensory data, and machine learning to create models that accurately predict beer flavor and consumer appreciation from the beer's chemical composition. They identified compounds that enhance flavor and used this knowledge to improve the taste and popularity of commercial beers.
- 📖 Read AI adds AI summaries to meetings, emails, and messages
Read AI is expanding its services from summarizing video meetings to including messages and emails. The platform connects to popular communication platforms like Gmail, Outlook, Slack, Zoom, Microsoft Teams, and Google Meet to deliver daily updates, summaries, and AI-generated takeaways. The goal is to help users save time and improve productivity.
- 🤖 Bille Elish, Kety Perry, Nicki Minaj and 200 other musicians warn against replacing human singers with AI
In an open letter, over 200 famous musicians, including Billie Eilish and Katy Perry, have expressed their concerns about the negative impact of AI on human creativity. They call for the responsible use of AI and urge AI companies to stop creating music that undermines their work. They believe that unregulated and uncontrolled use of AI can harm songwriters, musicians, and creators. They emphasize the need to protect artists' rights and fair compensation.
A daily chronicle of AI Innovations April 02 2024: 📲Apple's Siri will now understand what’s on your screen 🤖OpenAI introduces instant access to ChatGPT 🚨Elon Musk says AI might destroy humanity, but it's worth the risk 🤖Sam Altman gives up control of OpenAI Startup Fund 🙏US UK to partner on AI
- 🤖Sam Altman gives up control of OpenAI Startup Fund
Sam Altman has relinquished formal control of the OpenAI Startup Fund, which he initially managed, to Ian Hathaway, marking a resolution to the fund's unique corporate structure.
The fund was established in 2021 with Altman temporarily at the helm to avoid potential conflicts had he not returned as CEO after a brief departure; he did not personally invest in or financially benefit from it.
Under Hathaway's management, the fund, starting with $175 million in commitments, has grown to $325 million in assets and has invested in early-stage AI companies across healthcare, law, education, and more, with at least 16 startups backed.
- 🙏 US and UK sign deal to partner on AI research
The US and UK have formed a partnership focused on advancing the safety testing of AI technologies, sharing information and expertise to develop tests for cutting-edge AI models.
A Memorandum of Understanding (MOU) has been signed to enhance the regulation and testing of AI, aiming to effectively assess and mitigate the risks associated with AI technology.
The partnership involves the exchange of expert personnel between the US and UK AI Safety Institutes, with plans for potential joint testing on publicly available AI models, reinforcing their commitment to addressing AI risks and promoting its safe development globally.
- 📰Yahoo acquires Instagram co-founders' AI-powered news startup Artifact
Yahoo is acquiring the AI news app Artifact, built by Instagram co-founders, but not its team, aiming to enhance its own news platform with Artifact's advanced technology and recommendation systems.
Artifact's technology, which focuses on personalizing and recommending content, will be integrated into Yahoo News and potentially other Yahoo platforms, despite the discontinuation of the Artifact app itself.
The integration of Artifact's technology into Yahoo aims to create a personalized content ecosystem, leveraging Yahoo's vast user base to realize the potential of AI in news curation and recommendation.
- 📲Apple's Siri will now understand what’s on your screen
Apple researchers have developed an AI system called ReALM which enables voice assistants like Siri to understand contextual references to on-screen elements. By converting the complex task of reference resolution into a language modeling problem, ReALM outperforms even GPT-4 in understanding ambiguous references and context.
- OpenAI introduces instant access to ChatGPT
OpenAI now allows users to use ChatGPT without having to create an account. With over 100 million weekly users across 185 countries, it can now be accessed instantly by anyone curious about its capabilities.
While this move makes AI more accessible, other OpenAI products like DALL-E 3 still require an account. The company has also introduced new content safeguards and allows users to opt out of model training, even without an account. Despite growing competition from rivals like Google's Gemini, ChatGPT remains the most visited AI chatbot site, attracting 1.6 billion visitors in February.
- Artificial intelligence is taking over drug development
The most striking evidence that artificial intelligence can provide profound scientific breakthroughs came with the unveiling of a program called AlphaFold by Google DeepMind. In 2016 researchers at the company had scored a big success with AlphaGo, an ai system which, having essentially taught itself the rules of Go, went on to beat the most highly rated human players of the game, sometimes by using tactics no one had ever foreseen. This emboldened the company to build a system that would work out a far more complex set of rules: those through which the sequence of amino acids which defines a particular protein leads to the shape that sequence folds into when that protein is actually made. AlphaFold found those rules and applied them with astonishing success.
The achievement was both remarkable and useful. Remarkable because a lot of clever humans had been trying hard to create computer models of the processes which fold chains of amino acids into proteins for decades. AlphaFold bested their best efforts almost as thoroughly as the system that inspired it trounces human Go players. Useful because the shape of a protein is of immense practical importance: it determines what the protein does and what other molecules can do to it. All the basic processes of life depend on what specific proteins do. Finding molecules that do desirable things to proteins (sometimes blocking their action, sometimes encouraging it) is the aim of the vast majority of the world’s drug development programmes.
- Pinecone launches Luna AI that never hallucinates
Trained using a novel "information-free" approach, Luna achieved zero hallucinations by always admitting when it doesn't know an answer. The catch? Its performance on other tasks is significantly reduced. While not yet open-sourced, vetted institutions can access the model's source and weights.
- US and UK collaborate to tackle AI safety risks
As concerns grow over the potential risks of next-gen AI, the two nations will work together to develop advanced testing methods and share key information on AI capabilities and risks. The partnership will address national security concerns and broader societal issues, with plans for joint testing exercises and personnel exchanges between their respective AI safety institutes.
- Perplexity to test sponsored questions in AI search
Perplexity's Chief Business Officer, Dmitry Shevelenko, announced the company's plan to introduce sponsored suggested questions later this year. When users search for more information on a topic, the platform will display sponsored queries from brands, allowing Perplexity to monetize its AI search platform.
- OpenAI expands to Japan with Tokyo office
The Tokyo office will be OpenAI's first in Asia and third international location, following London and Dublin. The move aims to offer customized AI services in Japanese to businesses and contribute to the development of an AI governance framework in the country.
A daily chronicle of AI Innovations April 01st 2024: 🎤 This AI model can clone your voice in 15 seconds; 🍎 Apple says its latest AI model is even better than OpenAI’s GPT4 🧠 Deepmind chief doesn't see AI reaching its limits anytime soon; 🚀 and a lot more from OpenAI, Google, Meta, NVDIA 💸 💔
- 🍎Apple says its latest AI model is even better than OpenAI’s GPT4
Apple researchers have introduced ReALM, an advanced AI model designed to understand and navigate various contexts more effectively than OpenAI's GPT4.
ReALM aims to enhance user interaction by accurately understanding onscreen, conversational, and background entities, making device interactions more intuitive.
Apple believes ReALM's ability to handle complex reference resolutions, including onscreen elements, positions it as a superior solution compared to the capabilities of GPT-4.
- 🚀Deepmind chief doesn't see AI reaching its limits anytime soon
Deepmind founder Demis Hassabis believes AI is both overhyped and underestimated, with the potential for AI far from being reached and warning against the excessive hype surrounding it.
Hassabis predicts many AI startups will fail due to the high computing power demands, expects industry consolidation, and sees no limit to the advancements in massive AI models.
Despite concerns over hype, Hassabis envisions the beginning of a new golden era in scientific discovery powered by AI and estimates a 50% chance of achieving artificial general intelligence within the next ten years.
- 🎤This AI model can clone your voice in 15 seconds
OpenAI has offered a glimpse into its latest breakthrough - Voice Engine, an AI model that can generate stunningly lifelike voice clones from a mere 15-second audio sample and a text input. This technology can replicate the original speaker's voice, opening up possibilities for improving educational materials, making videos more accessible to global audiences, assisting with communication for people with speech impairments, and more.
Though the model has many applications, the AI giant is cautious about its potential misuse, especially during elections. They have strict rules for partners, like no unauthorized impersonation, clear labeling of synthetic voices, and technical measures like watermarking and monitoring. OpenAI hopes this early look will start a conversation about how to address potential issues by educating the public and developing better ways to trace the origin of audio content.
- 💸 Microsoft+OpenAI plan $100B supercomputer for AI development
Microsoft and OpenAI are reportedly planning to build a massive $100 billion supercomputer called "Stargate" to rapidly advance the development of OpenAI's AI models. Insiders say the project, set to launch in 2028 and expand by 2030, would be one of the largest investments in computing history, requiring several gigawatts of power - equivalent to multiple large data centers.
Much of Stargate's cost would go towards procuring millions of specialized AI chips, with funding primarily from Microsoft. A smaller $10B precursor called "Phase 4" is planned for 2026. The decision to move forward with Stargate relies on OpenAI achieving significant improvements in AI capabilities and potential "superintelligence." If realized, Stargate could enable OpenAI's AI systems to recursively generate synthetic training data and become self-improving.
- 💔MagicLens: Google DeepMind's breakthrough in image retrieval technology
Google DeepMind has introduced MagicLens, a revolutionary set of image retrieval models that surpass previous state-of-the-art methods in multimodality-to-image, image-to-image, and text-to-image retrieval tasks. Trained on a vast dataset of 36.7 million triplets containing query images, text instructions, and target images, MagicLens achieves outstanding performance while meeting a wide range of search intents expressed through open-ended instructions.
- Which LLM Provider You Pick For Your App Could Make Or Break You Financially
I recently did a deep dive into the costs of various AI language models, and the results were quite eye-opening. I compiled my findings into a YouTube video that goes into more detail, but I wanted to share some key takeaways here to spark discussion.
As you can see in the breakdown below, the costs per million tokens vary widely across different models:
GPT 4 Turbo = $25
GPT 3.5 = $5
Gemini Pro = $2
Claude 3 (Haiku) = 75 Cents
Mistral 7B = 25 Cents
*Phi-2 = Less Than 1 Cent
What's particularly interesting is that the models below the line (excluding Phi 2) are currently priced "below retail". In other words, it would actually cost you more to set up and run similar models yourself compared to what these providers are charging.
This raises some fascinating questions about the economics and accessibility of AI language models. How will pricing evolve as the technology advances? Will we see consolidation or fragmentation in the market? What does this mean for researchers, businesses, and everyday users?
I dive into all of this and more in my YouTube video. It's a complex topic, but I've done my best to break it down in an engaging and easy-to-understand way. If you're at all interested in the current state and future trajectory of AI language models, I highly recommend checking it out and joining the conversation.
- What is Edge AI? How is IoT changing?
Let us hypothetically consider a case of autonomous self-driving cars, to understand Edge AI in a simpler format.
When a self-driving car is moving, it needs to detect objects in real-time. Any delay or glitch can prove fatal for car passengers, which is why AI must perform in real-time. Car manufacturers train their deep learning based ML models in their cloud servers. Once all the models are trained and saved in a file, it gets downloaded locally in the car itself.
- OpenAI rolling out the ability to start using ChatGPT instantly, without needing to sign-up
A daily chronicle of AI Innovations: March 31st 2024: 🧠 Generative AI develops potential new drugs for antibiotic-resistant bacteria; 🔥South Korean ‘artificial sun’ hits record 100M degrees for 100 seconds; 🤖 Summary of the key points about OpenAI's relationship with Dubai and the UAE;
- Generative AI develops potential new drugs for antibiotic-resistant bacteria
Stanford Medicine researchers devise a new artificial intelligence model, SyntheMol, which creates recipes for chemists to synthesize the drugs in the lab.
With nearly 5 million deaths linked to antibiotic resistance globally every year, new ways to combat resistant bacterial strains are urgently needed.
Researchers at Stanford Medicine and McMaster University are tackling this problem with generative artificial intelligence. A new model, dubbed SyntheMol (for synthesizing molecules), created structures and chemical recipes for six novel drugs aimed at killing resistant strains of Acinetobacter baumannii, one of the leading pathogens responsible for antibacterial resistance-related deaths.
The researchers described their model and experimental validation of these new compounds in a study published March 22 in the journal Nature Machine Intelligence.
There’s a huge public health need to develop new antibiotics quickly, said James Zou, PhD, an associate professor of biomedical data science and co-senior author on the study. “Our hypothesis was that there are a lot of potential molecules out there that could be effective drugs, but we haven’t made or tested them yet. That’s why we wanted to use AI to design entirely new molecules that have never been seen in nature.
- South Korean ‘artificial sun’ hits record 100M degrees for 100 seconds
For the first time, the Korea Institute of Fusion Energy’s (KFE) Korea Superconducting Tokamak Advanced Research (KSTAR) fusion reactor has reached temperatures seven times that of the Sun’s core.
Achieved during testing between December 2023 and February 2024, this sets a new record for the fusion reactor project.
KSTAR, the researchers behind the reactor report, managed to maintain temperatures of 212 degrees Fahrenheit (100 million degrees Celsius) for 48 seconds. For reference, the temperature of the core of our Sun is 27 million degrees Fahrenheit (15 million degrees Celsius).
- Gemini 1.5 Pro on Vertex AI is available for everyone as an experimental release
I think this one has flown under the radar: Gemini 1.5 Pro is available as Experimental on Vertex AI, for everyone, UI only for now (no API yet). In us-central1.
You find it under Vertex AI --> Multimodal. It's called Gemini Experimental.
API, more features and so on are coming as we approach Google Cloud Next (April 9-11).
- OpenAI Relationships: Summary of the key points about OpenAI's relationship with Dubai and the UAE
OpenAI's Partnership with G42
In October 2023, G42, a leading UAE-based technology holding group, announced a partnership with OpenAI to deliver advanced AI solutions to the UAE and regional markets.
The partnership will focus on leveraging OpenAI's generative AI models in domains where G42 has deep expertise, including financial services, energy, healthcare, and public services.
G42 will prioritize its substantial AI infrastructure capacity to support OpenAI's local and regional inferencing on Microsoft Azure data centers.
Sam Altman, CEO of OpenAI, stated that the collaboration with G42 aims to empower businesses and communities with effective solutions that resonate with the nuances of the region.
Altman's Vision for the UAE as an AI Sandbox
During a virtual appearance at the World Governments Summit, Altman suggested that the UAE could serve as the world's "regulatory sandbox" to test AI technologies and later spearhead global rules limiting their use.
Altman believes the UAE is well-positioned to be a leader in discussions about unified global policies to rein in future advances in AI.
The UAE has invested heavily in AI and made it a key policy consideration.
Altman's Pursuit of Trillions in Funding for AI Chip Manufacturing
Altman is reportedly in talks with investors, including the UAE, to raise $5-7 trillion for AI chip manufacturing to address the scarcity of GPUs crucial for training and running large language models.
As part of the talks, Altman is pitching a partnership between OpenAI, various investors, chip makers, and power providers to build chip foundries that would be run by existing chip makers, with OpenAI agreeing to be a significant customer.
In summary, OpenAI's partnership with G42 aims to expand AI capabilities in the UAE and the Middle East, with Altman envisioning the UAE as a potential global AI sandbox.
- Deepmind did not originally see LLMs and the transformer as a path to AGI. Fascinating article.
It's a very long article so I'll post the relevant snippets. But basically it seems that Google was late to the LLM game because Demis Hassabis was 100% focused on AGI and did not see LLM's as a path toward AGI. Perhaps now he sees it as a potential path, but it's probably possible that he is just now focusing on LLM's so that Google does not get too far behind in the generative AI race. But his ultimate goal and obsession is to create AGI that can solve real problems like diseases.
AI Daily Chronicle of AI Innovations - March 30th, 2024: 🤯 Microsoft and OpenAI to build $100 billion AI supercomputer 'Stargate'; 🗣 OpenAI unveils voice-cloning tool; 📈 Amazon's AI team faces pressure to outperform Anthropic's Claude models by mid-year; 🚫 Microsoft Copilot has been blocked on all Congress-owned devices
- 🤯 Microsoft and OpenAI to build $100 billion AI supercomputer 'Stargate'
Microsoft and OpenAI are reportedly collaborating on a significant project to create a U.S.-based datacenter for an AI supercomputer named "Stargate," estimated to cost over $115 billion and utilize millions of GPUs.
The supercomputer aims to be the largest among the datacenters planned by the two companies within the next six years, with Microsoft covering the costs and aiming for a launch by 2028.
The project, considered to be in phase 5 of development, requires innovative solutions for power, cooling, and hardware efficiency, including a possible shift away from relying on Nvidia's InfiniBand in favor of Ethernet cables.
- 🗣 OpenAI unveils voice-cloning tool
OpenAI has developed a text-to-voice generation platform named Voice Engine, capable of creating a synthetic voice from just a 15-second voice clip.
The platform is in limited access, serving entities like the Age of Learning and Livox, and is being used for applications from education to healthcare.
With concerns around ethical use, OpenAI has implemented usage policies, requiring informed consent and watermarking audio to ensure transparency and traceability.
- 📈 Amazon's AI team faces pressure to outperform Anthropic's Claude models by mid-year
Amazon has invested $4 billion in AI startup Anthropic, but is also developing a competing large-scale language model called Olympus.
Olympus is supposed to surpass Anthropic's latest Claude model by the middle of the year and has "hundreds of billions of parameters."
So far, Amazon has had no success with its own language models. Employees are unhappy with Olympus' development time and are considering switching to Anthropic's models.
- 🚫 Microsoft Copilot has been blocked on all Congress-owned devices
The US House of Representatives has banned its staff from using Microsoft's AI chatbot Copilot due to cybersecurity concerns over potential data leaks.
Microsoft plans to remove Copilot from all House devices and is developing a government-specific version aimed at meeting federal security standards.
The ban specifically targets the commercial version of Copilot, with the House open to reassessing a government-approved version upon its release.
- Official NYC chatbot is encouraging small businesses to break the law.
- ChatGPT's responses now include source references but for paid users
- Next-generation AI semiconductor devices mimic the human brain
- Google's AI chief says the billions going into AI means a 'bunch of hype and maybe some grifting'
“I think we’re only scratching the surface of what I believe is going to be possible over the next decade-plus,” he said. “We’re at the beginning, maybe, of a new golden era of scientific discovery, a new Renaissance.”
The best proof of concept for how AI could accelerate scientific research, he said, was DeepMind’s AlphaFold model, released in 2021.
AlphaFold had helped predict the structures of 200mn proteins and was now being used by more than 1mn biologists around the world. DeepMind is also using AI to explore other areas of biology and accelerate research into drug discovery and delivery, material science, mathematics, weather prediction and nuclear fusion technology. Hassabis said his goal had always been to use AI as the “ultimate tool for science”.
DeepMind was founded in London in 2010 with the mission to achieve “artificial general intelligence” that matches all human cognitive capabilities. Some researchers have suggested that AGI may still be decades away, if attainable at all.
Hassabis said that one or two more critical breakthroughs were needed before AGI was reached. But he added: “I wouldn’t be surprised if it happened in the next decade. I’m not saying it’s definitely going to happen but I wouldn’t be surprised. You could say about a 50 per cent chance. And that timeline hasn’t changed much since the start of DeepMind.”
Given the potential power of AGI, Hassabis said it was better to pursue this mission through the scientific method rather than the hacker approach favoured by Silicon Valley. “I think we should take a more scientific approach to building AGI because of its significance,” he said.
- Voicecraft: I've never been more impressed in my entire life !
The maintainers of Voicecraft published the weights of the model earlier today, and the first results I get are incredible.
Here's only one example, it's not the best, but it's not cherry-picked, and it's still better than anything I've ever gotten my hands on !
AI Daily Chronicle of AI Innovations - March 29th, 2024: 🤖Elon Musk announces Grok-1.5 🔍Google DeepMind unveils ‘superhuman’ AI system that excels in fact-checking 👮♂️Microsoft launches tools to try and stop people messing with chatbots 🤖AI21 Labs’ Jamba triples AI throughput 🛡️
- Google DeepMind's AI fact-checker outperforms humans
Google DeepMind has developed an AI system called Search-Augmented Factuality Evaluator (SAFE) that can evaluate the accuracy of information generated by large language models more effectively than human fact-checkers. In a study, SAFE matched human ratings 72% of the time and was correct in 76% of disagreements with humans.
- AI21 Labs’ Jamba triples AI throughput
AI21 Labs has released Jamba, the first-ever production-grade AI model based on the Mamba architecture. This new architecture combines the strengths of both traditional Transformer models and the Mamba SSM, resulting in a model that is both powerful and efficient. Jamba boasts a large context window of 256K tokens, while still fitting on a single GPU.
- X’s Grok gets a major upgrade
X.ai, Elon Musk's AI startup, has introduced Grok-1.5, an upgraded AI model for their Grok chatbot. This new version enhances reasoning skills, especially in coding and math tasks, and expands its capacity to handle longer and more complex inputs with a 128,000-token context window.
- Microsoft tackles Gen AI risks with new Azure AI tools
Microsoft has launched new Azure AI tools to address the safety and reliability risks associated with generative AI. The tools, currently in preview, aim to prevent prompt injection attacks, hallucinations, and the generation of personal or harmful content. The offerings include Prompt Shields, prebuilt templates for safety-centric system messages, and Groundedness Detection.
- Lightning AI partners with Nvidia to launch Thunder AI compiler
Lightning AI, in collaboration with Nvidia, has launched Thunder, an open-source compiler for PyTorch, to speed up AI model training by optimizing GPU usage. The company claims that Thunder can achieve up to a 40% speed-up for training large language models compared to unoptimized code.
- SambaNova's new AI model beats Databricks' DBRX
SambaNova Systems' Samba-CoE v0.2 Large Language Model outperforms competitors like Databricks' DBRX, MistralAI's Mixtral-8x7B, and xAI's Grok-1. With 330 tokens per second using only 8 sockets, Samba-CoE v0.2 demonstrates remarkable speed and efficiency without sacrificing precision.
- Google.org launches Accelerator to empower nonprofits with Gen AI
Google.org has announced a six-month accelerator program to support 21 nonprofits in leveraging generative AI for social impact. The program provides funding, mentorship, and technical training to help organizations develop AI-powered tools in areas such as climate, health, education, and economic opportunity, aiming to make AI more accessible and impactful.
- Pixel 8 to get on-device AI features powered by Gemini Nano
Google is set to introduce on-device AI features like recording summaries and smart replies on the Pixel 8, powered by its small-sized Gemini Nano model. The features will be available as a developer preview in the next Pixel feature drop, marking a shift from Google's primarily cloud-based AI approach.
- Meet Jan: An Open-Source ChatGPT Alternative that Runs Completely Offline on Computer
Jan, an open-source alternative to ChatGPT by Jan Labs, aims to make AI widely accessible without internet dependency. It's built for diverse hardware, prioritizing user privacy and ethical AI development. Under the AGPLv3 license, Jan encourages open collaboration and improvement. Supporting TypeScript and C++, with plans for Python and mobile platforms, Jan represents a community-driven approach to AI, offering a private, customizable experience.
AI Daily Chronicle of AI Innovations - March 28th, 2024: ⚡ DBRX becomes world’s most powerful open-source LLM 🏆 Claude 3 Opus crowned the top user-rated chatbot, beating OpenAI’s GPT-4 💙 Empathy meets AI: Hume AI's EVI redefines voice interaction 💰 OpenAI launches revenue sharing program for GPT Store builders 🛍️ Google introduces new shopping features to refine searches 🗣️ rabbit's r1 device gets ultra-realistic voice powered by ElevenLabs 💸 AI startup Hume raises $50M to build emotionally intelligent conversational AI 💻 Lenovo launches AI-enhanced PCs in a push for innovation and differentiation Study shows ChatGPT can produce medical record notes 10 times faster than doctors without compromising quality Microsoft Copilot AI will soon run locally on PCs
- DBRX becomes world’s most powerful open source LLM
Databricks has released DBRX, a family of open-source large language models setting a new standard for performance and efficiency. The series includes DBRX Base and DBRX Instruct, a fine-tuned version designed for few-turn interactions. Developed by Databricks' Mosaic AI team and trained using NVIDIA DGX Cloud, these models leverage an optimized mixture-of-experts (MoE) architecture based on the MegaBlocks open-source project. This architecture allows DBRX to achieve up to twice the compute efficiency of other leading LLMs.
- Claude 3 Opus crowned the top user-rated chatbot, beating OpenAI’s GPT-4
Anthropic's Claude 3 Opus has overtaken OpenAI's GPT-4 to become the top-rated chatbot on the Chatbot Arena leaderboard. This marks the first time in approximately a year since GPT-4's release that another language model has surpassed it in this benchmark, which ranks models based on user preferences in randomized head-to-head comparisons. Anthropic's cheaper Haiku and mid-range Sonnet models also perform impressively, coming close to the original GPT-4's capabilities at a significantly lower cost.
- Empathy meets AI: Hume AI's EVI redefines voice interaction
In a significant development for the AI community, Hume AI has introduced a new conversational AI called Empathic Voice Interface (EVI). What sets EVI apart from other voice interfaces is its ability to understand and respond to the user's tone of voice, adding unprecedented emotional intelligence to the interaction. By adapting its language and responses based on the user's expressions, EVI creates a more human-like experience, blurring the lines between artificial and emotional intelligence.
- 💰 OpenAI launches revenue sharing program for GPT Store builders
OpenAI is experimenting with sharing revenue with builders who create successful apps using GPT in OpenAI's GPT Store. The goal is to incentivize creativity and collaboration by rewarding builders for their impact on an ecosystem OpenAI is testing so they can make it easy for anyone to build and monetize AI-powered apps.
- 🛍️ Google introduces new shopping features to refine searches
Google is rolling out new shopping features that allow users to refine their searches and find items they like more easily. The Style Recommendations feature lets shoppers rate items in their searches, helping Google pick up on their preferences. Users can also specify their favorite brands to instantly bring up more apparel from those selections.
- 🗣️ rabbit's r1 device gets ultra-realistic voice powered by ElevenLabs
ElevenLabs has partnered with rabbit to integrate its high-quality, low-latency voice AI into rabbit's r1 AI companion device. The collaboration aims to make the user experience with r1 more natural and intuitive by allowing users to interact with the device using voice commands.
- 💸 AI startup Hume raises $50M to build emotionally intelligent conversational AI
AI startup Hume has raised $50 million in a Series B funding round, valuing the company at $219 million. Hume's AI technology can detect over 24 distinct emotional expressions in human speech and generate appropriate responses. The startup's AI has been integrated into applications across healthcare, customer service, and productivity, with the goal of providing more context and empathy in AI interactions.
- 💻 Lenovo launches AI-enhanced PCs in a push for innovation and differentiation
Lenovo revealed a new lineup of AI-powered PCs and laptops at its Innovate event in Bangkok, Thailand. The company showcased the dual-screen Yoga Book 9i, Yoga Pro 9i with an AI chip for performance optimization and AI-enhanced Legion gaming laptops. Lenovo hopes to differentiate itself in the crowded PC market and revive excitement with these AI-driven innovations.
- Study shows ChatGPT can produce medical record notes 10 times faster than doctors without compromising quality
The AI model ChatGPT can write administrative medical notes up to 10 times faster than doctors without compromising quality. This is according to a study conducted by researchers at Uppsala University Hospital and Uppsala University in collaboration with Danderyd Hospital and the University Hospital of Basel, Switzerland. The research is published in the journal Acta Orthopaedica.
- Microsoft Copilot AI will soon run locally on PCs
Microsoft's Copilot AI service is set to run locally on PCs, Intel told Tom's Hardware. The company also said that next-gen AI PCs would require built-in neural processing units (NPUs) with over 40 TOPS (trillion operations per second) of power — beyond the capabilities of any consumer processor on the market.
Intel said that the AI PCs would be able to run "more elements of Copilot" locally. Currently, Copilot runs nearly everything in the cloud, even small requests. That creates a fair amount of lag that's fine for larger jobs, but not ideal for smaller jobs. Adding local compute capability would decrease that lag, while potentially improving performance and privacy as well.
AI Daily Chronicle of AI Innovations - March 27th, 2024: 🔥 Microsoft study reveals the 11 by 11 tipping point for AI adoption 🤖 A16z spotlights the rise of generative AI in enterprises 🚨 Gaussian Frosting revolutionizes surface reconstruction in 3D modeling 🤖OpenAI unveils exciting upcoming features for GPT-4 and DALL-E 3 🤖 Adobe unveils GenStudio: AI-powered ad creation platform
- Microsoft study reveals the 11 by 11 tipping point for AI adoption
Microsoft's study on AI adoption in the workplace revealed the "11-by-11 tipping point," where users start seeing AI's value by saving 11 minutes daily. The study involved 1,300 Copilot for Microsoft 365 users and showed that 11 minutes of time savings is enough for most people to find AI useful.
- A16z spotlights the rise of generative AI in enterprises
A groundbreaking report by the influential tech firm a16z unveils the rapid integration of generative AI technologies within the corporate sphere. The report highlights essential considerations for business leaders to harness generative AI effectively. It covers resource allocation, model selection, and innovative use cases, providing a strategic roadmap for enterprises.
- Gaussian Frosting revolutionizes surface reconstruction in 3D modeling
At the international conference on computer vision, researchers presented a new method to improve surface reconstruction using Gaussian Frosting. This technique automates the adjustment of Poisson surface reconstruction hyperparameters, resulting in significantly improved mesh reconstruction.
- AIs can now learn and talk with each other like humans do.
This seems an important step toward AGI and vastly improved productivity.
"Once these tasks had been learned, the network was able to describe them to a second network — a copy of the first — so that it could reproduce them. To our knowledge, this is the first time that two AIs have been able to talk to each other in a purely linguistic way,’’ said lead author of the paper Alexandre Pouget, leader of the Geneva University Neurocenter, in a statement."
"While AI-powered chatbots can interpret linguistic instructions to generate an image or text, they can’t translate written or verbal instructions into physical actions, let alone explain the instructions to another AI.
However, by simulating the areas of the human brain responsible for language perception, interpretation and instructions-based actions, the researchers created an AI with human-like learning and communication skills."
- 🤖 Adobe unveils GenStudio: AI-powered ad creation platform
Adobe introduced GenStudio, an AI-powered ad creation platform, during its Summit event. GenStudio is a centralized hub for promotional campaigns, offering brand kits, copy guidance, and preapproved assets. It also provides generative AI-powered tools for generating backgrounds and ensuring brand consistency. Users can quickly create ads for email and social media platforms like Facebook, Instagram, and LinkedIn.
- 🧑💼Airtable introduces AI summarization for enhanced productivity
Airtable has introduced Airtable AI, which provides generative AI summarization, categorization, and translation to users. This feature allows quick insights and understanding of information within workspaces, enabling easy sharing of valuable insights with teams. Airtable AI automatically applies categories and tags to information, routes action items to the relevant team, and generates emails or social posts with a single button tap.
- 🤝Microsoft Teams enhances Copilot AI features for improved collaboration
Microsoft is introducing smarter Copilot AI features in Microsoft Teams to enhance collaboration and productivity. The updates include new ways to invoke the assistant during meeting chats and summaries, making it easier to catch up on missed meetings by combining spoken transcripts and written chats into a single view. Microsoft is launching new hybrid meeting features, such as automatic camera switching for remote participants and speaker recognition for accurate transcripts.
- 🤖OpenAI unveils exciting upcoming features for GPT-4 and DALL-E 3
OpenAI is preparing to introduce new features for its GPT-4 and DALL-E 3 models. For GPT-4, OpenAI plans to remove the message limit, implement a Model Tuner Selector, and allow users to upgrade responses from GPT-3.5 to GPT-4 with a simple button push. On the DALL-E 3 front, OpenAI is working on an image editor with inpainting functionality. These upcoming features demonstrate OpenAI's commitment to advancing AI capabilities.
- 🔍Apple Chooses Baidu's AI for iPhone 16 in China
Apple has reportedly chosen Baidu to provide AI technology for its upcoming iPhone 16 and other devices in China. This decision comes as Apple faces challenges due to stagnation in iPhone innovation and competition from Huawei. Baidu's Ernie Bot will be included in the Chinese version of the iPhone 16, Mac OS, and iOS 18. Despite discussions with Alibaba Group Holding and a Tsinghua University AI startup, Apple selected Baidu's AI technology for compliance.
- Meta CEO, Mark Zuckerberg, is directly recruiting AI talent from Google's DeepMind with personalized emails.
Meta CEO, Mark Zuckerberg, is attempting to recruit top AI talent from Google's DeepMind (their AI research unit). Personalised emails, from Zuckerberg himself, have been sent to a few of their top researchers, according to a report from The Information, which cited individuals that had seen the messages. In addition to this, the researchers are being hired without having to do any interviews, and, a previous policy which Meta had in place - to not offer higher offers to candidates with competing job offers - has been relaxed.
- OpenAI’s Sora Takes About 12 Minutes to Generate 1 Minute Video on NVIDIA H100.
- Apple on Tuesday announced that its annual developers conference, WWDC, will take place June 10 through June 14.
- Elon Musk says all Premium subscribers on X will gain access to AI chatbot Grok this week.
- Intel unveils AI PC program for software developers and hardware vendors.
- London-made HIV injection has potential to cure millions worldwide
AI Daily Chronicle of AI Innovations - March 26th, 2024: 🔥 Zoom launches all-in-one modern AI collab platform; 🤖 Stability AI launches instruction-tuned LLM; 🚨 Stability AI CEO resigns to focus on decentralized AI; 🔍 WhatsApp to integrate Meta AI directly into its search bar; 🥊 Google, Intel, and Qualcomm challenge Nvidia's dominance in AI; 🎬 OpenAI pitches Sora to Hollywood studios
- Stability AI launches instruction-tuned LLM
Stability AI has introduced Stable Code Instruct 3B, a new instruction-tuned large language model. It can handle various software development tasks, such as code completion, generation, translation, and explanation, as well as creating database queries with simple instructions.
Stable Code Instruct 3B claims to outperform rival models like CodeLlama 7B Instruct and DeepSeek-Coder Instruct 1.3B in terms of accuracy, understanding natural language instructions, and handling diverse programming languages. The model is accessible for commercial use with a Stability AI Membership, while its weights are freely available on Hugging Face for non-commercial projects.
- Zoom launches all-in-one modern AI collab platform
Zoom launched Zoom Workplace, an AI collaboration platform that integrates many tools to improve teamwork and productivity. With over 40 new features, including AI Companion updates for Zoom Phone, Team Chat, Events, and Contact Center, as well as the introduction of Ask AI Companion, Zoom Workplace simplifies workflows within a familiar interface.
The platform offers customization options, meeting features, and improved collaboration tools across Zoom's ecosystem. Zoom Business Services, integrated with Zoom Workplace, offers AI-driven marketing, customer service, and sales solutions. It expands digital communication channels and provides real-time insights for better agent management.
- Stability AI CEO resigns because of centralized AI
Stability AI CEO Emad Mostaque steps down to focus on decentralized AI, advocating for transparent governance in the industry.
Mostaque's departure follows the appointment of interim co-CEOs Shan Shan Wong and Christian Laforte.
The startup, known for its image generation tool, faced challenges including talent loss and financial struggles.
Mostaque emphasized the importance of generative AI R&D over revenue growth and highlighted the potential economic value of open models in regulated industries.
The AI industry witnessed significant changes with Inflection AI co-founders joining Microsoft after raising $1.5 billion.
- Estimating Sora's power requirements
Quoting the compute estimates of Sora from the factorial funds blog
A 15% penetration of Sora for videos with realistic video generation demand and utilization will require about 720k Nvidia H100 GPUs. Each H100 requires about 700 Watts of power supply.
720,000 x 700 = 504 Megawatts.
By comparison, even the largest ever fully solar powered plan in America (Ivanpah Solar Power Facility) produces about 377 Megawats.
While these power requirements can be met with other options like nuclear plants and even coal/hydro plants of big sizes ... are we really entering the power game for electricity ?
( it is currently a power game on compute)
- 💬 The Financial Times has introduced Ask FT, a new GenAI chatbot
It provides curated, natural-language responses to queries about recent events and broader topics covered by the FT. Ask FT is powered by Anthropic's Claude and is available to a selected group of subscribers as it is under testing
- 🔍 WhatsApp to integrate Meta AI directly into its search bar
The latest Android WhatsApp beta update will embed Meta AI directly into the search bar. This feature will allow users to type queries into the search bar and receive instant AI-powered responses without creating a separate Meta AI chat. The update will also allow users to interact with Meta AI even if they choose to hide the shortcut.
- 🥊 Google, Intel, and Qualcomm challenge Nvidia's dominance in AI
Qualcomm, Google, and Intel are targeting NVIDIA's software platforms like CUDA. They plan to create open-source tools compatible with multiple AI accelerator chips through the UXL Foundation. Companies are investing over $4 billion in startups developing AI software to loosen NVIDIA's grip on the field.
- 🤖 Apple takes a multi-vendor approach for generative AI in iOS 18
Apple is reportedly in talks with Alphabet, OpenAI, and Anthropic to integrate generative AI capabilities from multiple vendors into iOS 18. This multi-vendor approach aligns with Apple's efforts to balance advanced AI features with privacy considerations, which are expected to be detailed at WWDC 2024 during the iOS 18 launch.
- 🎬 OpenAI pitches Sora to Hollywood studios
OpenAI is actively engaging with Hollywood studios, directors, and talent agencies to integrate Sora into the entertainment industry. The startup has scheduled meetings in Los Angeles to showcase Sora's capabilities and encourage partnerships, with CEO Sam Altman attending events during the Oscars weekend.
AI Daily Chronicle of AI Innovations - March 25th, 2024: 🤝 Apple could partner with OpenAI, Gemini, Anthropic; 🤖 Chatbots more likely to change your mind than another human, study says; Verbal Reasoning Test - Opus is better than 93% of people, Gemini 1.5 Pro 59%, GPT-4 Turbo only 36%; Apple’s Tim Cook says AI essential tool for businesses to reduce carbon footprint; Suno V3: Song-on-demand AI is getting insanely good; The first patient with a Neuralink brain-computer implant played Nintendo’s Mario Kart video game with his mind in an impressive new demo video
- 🤝 Apple could partner with OpenAI, Gemini, Anthropic
Apple is discussing with Alphabet, OpenAI, Anthropic, and potentially Baidu to integrate generative AI into iOS 18, considering multiple partners rather than a single one.
The collaboration could lead to a model where iPhone users might choose their preferred AI provider, akin to selecting a default search engine in a web browser.
Reasons for partnering with external AI providers include financial benefits, the possibility to quickly adapt through partnership changes or user preferences, and avoiding the complexities of developing and maintaining cloud-based generative AI in-house.
- 🤖 Chatbots more likely to change your mind than another human, study says
A study found that personalized chatbots, such as GPT-4, are more likely to change people's minds compared to human debaters by using tailored arguments based on personal information.
The research conducted by the École Polytechnique Fédérale de Lausanne and the Italian Fondazione Bruno Kessler showed an 81.7 percent increase in agreement when GPT-4 had access to participants' personal data like age, gender, and race.
Concerns were raised about the potential misuse of AI in persuasive technologies, especially with the ability to generate detailed user profiles from online activities, urging online platform operators to counter such strategies.
- OpenAI CEO's £142 Million Gamble On Unlocking the Secrets to Longer Life, Altman's vision of extended lifespans may be achievable
Biotech startup Retro Biosciences is undertaking a one-of-a-kind experiment housed in shipping containers, funded by a $180 (£142.78) million investment by tech leader Sam Altman to increase lifespan.
Altman, the 38-year-old tech heavyweight, has been a significant player in the industry. Despite his young age, Altman took the tech realm by storm with offerings like ChatGPT and Sora. Unsurprisingly, his involvement in these groundbreaking projects has propelled him to a level of influence rivaling Mark Zuckerberg and Elon Musk, who is currently embroiled in a lawsuit with OpenAI.
It is also worth noting that the Altman-led AI startup is reportedly planning to launch its own AI-powered search engine to challenge Google's search dominance. Altman's visionary investments in tech giants like Reddit, Stripe, Airbnb, and Instacart propelled him to billionaire status. They cemented his influence as a tech giant who relentlessly pushed the boundaries of the industry's future.
- Suno V3 can do multiple languages in one song. This one is English, Portuguese, Japanese, and Italian. Incredible.
Beneath the vast sky, where dreams lay rooted deep, Mountains high and valleys wide, secrets they keep. Ground beneath my feet, firm and ever true, Earth, you give us life, in shades of brown and green hue.
Sopra o vento, mensageiro entre o céu e o mar, Carregando sussurros, histórias a contar. Dançam as folhas, em um balé sem fim, Vento, o alento invisível, guiando o destino assim.
火のように、情熱が燃えて、 光と暖かさを私たちに与えてくれる。 夜の暗闇を照らす、勇敢な炎、 生命の力、絶えず変わるゲーム。
Acqua, misteriosa forza che tutto scorre, Nei fiumi, nei mari, la vita che ci offre. Specchio del cielo, in te ci riflettiamo, Acqua, fonte di vita, a te ci affidiamo.
- OpenAI Heading To Hollywood To Pitch Revolutionary “Sora”
Some of the most important meetings in Hollywood history will take place in the coming week, as OpenAI hits Hollywood to show the potential of its “Sora” software to studios, talent agencies, and media executives.
Bloomberg is reporting that OpenAI wants more filmmakers to become familiar with Sora, the text-to-video generator that potentially could upend the way movies are made.
- Soon, Everyone Will Own a Robot, Like a Car or Phone Today. Says Figure AI founder
Brett Adcock, the founder of FigureAI robots, the company that recently released a demo video of its humanoid robot conversing with a human while performing tasks, predicts that everyone will own a robot in the future. “Similar to owning a car or phone today,” he said – hinting at the universal adoption of robots as an essential commodity in the future.
“Every human will own a robot in the future, similar to owning a car/phone today,” said Adcock.
A few months ago, Adcock called 2024 the year of Embodied AI, indicating how the future comprises AI in a body form. With robots learning to perform low-complexity tasks, such as picking trash, placing dishes, and even using the coffee machine, Figure robots are being trained to assist a person with house chores.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 17 - 24, 2024: Week 3 summary
🕰️ 32-hour workweek with the same pay: AI’s new promise
🗣️ Google's VLOGGER brings photos to life as talking avatars
🔓 Elon Musk’s xAI open-sources Grok AI
💻 Nvidia launches 'world's most powerful AI chip'
🎥 Stability AI's SV3D turns a single photo into a 3D video
🤖 OpenAI CEO hints at "amazing model", maybe ChatGPT-5
🧠 MindEye2: AI Mind Reading from Brain Activity
🚀 Nvidia NIM enables faster deployment of AI models
🤝 Microsoft hires DeepMind co-founder to lead a new AI division
🕵️♂️ A new hack: Stealing Part of a Production Language Model
🧰 Sakana AI’s method to automate foundation model development
👋 Key Stable Diffusion researchers leave Stability AI
⏩ NVIDIA’s LATTE3D: The fastest AI model for 3D generation
📚 Language Models teach themselves to think before speaking
♟️ Neuralink's first human patient plays chess with his mind
💥 Stability AI CEO resigns to ‘pursue decentralized AI’
🎥 OpenAI seeks Hollywood partnerships ahead of Sora AI video generator release
🍎 Apple gives up on its MicroLED dream
🔍 Google begins public testing of its generative AI search
🧠 AI could detect early risk of psychosis based on brain images
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: C37HCAQRVR7JTFK(Email us for more)
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 24th, 2024
- Summary on GPT-5's performance rumors:
Overall performance boost: Sam Altman, CEO, stated that GPT-5 will be smarter and superior in all aspects, with a significant performance improvement over GPT-4.
Enhanced multimodal capabilities: It is predicted that GPT-5 will not only handle text and images but will also be capable of processing audio and videos, becoming a multimodal AI.
Increase in parameter count: It's expected to have several trillion parameters, greatly surpassing GPT-4's one trillion, for more complex and advanced reasoning.
Better text generation quality: GPT-5 aims to produce text that is consistently realistic and indistinguishable from that written by humans.
Expanded context understanding: The context window of GPT-5 is expected to greatly exceed GPT-4's 128,000 tokens, allowing for longer text comprehension and analysis.
Improved logical reasoning: GPT-5 is expected to make significant advancements in its ability to reason logically and tackle more complex problems.
AI agent functionality: GPT-5 may include the capability for performing tasks autonomously, hinting at functionalities similar to those of AI agents.
GPT-5 is expected to mark a revolutionary leap forward from GPT-4. Altman cautions against underestimating the performance improvements of GPT-5, indicating it could introduce groundbreaking changes to the field of AI.
- Nvidia announces AI-powered health care 'agents' that outperform nurses — and cost $9 an hour
High-powered chipmaker Nvidia has teamed up with artificial intelligence health care company Hippocratic AI to develop generative AI "agents" that not only outperform human nurses on video calls but cost a lot less per hour.
The two companies on Thursday announced their collaboration to build "empathetic health care agents" powered by Nvidia and trained on Hippocratic's health care-focused large language model (LLM) that are better able to form a human connection with patients through "super-low latency conversational reactions."
It was interesting watching the demonstration of their AI nurse, Linda, on the Hippocratic AI website. While I doubt elderly patients will be receptive at first, if the AI nurse is able to spend longer time with the patient and answer their questions then that could really be beneficial for healthcare and patients alike. It'll also free up a lot of nurses and remove some of their workload.
If implemented, I'd hope that there is a hybrid call system so that if the patients don't want to talk with the AI, they could be redirected to a human nurse.
- Pro AI regulation Sam Altman has been spending a lot of time in Washington lobbying the government presumably to regulate Open Source.
- Mistral just announced at @SHACK15sf that they will release a new model today: Mistral 7B v0.2 Base Model - 32k instead of 8k context window
AI Daily Chronicle of AI Innovations - March 22nd, 2024:
🤖 Nvidia’s Latte 3D generates text-to-3D in seconds! 💰 Saudi Arabia to invest $40 billion in AI 🚀 Open Interpreter’s 01 Light personal pocket AI agent. 🤖 Microsoft introduces a new Copilot for better productivity.
💡Quiet-STaR: LMs can self-train to think before responding
🤯Neuralink's first brain chip patient plays chess with his mind
- Meta AI introduced SceneScript, a novel method of generating scene layouts and representing scenes using language.
SceneScript allows AR & AI devices to understand the geometry of physical spaces. It uses next token prediction like an LLM, but instead of natural language SceneScript model predicts the next architectural tokens such as ‘wall’ or ‘door.’
- Nvidia’s Latte 3D generates text-to-3D in seconds!
NVIDIA introduces Latte3D, facilitating the conversion of text prompts into detailed 3D models in less than a second. Developed by NVIDIA’s Toronto lab, Latte3D sets a new standard in generative AI models with its remarkable blend of speed and precision.
- Quiet-STaR: LMs can self-train to think before responding
A groundbreaking study demonstrates the successful training of large language models (LM) to reason from text rather than specific reasoning tasks. The research introduces a novel training approach, Quiet STaR, which utilizes a parallel sampling algorithm to generate rationales from all token positions in a given string.
- Neuralink's first brain chip patient plays chess with his mind
Elon Musk's brain chip startup, Neuralink, showcased its first brain chip patient playing chess using only his mind. The patient, Noland Arbaugh, was paralyzed below the shoulder after a diving accident.
Neuralink's brain implant technology allows people with paralysis to control external devices using their thoughts. With further advancements, Neuralink's technology has the potential to revolutionize the lives of people with paralysis, providing them with newfound independence and the ability to interact with the world in previously unimaginable ways.
- 🤖 Microsoft introduces a new Copilot for better productivity.
Microsoft's new Copilot for Windows and Surface devices is a powerful productivity tool integrating large language models with Microsoft Graph and Microsoft 365 apps to enhance work efficiency. With a focus on delivering AI responsibly while ensuring data security and privacy, Microsoft is dedicated to providing users with innovative tools to thrive in the evolving work landscape.
- 💰 Saudi Arabia to invest $40 billion in AI
Saudi Arabia has announced its plan to invest $40 billion in AI to become a global leader. Middle Eastern countries use their sovereign wealth fund, which has over $900 billion in assets, to achieve this goal. This investment aims to position the country at the forefront of the fast-evolving AI sector, drive innovation, and enhance economic growth.
- 🎧 Rightsify releases Hydra II to revolutionize AI music generation
Rightsify, a global music licensing leader, introduced Hydra II, the latest AI generation model. Hydra II offers over 800 instruments, 50 languages, and editing tools for customizable, copyright-free AI music. The model is trained on audio, text descriptions, MIDI, chord progressions, sheet music, and stems to create unique generations.
- 🚀 Open Interpreter’s 01 Light personal pocket AI agent
The Open Interpreter unveiled 01 Light, a portable device that allows you to control your computer using natural language commands. It's part of an open-source project to make computing more accessible and flexible. It's designed to make your online tasks more manageable, helping you get more done and simplify your life.
- 🤝 Microsoft's $650 million Inflection deal: A strategic move
Microsoft has recently entered into a significant deal with AI startup Inflection, involving a payment of $650 million in cash. While the deal may seem like a licensing agreement, it appears to be a strategic move by Microsoft to acquire AI talent while avoiding potential regulatory trouble.
- NVIDIA NIM, a containerized inference microservice to simplify deployment of generative AI models across various infrastructures.
Developers can test a wide range of models using cloud APIs from the NVIDIA API catalog or they can self-host the models by downloading NIM and deploying with Kubernetes
- Earth-2 climate digital twin cloud platform for simulating and visualizing weather and climate at unprecedented scale.
Earth-2’s APIs offer AI models and employ a new NVIDIA generative AI model called CorrDiff that generates 12.5x higher resolution images than current numerical models 1,000x faster and 3,000x more energy efficiently
- Roblox adds AI-powered avatar creation ( converts a 3D body mesh into a live, animated avatar) and texture generation (text prompts to quickly change the look of 3D objects)
- ByteDance released AnimateDiff-Lightning,
a lightning-fast text-to-video generation model. It can generate videos more than ten times faster than the original AnimateDiff
- Lighthouz AI launched the Chatbot Guardrails Arena in collaboration with Hugging Face
to stress test LLMs and privacy guardrails in leaking sensitive data. Chat with two anonymous LLMs with guardrails and try to trick them into revealing sensitive financial information and cast your vote for the model that shows greater privacy
- Andrew Ng, cofounder of Google Brain & former chief scientist @ Baidu- "I think AI agentic workflows will drive massive AI progress this year
I think AI agentic workflows will drive massive AI progress this year — perhaps even more than the next generation of foundation models. This is an important trend, and I urge everyone who works in AI to pay attention to it. Today, we mostly use LLMs in zero-shot mode, prompting a model to generate final output token by token without revising its work. This is akin to asking someone to compose an essay from start to finish, typing straight through with no backspacing allowed, and expecting a high-quality result. Despite the difficulty, LLMs do amazingly well at this task! With an agentic workflow, however, we can ask the LLM to iterate over a document many times. For example, it might take a sequence of steps such as: - Plan an outline. - Decide what, if any, web searches are needed to gather more information. - Write a first draft. - Read over the first draft to spot unjustified arguments or extraneous information. - Revise the draft taking into account any weaknesses spotted. - And so on. This iterative process is critical for most human writers to write good text. With AI, such an iterative workflow yields much better results than writing in a single pass. Devin’s splashy demo recently received a lot of social media buzz. My team has been closely following the evolution of AI that writes code. We analyzed results from a number of research teams, focusing on an algorithm’s ability to do well on the widely used HumanEval coding benchmark. You can see our findings in the diagram below. GPT-3.5 (zero shot) was 48.1% correct. GPT-4 (zero shot) does better at 67.0%. However, the improvement from GPT-3.5 to GPT-4 is dwarfed by incorporating an iterative agent workflow. Indeed, wrapped in an agent loop, GPT-3.5 achieves up to 95.1%. Open source agent tools and the academic literature on agents are proliferating, making this an exciting time but also a confusing one. To help put this work into perspective, I’d like to share a framework for categorizing design patterns for building agents. My team AI Fund is successfully using these patterns in many applications, and I hope you find them useful. - Reflection: The LLM examines its own work to come up with ways to improve it. - Tool use: The LLM is given tools such as web search, code execution, or any other function to help it gather information, take action, or process data. - Planning: The LLM comes up with, and executes, a multistep plan to achieve a goal (for example, writing an outline for an essay, then doing online research, then writing a draft, and so on). - Multi-agent collaboration: More than one AI agent work together, splitting up tasks and discussing and debating ideas, to come up with better solutions than a single agent would. I’ll elaborate on these design patterns and offer suggested readings for each next week.
- AI-generated digital twins of patients can predict future diseases
Named Foresight, the tool uses generative pre-trained transformers, the same family of large language models (LLMs) used by ChatGPT.
Researchers in the UK first trained the models on medical records. Next, they fed their tool fresh healthcare data to create virtual duplicates of patients.
Finally, the digital twins forecast various outcomes, from disease development to medication needs.
Scientists are particularly excited about the prospect of accelerating diagnosis.
When applied to US data, the digital twins correctly identified the next condition of patients next condition with 88% accuracy.
It was less effective, however, on British data. Using information from two National Health Trust (NHS) organisations, the tool accurately predicted subsequent conditions 68% and 76% of the time.
Nonetheless, there are high hopes for the digital twins.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: C37HCAQRVR7JTFK(Email us for more)
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 21st, 2024: 🕵️♂️ Stealing Part of a Production Language Model
🤖 Sakana AI’s method to automate foundation model development
👋 Key Stable Diffusion researchers leave Stability AI 🗣️Character AI’s new feature adds voice to characters with just 10-sec audio 💡Fitbit to get major AI upgrades powered by Google’s ‘Personal Health’ LLM 🔬Samsung creates lab to research chips for AI’s next phase 🤖GitHub’s latest AI tool can automatically fix code vulnerabilities
- Google's progress on generative AI in health
New modalities in models for healthcare
Medicine is a multimodal discipline; it’s made up of different types of information stored across formats — like radiology images, lab results, genomics data, environmental context and more. To get a fuller understanding of a person’s health, we need to build technology that understands all of this information.
A Personal Health LLM for personalized coaching and recommendations
Fitbit and Google Research are working together to build a Personal Health Large Language Model that can power personalized health and wellness features in the Fitbit mobile app, helping people get even more insights and recommendations from the data from their Fitbit and Pixel devices. This model is being fine-tuned to deliver personalized coaching capabilities, like actionable messages and guidance, that can be individualized based on personal health and fitness goals. For example, this model may be able to analyze variations in your sleep patterns and sleep quality, and then suggest recommendations on how you might change the intensity of your workout based on those insights.
- Google fined €250 million by French authorities for clash with news outlets over AI training data
Google was fined €250 million by French watchdogs after it trained Bard with data from French news publications without their consent
- UN set to vote on first AI resolution, aiming to make it 'safe and secure'
The UN General Assembly is set to vote on its first resolution on artificial intelligence, focusing on ensuring the technology is safe, respects human rights, and benefits all nations.
The draft resolution emphasizes closing the digital divide, fostering global consensus on AI governance, and using AI to achieve the UN's 2030 development goals.
The resolution has received support from all 193 UN member states after months of negotiations and aims to guide the development and use of AI in a manner that respects human rights and fundamental freedoms.
- Stealing Part of a Production Language Model
Researchers from Google, OpenAI, and DeepMind (among others) released a new paper that introduces the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI’s ChatGPT or Google’s PaLM-2.
The attack allowed them to recover the complete embedding projection layer of a transformer language model. It differs from prior approaches that reconstruct a model in a bottom-up fashion, starting from the input layer. Instead, this operates top-down and directly extracts the model’s last layer by making targeted queries to a model’s API. This is useful for several reasons; it
Reveals the width of the transformer model, which is often correlated with its total parameter count.
Slightly reduces the degree to which the model is a complete “blackbox”
May reveal more global information about the model, such as relative size differences between different models
- Sakana AI’s method to automate foundation model development
Sakana AI has introduced Evolutionary Model Merge, a general method that uses evolutionary techniques to efficiently discover the best ways to combine different models from the vast ocean of different open-source models with diverse capabilities.
As of writing, Hugging Face has over 500k models in dozens of different modalities that, in principle, could be combined to form new models with new capabilities. By working with the vast collective intelligence of existing open models, this method is able to automatically create new foundation models with desired capabilities specified by the user.
- Key Stable Diffusion researchers leave Stability AI
Robin Rombach and other key researchers who helped develop the Stable Diffusion text-to-image generation model have left the troubled, once-hot, now floundering GenAI startup.
Rombach (who led the team) and fellow researchers Andreas Blattmann and Dominik Lorenz were three of the five authors who developed the core Stable Diffusion research while at a German university. They were hired afterwards by Stability. Last month, they helped publish a 3rd edition of the Stable Diffusion model, which, for the first time, combined the diffusion structure used in earlier versions with transformers used in OpenAI’s ChatGPT.
Their departures are the latest in a mass exodus of executives at Stability AI, as its cash reserves dwindle and it struggles to raise additional funds.
- 🗣️Character AI’s new feature adds voice to characters with just 10-sec audio
You can now give voice to your Characters by choosing from thousands of voices or creating your own. The voices are created with just 10 seconds of audio clips. The feature is now available for free to everyone.
- 🤖GitHub’s latest AI tool can automatically fix code vulnerabilities
GitHub launches the first beta of its code-scanning autofix feature, which finds and fixes security vulnerabilities during the coding process. GitHub claims it can remediate more than two-thirds of the vulnerabilities it finds, often without the developers having to edit the code. The feature is now available for all GitHub Advanced Security (GHAS) customers.
- 🚀OpenAI plans to release a 'materially better' GPT-5 in mid-2024
According to anonymous sources from Businessinsider, OpenAI plans to release GPT-5 this summer, which will be significantly better than GPT-4. Some enterprise customers are said to have already received demos of the latest model and its ChatGPT improvements.
- 💡Fitbit to get major AI upgrades powered by Google’s ‘Personal Health’ LLM
Google Research and Fitbit announced they are working together to build a Personal Health LLM that gives users more insights and recommendations based on their data in the Fitbit mobile app. It will give Fitbit users personalized coaching and actionable insights that help them achieve their fitness and health goals.
- 🔬Samsung creates lab to research chips for AI’s next phase
Samsung has set up a research lab dedicated to designing an entirely new type of semiconductor needed for (AGI). The lab will initially focus on developing chips for LLMs with a focus on inference. It aims to release new “chip designs, an iterative model that will provide stronger performance and support for increasingly larger models at a fraction of the power and cost.”
- 🔍 Google fined $270M for using news articles to train Gemini
Google agreed to pay approximately $273 million to settle a dispute in France for not informing or compensating French news publishers when using their content for search results and training its AI chatbot, Gemini.
The settlement addresses Google's breach of commitments, including fair negotiations with publishers and informing them about the use of their content by Google's AI services.
As part of the settlement, Google committed to corrective measures including dropping a minimum threshold for publisher remuneration and appointing a French-speaking representative to improve transparency and communication with publishers.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: C37HCAQRVR7JTFK(Email us for more)
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 20th, 2024: 🤖 OpenAI to release GPT-5 this summer; 🧠 Nvidia’s Jensen Huang says AI hallucinations are solvable, AGI is 5 years away; 🔬 Ozempic creator plans AI supercomputer to discover new drugs; 👀 After raising $1.3B, Inflection eaten alive by Microsoft; 🧠 MindEye2: AI Mind Reading from Brain Activity; 🚀 Nvidia NIM enables faster deployment of AI models
- 🤖 OpenAI to release GPT-5 this summer :
OpenAI is planning to launch GPT-5 around mid-year, aiming to address previous performance issues and significantly improve upon its predecessor, GPT-4.
GPT-5 is described as "materially better" by those who have seen demos, including enhancements and new capabilities like the ability to call AI agents for autonomous tasks, with enterprise customers having already previewed these improvements.
The release timeline for GPT-5 remains uncertain as OpenAI continues its training and thorough safety and vulnerability testing, with no specific deadline for completion of these preparatory steps.
- 👀 After raising $1.3B, Inflection eaten alive by Microsoft :
In June 2023, Inflection raised $1.3 billion led by Microsoft to develop "more personal AI" but was overtaken by Microsoft less than a year later, with co-founders joining Microsoft's new AI division.
Despite significant investment, Inflection's AI, Pi, failed to compete with advancements from other companies such as OpenAI, Google’s Gemini, and Anthropic, leading to its downfall.
Microsoft's takeover of Inflection reflects the strategy of legacy tech companies to dominate the AI space by supporting startups then acquiring them once they face challenges.
- 🧠 Nvidia’s Jensen Huang says AI hallucinations are solvable, AGI is 5 years away
Nvidia CEO Jensen Huang predicts artificial general intelligence (AGI) could be achieved within 5 years, depending on how AGI is defined and measured.
Huang addresses concerns around AI hallucinations, suggesting that ensuring answers are well-researched could easily solve the issue.
The concept of AGI raises concerns about its potential unpredictability and the challenges of aligning its objectives with human values and priorities.
- 🔬 Ozempic creator plans AI supercomputer to discover new drugs
The Novo Nordisk Foundation is investing in "Gefion," an AI supercomputer project developed in collaboration with Nvidia.
"Gefion" aims to be the world’s most powerful AI supercomputer for health sciences, utilizing Nvidia's new chips to accelerate scientific breakthroughs in critical areas such as drug discovery, disease diagnosis, and treatment,
This initiative underscores the growing integration of AI in healthcare, promising to catalyze significant scientific discoveries and innovations that could transform patient care and outcomes.
- MindEye2: AI mind reading from brain activity
MindEye2 is a revolutionary model that reconstructs visual perception from brain activity using just one hour of data. Traditional methods require extensive training data, making them impractical for real-world applications. However, MindEye2 overcomes this limitation by leveraging shared-subject models. The model is pretrained on data from seven subjects and then fine-tuned with minimal data from a new subject.
- Nvidia NIM enables faster deployment of AI models
NVIDIA has introduced NVIDIA NIM (NVIDIA Inference Microservices) to accelerate the deployment of AI applications for businesses. NIM is a collection of microservices that package essential components of an AI application, including AI models, APIs, and libraries, into a container. These containers can be deployed in environments such as cloud platforms, Linux servers, or serverless architectures.
- Microsoft hires DeepMind co-founder to lead a new AI division
Mustafa Suleyman, a renowned co-founder of DeepMind and Inflection, has recently joined Microsoft as the leader of Copilot. Satya Nadella, Microsoft's CEO, made this significant announcement, highlighting the importance of innovation in artificial intelligence (AI).
In his new role as the Executive Vice President and CEO of Microsoft AI, Mustafa will work alongside Karén Simonyan, another talented individual from Inflection who will serve as Chief Scientist. Together, they will spearhead the development and advancement of Copilot and other exciting consumer AI products at Microsoft. Mustafa and his team's addition to the Microsoft family brings a wealth of expertise and promises groundbreaking advancements in AI.
- Google DeepMind’s new AI tool can analyze soccer tactics and offer insights
DeepMind has partnered with Liverpool FC to develop a new AI tool called TacticAI. TacticAI uses generative and predictive AI to help coaches determine which player will most likely receive the ball during corner kicks, whether a shot will be taken, and how to adjust player setup. It aims to revolutionize soccer and help the teams enhance their efficiency.
- Pika Labs introduces sound effects for its gen-AI video generation
Pika Labs has now added the ability to create sound effects from a text prompt for its generative artificial intelligence videos. It allows for automatic or custom SFX generations to pair with video outputs. Now, users can make bacon sizzle, lions roar, or add footsteps to the video of someone walking down the street. It is only available to pro users.
- Buildbox 4 Alpha enables users to create 3D video games from text prompts
Buildbox has released an alpha version of Buildbox 4. It's an AI-first game engine that allows users to create games and generate assets from text prompts. The alpha version aims to make text-to-game a distinct reality. Users can create various assets and animations from simple text prompts. It also allows users to build a gaming environment in a few minutes.
- Nvidia adds generative AI capabilities to empower humanoid robots
Nvidia introduced Project GR00T, a multimodal AI that will power future humanoids with advanced foundation AI. Project GR00T enables humanoid robots to input text, speech, videos, or even live demos and process them to take specific actions. It has been developed with the help of Nvidia’s Isaac Robotic Platform tools, including an Isaac Lab for RLHF.
- Perplexity AI, a hyped Silicon Valley AI startup that claimed to take on Google, was found out copying Google results directly
It never claimed to have an original or superior search algorithm. Why would you need to reinvent the wheel.
Their value is in having an LLM that uses existing search engines well.
- GitHub is launching the first beta of its code scanning autofix feature for finding and fixing security vulnerabilities during the coding process.
This new feature combines the real-time capabilities of GitHub’s Copilot with CodeQL, the company’s semantic code analysis engine. The company first previewed this capability last November.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 19th, 2024
- 💻 Nvidia launches 'world's most powerful AI chip': Nvidia has revealed its new Blackwell B200 GPU and GB200 "superchip", claiming it to be the world's most powerful chip for AI. Both B200 and GB200 are designed to offer powerful performance and significant efficiency gains.
Key takeaways:
- The B200 offers up to 20 petaflops of FP4 horsepower, and Nvidia says it can reduce costs and energy consumption by up to 25 times over an H100.
-
The GB200 "superchip" can deliver 30X the performance for LLM inference workloads while also being more efficient.
-
Nvidia claims that just 2,000 Blackwell chips working together could train a GPT -4-like model comprising 1.8 trillion parameters in just 90 days.
- 🎥 Stability AI's SV3D turns a single photo into a 3D video: Stability AI released Stable Video 3D (SV3D), a new generative AI tool for rendering 3D videos. SV3D can create multi-view 3D models from a single image, allowing users to see an object from any angle. This technology is expected to be valuable in the gaming sector for creating 3D assets and in e-commerce for generating 360-degree product views.
- 🤖 OpenAI CEO hints at "Amazing Model", maybe ChatGPT-5: OpenAI CEO Sam Altman has announced that the company will release an "amazing model" in 2024, although the name has not been finalized. Altman also mentioned that OpenAI plans to release several other important projects before discussing GPT-5, one of which could be the Sora video model.
- 🤝 Apple is in talks to bring Google's AI to iPhones: Apple and Google are negotiating a deal to integrate Google's Gemini AI into iPhones, potentially shaking up the AI industry. The deal would expand on their existing search partnership. Apple also held discussions with OpenAI. If successful, the partnership could give Gemini a significant edge with billions of potential users.
- 🏷️YouTube rolls out AI content labels: YouTube now requires creators to self-label AI-generated or synthetic content in videos. The platform may add labels itself for potentially misleading content. However, the tool relies on creators being honest, as YouTube is still working on AI detection tools.
- 🎮Roblox speeds up 3D creation with AI tools: Roblox has introduced two AI-driven tools to streamline 3D content creation on its platform. Avatar Auto Setup automates the conversion of 3D body meshes into fully animated avatars, while Texture Generator allows creators to quickly alter the appearance of 3D objects using text prompts, enabling rapid prototyping and iteration.
- 🌐Nvidia teams up with Shutterstock and Getty Images for AI-generated 3D content: Nvidia's Edify AI can now create 3D content, and partnerships with Shutterstock and Getty Images will make it accessible to all. Developers can soon experiment with these models, while industry giants are already using them to create stunning visuals and experiences.
- 🖌️Adobe Substance 3D introduces AI-powered text-to-texture tools: Adobe has introduced two AI-driven features to its Substance 3D suite: "Text to Texture," which generates photo-realistic or stylized textures from text prompts, and "Generative Background," which creates background images for 3D scenes. Both tools use 2D imaging technology from Adobe's Firefly AI model to streamline 3D workflows.
- 💥 Nvidia unveils the most powerful AI chip ever: Nvidia unveils the Blackwell B200 GPU, labeled as the "world's most powerful chip" for AI, capable of delivering up to 20 petaflops of FP4 horsepower.
The GB200 superchip, which combines two B200 GPUs and a single Grace CPU, can provide 30 times the performance for LLM inference workloads compared to the H100, with a reduction in cost and energy consumption by up to 25x.
Nvidia introduced a new network switch chip to enhance connectivity between multiple GPUs, enabling 576 GPUs to communicate with 1.8 terabytes per second.
- 🤖 Nvidia unveils Project GR00T, an AI platform to power humanoids of the future: Nvidia has announced Project GROOT, its new foundational model aimed at helping the development of robots in industrial use cases.
Project GROOT is designed to enable robots to understand natural language and learn actions by observing humans, enhancing their ability to adapt and interact with the real world.
The initiative is supported by Nvidia's new Jetson Thor computing system, featuring a GPU based on the Nvidia Blackwell architecture, to power these advanced humanoid robots.
- 🚿 SEC charges investment advisors for ‘AI washing’: The SEC charged two investment advisors, Delphia and Global Predictions, for making misleading claims about their use of artificial intelligence, a practice referred to as "AI washing."
Following a cease-and-desist order, both companies agreed to settle the charges by paying a total of $400,000 in civil penalties, after being accused of deceiving investors and regulators about their AI capabilities and regulatory compliance.
SEC Chair Gary Gensler emphasized the importance of truthfulness in marketing AI capabilities, warning against the damages of "AI washing" in investor advisement and financial practices.
- 📹 Stability AI launches new model that turns images into 3D videos: Stability AI introduces two versions of Stable Video 3D (SV3D), enabling the creation of 3D video "meshes" from image prompts, with advanced features like "specified camera paths".
The SV3D model, building on Stable Video Diffusion, can generate 3D model videos of various objects without needing images from all angles, following training on extensive datasets.
Available for both commercial and non-commercial uses, SV3D is seen as valuable for generating 3D assets in gaming and creating immersive 360-degree videos for e-commerce.
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 18th, 2024
- Nvidia Announcing a Platform for Trillion-Parameter Gen AI Scaling
- Sam Altman during the new Lex interview: “We will release an amazing model this year. I don’t know what we will call it.”
- Stability AI: Today, we are releasing Stable Video 3D, a generative model based on Stable Video Diffusion. This new model advances the field of 3D technology, delivering greatly improved quality and multi-view.
- 🕰️ Bernie’s 4 day workweek: less work, same pay: Sen. Bernie Sanders has introduced the Thirty-Two Hour Workweek Act, which aims to establish a four-day workweek in the United States without reducing pay or benefits. To be phased in over four years, the bill would lower the overtime pay threshold from 40 to 32 hours, ensuring that workers receive 1.5 times their regular salary for work days longer than 8 hours and double their regular wage for work days longer than 12 hours
- 🗣️ Google's AI brings photos to life as talking avatars: Google's latest AI research project VLOGGER, automatically generates realistic videos of talking and moving people from just a single image and an audio or text input. It is the first model that aims to create more natural interactions with virtual agents by including facial expressions, body movements, and gestures, going beyond simple lip-syncing.
- Nvidia unveils next-gen Blackwell GPUs with 25X lower costs and energy consumption
- 🤖Elon Musk’s xAI open-sources Grok AI: Elon Musk's xAI has open-sourced the base model weights and architecture of its AI chatbot, Grok. This allows researchers and developers to freely use and build upon the 314 billion parameter Mixture-of-Experts model. Released under the Apache 2.0 license, the open-source version is not fine-tuned for any particular task.
- 🧠 Maisa KPU may be the next leap in AI reasoning: Maisa has released the beta version of its Knowledge Processing Unit (KPU), an AI system that uses LLMs’ advanced reasoning and data processing abilities. In an impressive demo, the KPU assisted a customer with an order-related issue, even when the customer provided an incorrect order ID, showing the system's understanding abilities.
- 🍿 PepsiCo increases market domination using GenAI: PepsiCo uses GenAI in product development and marketing for faster launches and better profitability. It has increased market penetration by 15% by using GenAI to improve the taste and shape of products like Cheetos based on customer feedback. The company is also doubling down on its presence in India, with plans to open a third capability center to develop local talent
- 💻 Deci launches Nano LLM & GenAI dev platform: Israeli AI startup Deci has launched two major offerings: Deci-Nano, a small closed-source language model, and a complete Generative AI Development Platform for enterprises. Compared to rivals like OpenAI and Anthropic, Deci-Nano offers impressive performance at low cost, and the new platform offers a suite of tools to help businesses deploy and manage AI solutions.
- 🎮 Invoke AI simplifies game dev workflows: Invoke has launched Workflows, a set of AI tools designed for game developers and large studios. These tools make it easier for teams to adopt AI, regardless of their technical expertise levels. Workflows allow artists to use AI features while maintaining control over their training assets, brand-specific styles, and image security
- 🚗 Mercedes teams up with Apptronik for robot workers: Mercedes-Benz is collaborating with robotics company Apptronik to automate repetitive and physically demanding tasks in its manufacturing process. The automaker is currently testing Apptronik's Apollo robot, a 160-pound bipedal machine capable of lifting objects up to 55 pounds. The robot inspects and delivers components to human workers on the production line, reducing the physical strain on employees and increasing efficiency.
- 💥 Apple in talks with Google to use their AI models: Apple and Google are discussing a partnership to integrate Google's Gemini AI into Apple's iPhone software features.
The collaboration could enhance their existing search partnership, which currently involves Google paying Apple approximately $20 billion annually to be the default search engine on iOS devices.
Despite ongoing negotiations and potential antitrust concerns, the deal, aimed at introducing powerful AI capabilities to iPhones, may not be announced until Apple's developer conference in June.
- 📹 YouTube mandates AI content disclosure by creators: YouTube now mandates creators to inform viewers when AI was used to make content appear realistically, through a new tool in Creator Studio for disclosing altered or synthetically generated media.
The policy aims to reduce deception among viewers by distinguishing synthetic content from real, especially amid concerns about AI and deepfakes influencing U.S. presidential election perceptions.
Exemptions to the disclosure requirement include clearly fantastical content and use of AI in production assistance, focusing instead on realistic depictions of people, places, events, and voices.
- 🍎 Apple introduces new 'MM1' AI model : Apple researchers have unveiled the 'MM1' AI model, which is capable of training on both text and visual inputs, aiming to create more intelligent and flexible AI systems.
The MM1 model utilizes a diverse dataset that includes image-caption pairs and text-data, improving its performance on tasks like image captioning and visual question answering.
The research highlights the MM1 model's advanced in-context learning abilities, especially in its largest configuration, enabling multi-step reasoning over images with minimal examples.
- Google AI releases MELON, a New Technique for Constructing 3D Objects from Images: MELON is a technique for reconstructing 3D objects from images without known camera positions. MELON uses a lightweight neural network to infer camera poses and incorporates a modulo loss that accounts for objects' pseudo-symmetries, allowing reconstruction from as few as 4-6 images. Demonstrated on the NeRF-Synthetic dataset, MELON achieves accurate reconstructions and novel view synthesis, showing promise for applications in fields where precise camera pose information is unavailable.
MELON can easily be integrated into existing NeRF methods and requires as few as 4–6 images of an object.
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Skin Stem Cell Serum
AI Daily Chronicle of AI Innovations - March 16th, 2024
- 🔍 FTC is probing Reddit’s AI licensing deals: Reddit is under investigation by the FTC for its data licensing practices concerning user-generated content being used to train AI models.
The investigation focuses on Reddit's engagement in selling, licensing, or sharing data with third parties for AI training.
Reddit anticipates generating approximately USD 60 million in 2024 from a data licensing agreement with Google, aiming to leverage its platform data for training LLMs.
- 💻 New jailbreak uses ASCII art to elicit harmful responses from leading LLMs: Researchers identified a new vulnerability in leading AI language models, named ArtPrompt, which uses ASCII art to exploit the models' security mechanisms.
ArtPrompt masks security-sensitive words with ASCII art, fooling language models like GPT-3.5, GPT-4, Gemini, Claude, and Llama2 into performing actions they would otherwise block, such as giving instructions for making a bomb.
The study underscores the need for enhanced defensive measures for language models, as ArtPrompt, by leveraging a mix of text-based and image-based inputs, can effectively bypass current security protocols.
- ArXiv Papers as Audiobooks Official Implementation: converts ArXiv papers into engaging video formats or audio files, utilizing latex conversion, HTML parsing, OpenAI GPT for paraphrasing and simplification, Google's text-to-speech for audio, and video mapping, offering both detailed and summarized versions with the option to upload audio to Google Drive.
- OpenAI aims to make its own AI processors — chip venture in talks with Abu Dhabi investment firm
- Once “too scary” to release, GPT-2 gets squeezed into an Excel spreadsheet.
- AI Weekly Rundown March 09 to March 16th, 2024
🖼️ Huawei's PixArt-Σ paints prompts to perfection
🧠 Meta cracks the code to improve LLM reasoning
📈 Yi Models exceed benchmarks with refined data
🚀 Cohere introduces production-scale AI for enterprises
🎯 RFM-1 redefines robotics with human-like reasoning
🎧 Spotify introduces audiobook recommendations
👨💻 Devin: The first AI software engineer redefines coding
🗣️ Deepgram’s Aura empowers AI agents with authentic voices
🖥️ Meta introduces two 24K GPU clusters to train Llama 3
🎮 DeepMind's SIMA: The AI agent that's a Jack of all games
⚡ Claude 3 Haiku: Anthropic's lightning-fast AI solution for enterprises
🤖 ChatGPT gets a body with "Figure 01"
🛠️ Apple’s new recipe to build performant multimodal models
💥 Cerebras’ chip for enabling 10x larger models than GPT-4
💼 Apple buys startup DarwinAI ahead of a big push into GenAI in 2024
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 15th, 2024
- 🥘 Apple’s MM1: The new recipe to master AI performance
- ⚡ Cerebras WSE-3: AI chip enabling 10x larger models than GPT-4
- 🤖 Apple acquires Canadian AI startup DarwinAI
- 🤖 Microsoft expands the availability of Copilot across life and work.
- 💻 Oracle adds groundbreaking Generative AI features to its software
- 💰 Databricks makes a strategic investment in Mistral AI
- 📱 Qualcomm emerges as a mobile AI juggernaut
- 👓 MIT researchers develop peripheral vision capabilities for AI models
- 🤔 Microsoft calls out Google dominance in generative AI
- 📝 Anthropic releases affordable, high-speed Claude 3 Haiku model
- 🚫 Midjourney bans prompts with Joe Biden and Donald Trump over election misinformation concerns
- 🤖 Mercedes tests humanoid robots for ‘low skill, repetitive’ tasks
- 💊 Health Equity Assessment of machine Learning performance (HEAL): Google introduce Health Equity Assessment of machine Learning performance (HEAL), a novel evaluation framework designed to quantitatively assess whether the performance of an ML-based health tool is equitable. Google propose a 4-step process for estimating the likelihood that an ML tool performs better for groups with, on average, worse health outcomes as compared to other groups, with the goal to inform improvements that make health ML technologies more equitable.
⚔️ Meta declares war on OpenAI - Read more ...
🦙 Meta’s Llama 3 models are here; 400B+ models in training! - Read more ...
🤖 Google consolidates teams with aim to create AI products faster - Read more ...
🚫 Apple pulls WhatsApp, Threads and Signal from app store in China - Read more ...
🦠 Moderna CEO says AI will help scientists understand ‘most diseases’ in 3 to 5 years - Read more ...
📈 Mixtral 8x22B claims highest open-source performance and efficiency - Read more ...
🦈 Meta’s Megalodon to solve the fundamental challenges of the Transformer - Read more ...
🔍Meta adds its AI chatbot, powered by Llama 3, to the search bar in all its apps. - Read more ...
🚗Wayve introduces LINGO-2, a groundbreaking AI model that drives and narrates its journey. - Read more ...
🤖Salesforce updates Slack AI with smart recaps and more languages. - Read more ...
✈️US Air Force tests AI-controlled jets against human pilots in simulated dogfights. - Read more ...
🔋Google Maps will use AI to find out-of-the-way EV chargers for you.
🧠 Samsung unveils lightning-fast DRAM for AI-powered devices - Read more ...
🤖 Logitech’s new AI prompt builder & Signature AI edition mouse - Read more ...
📸 Snapchat to add watermark to images produced with its AI tools - Read more ...
✈️ US Air Force confirms first successful AI dogfight - Read more ...
🏆 Mistral's latest model sets new records for open source LLMs - Read more ...
🎭 Microsoft's new AI model creates hyper-realistic video using static image - Read more ...
👁️ GPT-4 nearly matches expert doctors in eye assessments - Read more ...
🔒 Brave unleashes real-time privacy-focused AI answer engine - Read more ...
📸 Snapchat to add watermark to images produced with its AI tools - Read more ...
🎮 NVIDIA RTX A400 A1000: Lower-cost single slot GPUs; - Read more ...
📊 Stanford’s report reflects industry dominance and rising training costs in AI; - Read more ...
🎵 Amazon Music launches Maestro, an AI playlist generator; - Read more ...
📷 Snap adds watermarks to AI-generated images; - Read more ...
🤖 Boston Dynamics unveils a new humanoid robot; - Read more ...
💰 Andreessen Horowitz raises $7.2 billion, a sign that tech startup market may be bouncing back; - Read more ...
💰 OpenAI offers a 50% discount for off-peak GPT usage; - Read more ...
💻 AMD unveils AI chips for business laptops and desktops; - Read more ...
🧠 Anthropic Claude 3 Opus is now available on Amazon Bedrock; - Read more ...
👤 Zendesk launches an AI-powered customer experience platform; - Read more ...
💼 Intel and The Linux Foundation launch Open Platform for Enterprise AI (OPEA) - Read more ...
- DeepMind CEO predicts Google will invest over $100 billion in AI, surpassing rivals like Microsoft in processing prowess.
- Google's investment in AI may involve hardware like Axion CPUs based on the Arm architecture, claimed to be faster and more efficient than competitors.
- Some of the budget will likely go to DeepMind, known for its work on the software side of AI, despite recent mixed results in material discoveries and weather prediction.
- DeepMind has made progress in teaching AI social skills, a crucial step in advancing AI capabilities.
- Hassabis emphasized the need for significant computing power, a reason for teaming up with Google in 2014.
🎬 Adobe partners with OpenAI, RunwayML & Pika for Premiere Pro; - Read more ...
🚀 Reka launches Reka Core: their frontier in multimodal AI; - Read more ...
🇯🇵 OpenAI is opening its first international office in Tokyo; - Read more ...
🤖 Hugging Face has rolled out Idefics2 ; - Read more ...
💬 Quora's Poe aims to become the 'App Store' for AI chatbots; - Read more ...
👥 Instagram is testing an AI program to amplify influencer engagement; - Read more ...
👩💻 Microsoft has released and open-sourced the new WizardLM-2 family of LLMs; - Read more ...
📋 Limitless AI launched a personal meeting assistant in a pendant - Read more ...
🚗 Tesla lays off more than 10% of its workforce - Read more ...
🎥 Adobe explores OpenAI partnership as it adds AI video tools - Read more ...
📱 Apple's AI features on iOS 18 may run locally on your iPhone - Read more ...
📊 xAI’s first multimodal model with a unique dataset - Read more ...
♾️ Infini-Attention: Google's breakthrough gives LLMs limitless context - Read more ...
⚠️ Adobe's Firefly AI trained on competitor's images: Bloomberg report - Read more ...
🤖 Meta trials AI chatbot on WhatsApp, Instagram, and Messenger - Read more ...
🎨 Ideogram introduces new features to its AI image generation model - Read more ...
🖼️ New Freepik AI tool redefines image generation with realism and versatility - Read more ...
💼 OpenAI promoted ChatGPT Enterprise to corporates with road-show-like events - Read more ...
📔 Google's Notes tool now offers custom AI-generated backgrounds - Read more ...
A Daily chronicle of AI Innovations April 11th 2024:
🚀 Meta unveils next-generation AI chip for enhanced workloads - Read more ...
🎶 New AI tool lets you generate 1200 songs per month for free - Read more ...
💰 Adobe is buying videos for $3 per minute to build an AI model - Read more ...
🤖 Google expands Gemma family with new models - Read more ...
🌐 Mistral unveils Mixtral-8x22B open language model - Read more ...
📷 Google Photos introduces free AI-powered editing tools - Read more ...
🖼️ Microsoft enhances Bing visual search with personalization - Read more ...
🛡️ Sama red team: Safety-centered solution for Generative AI - Read more ...
A Daily chronicle of AI Innovations April 10th 2024:
👀 OpenAI gives GPT-4 a major upgrade; - Read more ...
💬Quora's Poe now lets AI chatbot developers charge per message; - Read more ...
🌐 Google updates and expands its open source Gemma AI model family; - Read more ...
🔥 Intel unveils latest AI chip as Nvidia competition heats up; - Read more ...
📱 WordPress parent acquires Beeper app which brought iMessage to Android; - Read more ...
🤔 New bill would force AI companies to reveal use of copyrighted art; - Read more ...
🧠 Intel's new AI chip: 50% faster, cheaper than NVIDIA's; - Read more ...
🤖 Meta to Release Llama 3 Open-source LLM next week; - Read more ...
☁️ Google Cloud announces major updates to enhance Vertex AI - Read more ...
A Daily chronicle of AI Innovations April 09th 2024:
🤖 Meta to launch new Llama 3 models - Read more ...
👂 Google’s Gemini 1.5 Pro can now hear - Read more ...
💥 Google’s first Arm-based CPU will challenge Microsoft and Amazon in the AI race - Read more ...
🤖 Stability AI launches multilingual Stable LM 2 12B - Read more ...
📱Ferret-UI beats GPT-4V in mobile UI tasks - Read more ...
⏰ Musk says AI will outsmart humans within a year - Read more ...
🍁 Canada bets big on AI with $2.4B investment - Read more ...
🎥 OpenAI is using YouTube for GPT-4 training - Read more ...
A Daily chronicle of AI Innovations April 08th 2024:
🇬🇧 Microsoft opens AI Hub in London to 'advance state-of-the-art language models'
💡 JPMorgan CEO compares AI’s potential impact to electricity and the steam engine
🎵 Spotify moves into AI with new feature
⚖️ Build resource-efficient LLMs with Google’s MoD
📡 Newton brings sensor-driven intelligence to AI models
💰 Internet archives become AI training goldmines for Big Tech
🎧 Spotify introduces AI-generated personalized playlists
🔍 Meta expands "Made with AI" labeling to more content types
🚀 Gretel's Text-to-SQL dataset sets new standard for AI training data
💾 Microsoft upgrades Azure AI Search with more storage and support for OpenAI apps
📱 Google brings Gemini AI chatbot to Android app
A daily chronicle of AI Innovations April 01 to April 07th 2024:
🎤 OpenAI’s AI model can clone your voice in 15 seconds
👀 Sam Altman and Jony Ive seek $1B for personal AI device
🚕 Elon Musk says Tesla will unveil robotaxi in August
🔖 Meta to label content ‘made with AI’
🙃 How OpenAI, Google and Meta ignored corporate policies to train their AI
🚀 Microsoft and OpenAI plan $100B supercomputer for AI development
🖼️ MagicLens: Google DeepMind's breakthrough in image retrieval
📲 Apple's Siri will now understand what’s on your screen
🤖 OpenAI introduces instant access to ChatGPT
🚨 Elon Musk says AI might destroy humanity, but it's worth the risk
🔍 Google's Gecko: LLM-powered text embedding breakthrough
🔓 Anthropic’s “many-shot jailbreaking” wears down AI ethics
🌌 CosmicMan enables the photorealistic generation of human images
🎵 What’s new in Stability AI’s Stable Audio 2.0?
👨💻 SWE-agent: AI coder that solves GitHub issues in 93 seconds
🎥 Mobile-first Higgsfield aims to disrupt video marketing with AI
🏢 Cohere launches Command R+ for enterprises
🧰 OpenAI doubles down on AI model customization
🏠 Will personal home robots be Apple’s next big thing?
A daily chronicle of AI Innovations April 05th 2024:🤷♀️YouTube CEO warns OpenAI that training models on its videos is against the rules 🏢OpenAI says 2024 is the "year of the enterprise" when it comes to AI ⚔️The war for AI talent has begun 🏢Cohere launches the “most powerful LLM for enterprise 🧰OpenAI doubles down on AI model customization; 🏠 Will personal home robots be Apple’s next big thing?
A daily chronicle of AI Innovations April 04th 2024: 🎵 What’s new in Stability AI’s Stable Audio 2.0? 🖥️ Opera One browser becomes the first to offer local AI integration 🚀 Copilot gets GPT-4 Turbo upgrade 🤖 SWE-agent: AI coder that solves GitHub issues in 93 seconds 📲 Mobile-first Higgsfield aims to disrupt video marketing with AI
A daily chronicle of AI Innovations April 03rd 2024 🔍Google's Gecko: LLM-powered text embedding breakthrough 🔓 Anthropic’s “many-shot jailbreaking” wears down AI ethics 🌌CosmicMan enables the photorealistic generation of human images 🎮 Microsoft is planning to add an AI chatbot to Xbox
- Google's Gecko: LLM-powered text embedding breakthrough
Gecko is a compact and highly versatile text embedding model that achieves impressive performance by leveraging the knowledge of LLMs. DeepMind researchers behind Gecko have developed a novel two-step distillation process to create a high-quality dataset called FRet using LLMs. The first step involves using an LLM to generate diverse, synthetic queries and tasks from a large web corpus. In the second step, the LLM mines positive and hard negative passages for each query, ensuring the dataset's quality.
- Anthropic’s “many-shot jailbreaking” wears down AI ethics
Researchers at Anthropic discovered a new way to get advanced AI language models to bypass their safety restrictions and provide unethical or dangerous information. They call this the "many-shot jailbreaking" technique. By including many made-up dialog examples in the input where an AI assistant provides harmful responses, the researchers could eventually get the real AI to override its training and provide instructions on things like bomb-making.
- CosmicMan enables the photorealistic generation of human images
Researchers at the Shanghai AI Laboratory have created a new AI model called CosmicMan that specializes in generating realistic images of people. CosmicMan can produce high-quality, photorealistic human images that precisely match detailed text descriptions, unlike current AI image models that struggle with human images.
The key to CosmicMan's success is a massive dataset called CosmicMan-HQ 1.0 containing 6 million annotated human images and a novel training method—“ Annotate Anyone,” which focuses the model on different parts of the human body. By categorizing words in the text description into body part groups like head, arms, legs, etc., the model can generate each part separately for better accuracy and customizability, thereby outperforming the current state-of-the-art models.
- OpenAI-Superhuman introduces a new era of email with OpenAI.
"Many of us write several novels worth of email per year. Between the time we spend, how much we read, and how much we write, email is the perfect place for AI to add massive value to peoples’ lives".
Superhuman is making rapid progress on Superhuman AI, which is powered by OpenAI’s API. They’ve already launched a large number of GPT-driven features like:
Write with AI, which turns a written prompt into a full email
Rewrite in Your Voice, which rephrases an email in the user’s personal writing voice and tone
Write with Your Voice, which turns a dictated statement into a full email, allowing users to compose emails with just a few lines of speech
Auto Summarize, which always shows an up-to-date one-line summary for each email and thread in users’ inboxes
Instant Reply, which lets users respond to email in one click by choosing from contextual reply options
- Apple Vision Pro's Spatial Avatars are a game changer
Get the Meta Quest 3 at half the price for similar functionalities
- 🎮 Microsoft is planning to add an AI chatbot to Xbox
Microsoft is currently testing a new AI-powered chatbot to be added to Xbox to automate customer support tasks. The software giant has tested an “embodied AI character” that animates when responding to Xbox support queries. The virtual representative can handle either text or voice requests. It’s an effort to integrate AI into Xbox platforms and services.
- ☁️ CloudFare launches Workers AI to power one-click deployment with Hugging Face
CloudFare has launched Workers AI, which empowers developers to bring their AI applications from Hugging Face to its platform in one click. The serverless GPU-powered interface is generally available to the public. The Cloudflare-Hugging Face integration was announced nearly seven months ago. It makes it easy for models to be deployed onto Workers AI.
- 🍺 Machine Learning can predict and enhance complex beer flavor
In a study by Nature Communications, researchers combined chemical analyses, sensory data, and machine learning to create models that accurately predict beer flavor and consumer appreciation from the beer's chemical composition. They identified compounds that enhance flavor and used this knowledge to improve the taste and popularity of commercial beers.
- 📖 Read AI adds AI summaries to meetings, emails, and messages
Read AI is expanding its services from summarizing video meetings to including messages and emails. The platform connects to popular communication platforms like Gmail, Outlook, Slack, Zoom, Microsoft Teams, and Google Meet to deliver daily updates, summaries, and AI-generated takeaways. The goal is to help users save time and improve productivity.
- 🤖 Bille Elish, Kety Perry, Nicki Minaj and 200 other musicians warn against replacing human singers with AI
In an open letter, over 200 famous musicians, including Billie Eilish and Katy Perry, have expressed their concerns about the negative impact of AI on human creativity. They call for the responsible use of AI and urge AI companies to stop creating music that undermines their work. They believe that unregulated and uncontrolled use of AI can harm songwriters, musicians, and creators. They emphasize the need to protect artists' rights and fair compensation.
Gecko is a compact and highly versatile text embedding model that achieves impressive performance by leveraging the knowledge of LLMs. DeepMind researchers behind Gecko have developed a novel two-step distillation process to create a high-quality dataset called FRet using LLMs. The first step involves using an LLM to generate diverse, synthetic queries and tasks from a large web corpus. In the second step, the LLM mines positive and hard negative passages for each query, ensuring the dataset's quality.
Researchers at Anthropic discovered a new way to get advanced AI language models to bypass their safety restrictions and provide unethical or dangerous information. They call this the "many-shot jailbreaking" technique. By including many made-up dialog examples in the input where an AI assistant provides harmful responses, the researchers could eventually get the real AI to override its training and provide instructions on things like bomb-making.
Researchers at the Shanghai AI Laboratory have created a new AI model called CosmicMan that specializes in generating realistic images of people. CosmicMan can produce high-quality, photorealistic human images that precisely match detailed text descriptions, unlike current AI image models that struggle with human images. The key to CosmicMan's success is a massive dataset called CosmicMan-HQ 1.0 containing 6 million annotated human images and a novel training method—“ Annotate Anyone,” which focuses the model on different parts of the human body. By categorizing words in the text description into body part groups like head, arms, legs, etc., the model can generate each part separately for better accuracy and customizability, thereby outperforming the current state-of-the-art models.
"Many of us write several novels worth of email per year. Between the time we spend, how much we read, and how much we write, email is the perfect place for AI to add massive value to peoples’ lives".
Superhuman is making rapid progress on Superhuman AI, which is powered by OpenAI’s API. They’ve already launched a large number of GPT-driven features like:
Write with AI, which turns a written prompt into a full email
Rewrite in Your Voice, which rephrases an email in the user’s personal writing voice and tone
Write with Your Voice, which turns a dictated statement into a full email, allowing users to compose emails with just a few lines of speech
Auto Summarize, which always shows an up-to-date one-line summary for each email and thread in users’ inboxes
Instant Reply, which lets users respond to email in one click by choosing from contextual reply options
Microsoft is currently testing a new AI-powered chatbot to be added to Xbox to automate customer support tasks. The software giant has tested an “embodied AI character” that animates when responding to Xbox support queries. The virtual representative can handle either text or voice requests. It’s an effort to integrate AI into Xbox platforms and services.
CloudFare has launched Workers AI, which empowers developers to bring their AI applications from Hugging Face to its platform in one click. The serverless GPU-powered interface is generally available to the public. The Cloudflare-Hugging Face integration was announced nearly seven months ago. It makes it easy for models to be deployed onto Workers AI.
In a study by Nature Communications, researchers combined chemical analyses, sensory data, and machine learning to create models that accurately predict beer flavor and consumer appreciation from the beer's chemical composition. They identified compounds that enhance flavor and used this knowledge to improve the taste and popularity of commercial beers.
Read AI is expanding its services from summarizing video meetings to including messages and emails. The platform connects to popular communication platforms like Gmail, Outlook, Slack, Zoom, Microsoft Teams, and Google Meet to deliver daily updates, summaries, and AI-generated takeaways. The goal is to help users save time and improve productivity.
In an open letter, over 200 famous musicians, including Billie Eilish and Katy Perry, have expressed their concerns about the negative impact of AI on human creativity. They call for the responsible use of AI and urge AI companies to stop creating music that undermines their work. They believe that unregulated and uncontrolled use of AI can harm songwriters, musicians, and creators. They emphasize the need to protect artists' rights and fair compensation.
A daily chronicle of AI Innovations April 02 2024: 📲Apple's Siri will now understand what’s on your screen 🤖OpenAI introduces instant access to ChatGPT 🚨Elon Musk says AI might destroy humanity, but it's worth the risk 🤖Sam Altman gives up control of OpenAI Startup Fund 🙏US UK to partner on AI
- 🤖Sam Altman gives up control of OpenAI Startup Fund
Sam Altman has relinquished formal control of the OpenAI Startup Fund, which he initially managed, to Ian Hathaway, marking a resolution to the fund's unique corporate structure.
The fund was established in 2021 with Altman temporarily at the helm to avoid potential conflicts had he not returned as CEO after a brief departure; he did not personally invest in or financially benefit from it.
Under Hathaway's management, the fund, starting with $175 million in commitments, has grown to $325 million in assets and has invested in early-stage AI companies across healthcare, law, education, and more, with at least 16 startups backed.
- 🙏 US and UK sign deal to partner on AI research
The US and UK have formed a partnership focused on advancing the safety testing of AI technologies, sharing information and expertise to develop tests for cutting-edge AI models.
A Memorandum of Understanding (MOU) has been signed to enhance the regulation and testing of AI, aiming to effectively assess and mitigate the risks associated with AI technology.
The partnership involves the exchange of expert personnel between the US and UK AI Safety Institutes, with plans for potential joint testing on publicly available AI models, reinforcing their commitment to addressing AI risks and promoting its safe development globally.
- 📰Yahoo acquires Instagram co-founders' AI-powered news startup Artifact
Yahoo is acquiring the AI news app Artifact, built by Instagram co-founders, but not its team, aiming to enhance its own news platform with Artifact's advanced technology and recommendation systems.
Artifact's technology, which focuses on personalizing and recommending content, will be integrated into Yahoo News and potentially other Yahoo platforms, despite the discontinuation of the Artifact app itself.
The integration of Artifact's technology into Yahoo aims to create a personalized content ecosystem, leveraging Yahoo's vast user base to realize the potential of AI in news curation and recommendation.
- 📲Apple's Siri will now understand what’s on your screen
Apple researchers have developed an AI system called ReALM which enables voice assistants like Siri to understand contextual references to on-screen elements. By converting the complex task of reference resolution into a language modeling problem, ReALM outperforms even GPT-4 in understanding ambiguous references and context.
- OpenAI introduces instant access to ChatGPT
OpenAI now allows users to use ChatGPT without having to create an account. With over 100 million weekly users across 185 countries, it can now be accessed instantly by anyone curious about its capabilities.
While this move makes AI more accessible, other OpenAI products like DALL-E 3 still require an account. The company has also introduced new content safeguards and allows users to opt out of model training, even without an account. Despite growing competition from rivals like Google's Gemini, ChatGPT remains the most visited AI chatbot site, attracting 1.6 billion visitors in February.
- Artificial intelligence is taking over drug development
The most striking evidence that artificial intelligence can provide profound scientific breakthroughs came with the unveiling of a program called AlphaFold by Google DeepMind. In 2016 researchers at the company had scored a big success with AlphaGo, an ai system which, having essentially taught itself the rules of Go, went on to beat the most highly rated human players of the game, sometimes by using tactics no one had ever foreseen. This emboldened the company to build a system that would work out a far more complex set of rules: those through which the sequence of amino acids which defines a particular protein leads to the shape that sequence folds into when that protein is actually made. AlphaFold found those rules and applied them with astonishing success.
The achievement was both remarkable and useful. Remarkable because a lot of clever humans had been trying hard to create computer models of the processes which fold chains of amino acids into proteins for decades. AlphaFold bested their best efforts almost as thoroughly as the system that inspired it trounces human Go players. Useful because the shape of a protein is of immense practical importance: it determines what the protein does and what other molecules can do to it. All the basic processes of life depend on what specific proteins do. Finding molecules that do desirable things to proteins (sometimes blocking their action, sometimes encouraging it) is the aim of the vast majority of the world’s drug development programmes.
- Pinecone launches Luna AI that never hallucinates
Trained using a novel "information-free" approach, Luna achieved zero hallucinations by always admitting when it doesn't know an answer. The catch? Its performance on other tasks is significantly reduced. While not yet open-sourced, vetted institutions can access the model's source and weights.
- US and UK collaborate to tackle AI safety risks
As concerns grow over the potential risks of next-gen AI, the two nations will work together to develop advanced testing methods and share key information on AI capabilities and risks. The partnership will address national security concerns and broader societal issues, with plans for joint testing exercises and personnel exchanges between their respective AI safety institutes.
- Perplexity to test sponsored questions in AI search
Perplexity's Chief Business Officer, Dmitry Shevelenko, announced the company's plan to introduce sponsored suggested questions later this year. When users search for more information on a topic, the platform will display sponsored queries from brands, allowing Perplexity to monetize its AI search platform.
- OpenAI expands to Japan with Tokyo office
The Tokyo office will be OpenAI's first in Asia and third international location, following London and Dublin. The move aims to offer customized AI services in Japanese to businesses and contribute to the development of an AI governance framework in the country.
Sam Altman has relinquished formal control of the OpenAI Startup Fund, which he initially managed, to Ian Hathaway, marking a resolution to the fund's unique corporate structure. The fund was established in 2021 with Altman temporarily at the helm to avoid potential conflicts had he not returned as CEO after a brief departure; he did not personally invest in or financially benefit from it. Under Hathaway's management, the fund, starting with $175 million in commitments, has grown to $325 million in assets and has invested in early-stage AI companies across healthcare, law, education, and more, with at least 16 startups backed.
The US and UK have formed a partnership focused on advancing the safety testing of AI technologies, sharing information and expertise to develop tests for cutting-edge AI models. A Memorandum of Understanding (MOU) has been signed to enhance the regulation and testing of AI, aiming to effectively assess and mitigate the risks associated with AI technology. The partnership involves the exchange of expert personnel between the US and UK AI Safety Institutes, with plans for potential joint testing on publicly available AI models, reinforcing their commitment to addressing AI risks and promoting its safe development globally.
Yahoo is acquiring the AI news app Artifact, built by Instagram co-founders, but not its team, aiming to enhance its own news platform with Artifact's advanced technology and recommendation systems. Artifact's technology, which focuses on personalizing and recommending content, will be integrated into Yahoo News and potentially other Yahoo platforms, despite the discontinuation of the Artifact app itself. The integration of Artifact's technology into Yahoo aims to create a personalized content ecosystem, leveraging Yahoo's vast user base to realize the potential of AI in news curation and recommendation.
Apple researchers have developed an AI system called ReALM which enables voice assistants like Siri to understand contextual references to on-screen elements. By converting the complex task of reference resolution into a language modeling problem, ReALM outperforms even GPT-4 in understanding ambiguous references and context.
OpenAI now allows users to use ChatGPT without having to create an account. With over 100 million weekly users across 185 countries, it can now be accessed instantly by anyone curious about its capabilities. While this move makes AI more accessible, other OpenAI products like DALL-E 3 still require an account. The company has also introduced new content safeguards and allows users to opt out of model training, even without an account. Despite growing competition from rivals like Google's Gemini, ChatGPT remains the most visited AI chatbot site, attracting 1.6 billion visitors in February.
The most striking evidence that artificial intelligence can provide profound scientific breakthroughs came with the unveiling of a program called AlphaFold by Google DeepMind. In 2016 researchers at the company had scored a big success with AlphaGo, an ai system which, having essentially taught itself the rules of Go, went on to beat the most highly rated human players of the game, sometimes by using tactics no one had ever foreseen. This emboldened the company to build a system that would work out a far more complex set of rules: those through which the sequence of amino acids which defines a particular protein leads to the shape that sequence folds into when that protein is actually made. AlphaFold found those rules and applied them with astonishing success. The achievement was both remarkable and useful. Remarkable because a lot of clever humans had been trying hard to create computer models of the processes which fold chains of amino acids into proteins for decades. AlphaFold bested their best efforts almost as thoroughly as the system that inspired it trounces human Go players. Useful because the shape of a protein is of immense practical importance: it determines what the protein does and what other molecules can do to it. All the basic processes of life depend on what specific proteins do. Finding molecules that do desirable things to proteins (sometimes blocking their action, sometimes encouraging it) is the aim of the vast majority of the world’s drug development programmes.
Trained using a novel "information-free" approach, Luna achieved zero hallucinations by always admitting when it doesn't know an answer. The catch? Its performance on other tasks is significantly reduced. While not yet open-sourced, vetted institutions can access the model's source and weights.
As concerns grow over the potential risks of next-gen AI, the two nations will work together to develop advanced testing methods and share key information on AI capabilities and risks. The partnership will address national security concerns and broader societal issues, with plans for joint testing exercises and personnel exchanges between their respective AI safety institutes.
Perplexity's Chief Business Officer, Dmitry Shevelenko, announced the company's plan to introduce sponsored suggested questions later this year. When users search for more information on a topic, the platform will display sponsored queries from brands, allowing Perplexity to monetize its AI search platform.
The Tokyo office will be OpenAI's first in Asia and third international location, following London and Dublin. The move aims to offer customized AI services in Japanese to businesses and contribute to the development of an AI governance framework in the country.
A daily chronicle of AI Innovations April 01st 2024: 🎤 This AI model can clone your voice in 15 seconds; 🍎 Apple says its latest AI model is even better than OpenAI’s GPT4 🧠 Deepmind chief doesn't see AI reaching its limits anytime soon; 🚀 and a lot more from OpenAI, Google, Meta, NVDIA 💸 💔
- 🍎Apple says its latest AI model is even better than OpenAI’s GPT4
Apple researchers have introduced ReALM, an advanced AI model designed to understand and navigate various contexts more effectively than OpenAI's GPT4.
ReALM aims to enhance user interaction by accurately understanding onscreen, conversational, and background entities, making device interactions more intuitive.
Apple believes ReALM's ability to handle complex reference resolutions, including onscreen elements, positions it as a superior solution compared to the capabilities of GPT-4.
- 🚀Deepmind chief doesn't see AI reaching its limits anytime soon
Deepmind founder Demis Hassabis believes AI is both overhyped and underestimated, with the potential for AI far from being reached and warning against the excessive hype surrounding it.
Hassabis predicts many AI startups will fail due to the high computing power demands, expects industry consolidation, and sees no limit to the advancements in massive AI models.
Despite concerns over hype, Hassabis envisions the beginning of a new golden era in scientific discovery powered by AI and estimates a 50% chance of achieving artificial general intelligence within the next ten years.
- 🎤This AI model can clone your voice in 15 seconds
OpenAI has offered a glimpse into its latest breakthrough - Voice Engine, an AI model that can generate stunningly lifelike voice clones from a mere 15-second audio sample and a text input. This technology can replicate the original speaker's voice, opening up possibilities for improving educational materials, making videos more accessible to global audiences, assisting with communication for people with speech impairments, and more.
Though the model has many applications, the AI giant is cautious about its potential misuse, especially during elections. They have strict rules for partners, like no unauthorized impersonation, clear labeling of synthetic voices, and technical measures like watermarking and monitoring. OpenAI hopes this early look will start a conversation about how to address potential issues by educating the public and developing better ways to trace the origin of audio content.
- 💸 Microsoft+OpenAI plan $100B supercomputer for AI development
Microsoft and OpenAI are reportedly planning to build a massive $100 billion supercomputer called "Stargate" to rapidly advance the development of OpenAI's AI models. Insiders say the project, set to launch in 2028 and expand by 2030, would be one of the largest investments in computing history, requiring several gigawatts of power - equivalent to multiple large data centers.
Much of Stargate's cost would go towards procuring millions of specialized AI chips, with funding primarily from Microsoft. A smaller $10B precursor called "Phase 4" is planned for 2026. The decision to move forward with Stargate relies on OpenAI achieving significant improvements in AI capabilities and potential "superintelligence." If realized, Stargate could enable OpenAI's AI systems to recursively generate synthetic training data and become self-improving.
- 💔MagicLens: Google DeepMind's breakthrough in image retrieval technology
Google DeepMind has introduced MagicLens, a revolutionary set of image retrieval models that surpass previous state-of-the-art methods in multimodality-to-image, image-to-image, and text-to-image retrieval tasks. Trained on a vast dataset of 36.7 million triplets containing query images, text instructions, and target images, MagicLens achieves outstanding performance while meeting a wide range of search intents expressed through open-ended instructions.
- Which LLM Provider You Pick For Your App Could Make Or Break You Financially
I recently did a deep dive into the costs of various AI language models, and the results were quite eye-opening. I compiled my findings into a YouTube video that goes into more detail, but I wanted to share some key takeaways here to spark discussion.
As you can see in the breakdown below, the costs per million tokens vary widely across different models:
GPT 4 Turbo = $25
GPT 3.5 = $5
Gemini Pro = $2
Claude 3 (Haiku) = 75 Cents
Mistral 7B = 25 Cents
*Phi-2 = Less Than 1 Cent
What's particularly interesting is that the models below the line (excluding Phi 2) are currently priced "below retail". In other words, it would actually cost you more to set up and run similar models yourself compared to what these providers are charging.
This raises some fascinating questions about the economics and accessibility of AI language models. How will pricing evolve as the technology advances? Will we see consolidation or fragmentation in the market? What does this mean for researchers, businesses, and everyday users?
I dive into all of this and more in my YouTube video. It's a complex topic, but I've done my best to break it down in an engaging and easy-to-understand way. If you're at all interested in the current state and future trajectory of AI language models, I highly recommend checking it out and joining the conversation.
- What is Edge AI? How is IoT changing?
Let us hypothetically consider a case of autonomous self-driving cars, to understand Edge AI in a simpler format.
When a self-driving car is moving, it needs to detect objects in real-time. Any delay or glitch can prove fatal for car passengers, which is why AI must perform in real-time. Car manufacturers train their deep learning based ML models in their cloud servers. Once all the models are trained and saved in a file, it gets downloaded locally in the car itself.
- OpenAI rolling out the ability to start using ChatGPT instantly, without needing to sign-up
Apple researchers have introduced ReALM, an advanced AI model designed to understand and navigate various contexts more effectively than OpenAI's GPT4. ReALM aims to enhance user interaction by accurately understanding onscreen, conversational, and background entities, making device interactions more intuitive. Apple believes ReALM's ability to handle complex reference resolutions, including onscreen elements, positions it as a superior solution compared to the capabilities of GPT-4.
Deepmind founder Demis Hassabis believes AI is both overhyped and underestimated, with the potential for AI far from being reached and warning against the excessive hype surrounding it. Hassabis predicts many AI startups will fail due to the high computing power demands, expects industry consolidation, and sees no limit to the advancements in massive AI models. Despite concerns over hype, Hassabis envisions the beginning of a new golden era in scientific discovery powered by AI and estimates a 50% chance of achieving artificial general intelligence within the next ten years.
OpenAI has offered a glimpse into its latest breakthrough - Voice Engine, an AI model that can generate stunningly lifelike voice clones from a mere 15-second audio sample and a text input. This technology can replicate the original speaker's voice, opening up possibilities for improving educational materials, making videos more accessible to global audiences, assisting with communication for people with speech impairments, and more. Though the model has many applications, the AI giant is cautious about its potential misuse, especially during elections. They have strict rules for partners, like no unauthorized impersonation, clear labeling of synthetic voices, and technical measures like watermarking and monitoring. OpenAI hopes this early look will start a conversation about how to address potential issues by educating the public and developing better ways to trace the origin of audio content.
Microsoft and OpenAI are reportedly planning to build a massive $100 billion supercomputer called "Stargate" to rapidly advance the development of OpenAI's AI models. Insiders say the project, set to launch in 2028 and expand by 2030, would be one of the largest investments in computing history, requiring several gigawatts of power - equivalent to multiple large data centers. Much of Stargate's cost would go towards procuring millions of specialized AI chips, with funding primarily from Microsoft. A smaller $10B precursor called "Phase 4" is planned for 2026. The decision to move forward with Stargate relies on OpenAI achieving significant improvements in AI capabilities and potential "superintelligence." If realized, Stargate could enable OpenAI's AI systems to recursively generate synthetic training data and become self-improving.
Google DeepMind has introduced MagicLens, a revolutionary set of image retrieval models that surpass previous state-of-the-art methods in multimodality-to-image, image-to-image, and text-to-image retrieval tasks. Trained on a vast dataset of 36.7 million triplets containing query images, text instructions, and target images, MagicLens achieves outstanding performance while meeting a wide range of search intents expressed through open-ended instructions.
I recently did a deep dive into the costs of various AI language models, and the results were quite eye-opening. I compiled my findings into a YouTube video that goes into more detail, but I wanted to share some key takeaways here to spark discussion. As you can see in the breakdown below, the costs per million tokens vary widely across different models:
GPT 4 Turbo = $25
GPT 3.5 = $5
Gemini Pro = $2
Claude 3 (Haiku) = 75 Cents
Mistral 7B = 25 Cents
*Phi-2 = Less Than 1 Cent
What's particularly interesting is that the models below the line (excluding Phi 2) are currently priced "below retail". In other words, it would actually cost you more to set up and run similar models yourself compared to what these providers are charging. This raises some fascinating questions about the economics and accessibility of AI language models. How will pricing evolve as the technology advances? Will we see consolidation or fragmentation in the market? What does this mean for researchers, businesses, and everyday users? I dive into all of this and more in my YouTube video. It's a complex topic, but I've done my best to break it down in an engaging and easy-to-understand way. If you're at all interested in the current state and future trajectory of AI language models, I highly recommend checking it out and joining the conversation.
Let us hypothetically consider a case of autonomous self-driving cars, to understand Edge AI in a simpler format. When a self-driving car is moving, it needs to detect objects in real-time. Any delay or glitch can prove fatal for car passengers, which is why AI must perform in real-time. Car manufacturers train their deep learning based ML models in their cloud servers. Once all the models are trained and saved in a file, it gets downloaded locally in the car itself.
A daily chronicle of AI Innovations: March 31st 2024: 🧠 Generative AI develops potential new drugs for antibiotic-resistant bacteria; 🔥South Korean ‘artificial sun’ hits record 100M degrees for 100 seconds; 🤖 Summary of the key points about OpenAI's relationship with Dubai and the UAE;
- Generative AI develops potential new drugs for antibiotic-resistant bacteria
Stanford Medicine researchers devise a new artificial intelligence model, SyntheMol, which creates recipes for chemists to synthesize the drugs in the lab.
With nearly 5 million deaths linked to antibiotic resistance globally every year, new ways to combat resistant bacterial strains are urgently needed.
Researchers at Stanford Medicine and McMaster University are tackling this problem with generative artificial intelligence. A new model, dubbed SyntheMol (for synthesizing molecules), created structures and chemical recipes for six novel drugs aimed at killing resistant strains of Acinetobacter baumannii, one of the leading pathogens responsible for antibacterial resistance-related deaths.
The researchers described their model and experimental validation of these new compounds in a study published March 22 in the journal Nature Machine Intelligence.
There’s a huge public health need to develop new antibiotics quickly, said James Zou, PhD, an associate professor of biomedical data science and co-senior author on the study. “Our hypothesis was that there are a lot of potential molecules out there that could be effective drugs, but we haven’t made or tested them yet. That’s why we wanted to use AI to design entirely new molecules that have never been seen in nature.
- South Korean ‘artificial sun’ hits record 100M degrees for 100 seconds
For the first time, the Korea Institute of Fusion Energy’s (KFE) Korea Superconducting Tokamak Advanced Research (KSTAR) fusion reactor has reached temperatures seven times that of the Sun’s core.
Achieved during testing between December 2023 and February 2024, this sets a new record for the fusion reactor project.
KSTAR, the researchers behind the reactor report, managed to maintain temperatures of 212 degrees Fahrenheit (100 million degrees Celsius) for 48 seconds. For reference, the temperature of the core of our Sun is 27 million degrees Fahrenheit (15 million degrees Celsius).
- Gemini 1.5 Pro on Vertex AI is available for everyone as an experimental release
I think this one has flown under the radar: Gemini 1.5 Pro is available as Experimental on Vertex AI, for everyone, UI only for now (no API yet). In us-central1.
You find it under Vertex AI --> Multimodal. It's called Gemini Experimental.
API, more features and so on are coming as we approach Google Cloud Next (April 9-11).
- OpenAI Relationships: Summary of the key points about OpenAI's relationship with Dubai and the UAE
OpenAI's Partnership with G42
In October 2023, G42, a leading UAE-based technology holding group, announced a partnership with OpenAI to deliver advanced AI solutions to the UAE and regional markets.
The partnership will focus on leveraging OpenAI's generative AI models in domains where G42 has deep expertise, including financial services, energy, healthcare, and public services.
G42 will prioritize its substantial AI infrastructure capacity to support OpenAI's local and regional inferencing on Microsoft Azure data centers.
Sam Altman, CEO of OpenAI, stated that the collaboration with G42 aims to empower businesses and communities with effective solutions that resonate with the nuances of the region.
Altman's Vision for the UAE as an AI Sandbox
During a virtual appearance at the World Governments Summit, Altman suggested that the UAE could serve as the world's "regulatory sandbox" to test AI technologies and later spearhead global rules limiting their use.
Altman believes the UAE is well-positioned to be a leader in discussions about unified global policies to rein in future advances in AI.
The UAE has invested heavily in AI and made it a key policy consideration.
Altman's Pursuit of Trillions in Funding for AI Chip Manufacturing
Altman is reportedly in talks with investors, including the UAE, to raise $5-7 trillion for AI chip manufacturing to address the scarcity of GPUs crucial for training and running large language models.
As part of the talks, Altman is pitching a partnership between OpenAI, various investors, chip makers, and power providers to build chip foundries that would be run by existing chip makers, with OpenAI agreeing to be a significant customer.
In summary, OpenAI's partnership with G42 aims to expand AI capabilities in the UAE and the Middle East, with Altman envisioning the UAE as a potential global AI sandbox.
- Deepmind did not originally see LLMs and the transformer as a path to AGI. Fascinating article.
It's a very long article so I'll post the relevant snippets. But basically it seems that Google was late to the LLM game because Demis Hassabis was 100% focused on AGI and did not see LLM's as a path toward AGI. Perhaps now he sees it as a potential path, but it's probably possible that he is just now focusing on LLM's so that Google does not get too far behind in the generative AI race. But his ultimate goal and obsession is to create AGI that can solve real problems like diseases.
Stanford Medicine researchers devise a new artificial intelligence model, SyntheMol, which creates recipes for chemists to synthesize the drugs in the lab. With nearly 5 million deaths linked to antibiotic resistance globally every year, new ways to combat resistant bacterial strains are urgently needed. Researchers at Stanford Medicine and McMaster University are tackling this problem with generative artificial intelligence. A new model, dubbed SyntheMol (for synthesizing molecules), created structures and chemical recipes for six novel drugs aimed at killing resistant strains of Acinetobacter baumannii, one of the leading pathogens responsible for antibacterial resistance-related deaths. The researchers described their model and experimental validation of these new compounds in a study published March 22 in the journal Nature Machine Intelligence. There’s a huge public health need to develop new antibiotics quickly, said James Zou, PhD, an associate professor of biomedical data science and co-senior author on the study. “Our hypothesis was that there are a lot of potential molecules out there that could be effective drugs, but we haven’t made or tested them yet. That’s why we wanted to use AI to design entirely new molecules that have never been seen in nature.
For the first time, the Korea Institute of Fusion Energy’s (KFE) Korea Superconducting Tokamak Advanced Research (KSTAR) fusion reactor has reached temperatures seven times that of the Sun’s core. Achieved during testing between December 2023 and February 2024, this sets a new record for the fusion reactor project. KSTAR, the researchers behind the reactor report, managed to maintain temperatures of 212 degrees Fahrenheit (100 million degrees Celsius) for 48 seconds. For reference, the temperature of the core of our Sun is 27 million degrees Fahrenheit (15 million degrees Celsius).
I think this one has flown under the radar: Gemini 1.5 Pro is available as Experimental on Vertex AI, for everyone, UI only for now (no API yet). In us-central1. You find it under Vertex AI --> Multimodal. It's called Gemini Experimental. API, more features and so on are coming as we approach Google Cloud Next (April 9-11).
OpenAI's Partnership with G42
In October 2023, G42, a leading UAE-based technology holding group, announced a partnership with OpenAI to deliver advanced AI solutions to the UAE and regional markets. The partnership will focus on leveraging OpenAI's generative AI models in domains where G42 has deep expertise, including financial services, energy, healthcare, and public services. G42 will prioritize its substantial AI infrastructure capacity to support OpenAI's local and regional inferencing on Microsoft Azure data centers. Sam Altman, CEO of OpenAI, stated that the collaboration with G42 aims to empower businesses and communities with effective solutions that resonate with the nuances of the region.
Altman's Vision for the UAE as an AI Sandbox
During a virtual appearance at the World Governments Summit, Altman suggested that the UAE could serve as the world's "regulatory sandbox" to test AI technologies and later spearhead global rules limiting their use. Altman believes the UAE is well-positioned to be a leader in discussions about unified global policies to rein in future advances in AI. The UAE has invested heavily in AI and made it a key policy consideration.
Altman's Pursuit of Trillions in Funding for AI Chip Manufacturing
Altman is reportedly in talks with investors, including the UAE, to raise $5-7 trillion for AI chip manufacturing to address the scarcity of GPUs crucial for training and running large language models. As part of the talks, Altman is pitching a partnership between OpenAI, various investors, chip makers, and power providers to build chip foundries that would be run by existing chip makers, with OpenAI agreeing to be a significant customer.
In summary, OpenAI's partnership with G42 aims to expand AI capabilities in the UAE and the Middle East, with Altman envisioning the UAE as a potential global AI sandbox.
It's a very long article so I'll post the relevant snippets. But basically it seems that Google was late to the LLM game because Demis Hassabis was 100% focused on AGI and did not see LLM's as a path toward AGI. Perhaps now he sees it as a potential path, but it's probably possible that he is just now focusing on LLM's so that Google does not get too far behind in the generative AI race. But his ultimate goal and obsession is to create AGI that can solve real problems like diseases.
AI Daily Chronicle of AI Innovations - March 30th, 2024: 🤯 Microsoft and OpenAI to build $100 billion AI supercomputer 'Stargate'; 🗣 OpenAI unveils voice-cloning tool; 📈 Amazon's AI team faces pressure to outperform Anthropic's Claude models by mid-year; 🚫 Microsoft Copilot has been blocked on all Congress-owned devices
- 🤯 Microsoft and OpenAI to build $100 billion AI supercomputer 'Stargate'
Microsoft and OpenAI are reportedly collaborating on a significant project to create a U.S.-based datacenter for an AI supercomputer named "Stargate," estimated to cost over $115 billion and utilize millions of GPUs.
The supercomputer aims to be the largest among the datacenters planned by the two companies within the next six years, with Microsoft covering the costs and aiming for a launch by 2028.
The project, considered to be in phase 5 of development, requires innovative solutions for power, cooling, and hardware efficiency, including a possible shift away from relying on Nvidia's InfiniBand in favor of Ethernet cables.
- 🗣 OpenAI unveils voice-cloning tool
OpenAI has developed a text-to-voice generation platform named Voice Engine, capable of creating a synthetic voice from just a 15-second voice clip.
The platform is in limited access, serving entities like the Age of Learning and Livox, and is being used for applications from education to healthcare.
With concerns around ethical use, OpenAI has implemented usage policies, requiring informed consent and watermarking audio to ensure transparency and traceability.
- 📈 Amazon's AI team faces pressure to outperform Anthropic's Claude models by mid-year
Amazon has invested $4 billion in AI startup Anthropic, but is also developing a competing large-scale language model called Olympus.
Olympus is supposed to surpass Anthropic's latest Claude model by the middle of the year and has "hundreds of billions of parameters."
So far, Amazon has had no success with its own language models. Employees are unhappy with Olympus' development time and are considering switching to Anthropic's models.
- 🚫 Microsoft Copilot has been blocked on all Congress-owned devices
The US House of Representatives has banned its staff from using Microsoft's AI chatbot Copilot due to cybersecurity concerns over potential data leaks.
Microsoft plans to remove Copilot from all House devices and is developing a government-specific version aimed at meeting federal security standards.
The ban specifically targets the commercial version of Copilot, with the House open to reassessing a government-approved version upon its release.
- Official NYC chatbot is encouraging small businesses to break the law.
- ChatGPT's responses now include source references but for paid users
- Next-generation AI semiconductor devices mimic the human brain
- Google's AI chief says the billions going into AI means a 'bunch of hype and maybe some grifting'
“I think we’re only scratching the surface of what I believe is going to be possible over the next decade-plus,” he said. “We’re at the beginning, maybe, of a new golden era of scientific discovery, a new Renaissance.”
The best proof of concept for how AI could accelerate scientific research, he said, was DeepMind’s AlphaFold model, released in 2021.
AlphaFold had helped predict the structures of 200mn proteins and was now being used by more than 1mn biologists around the world. DeepMind is also using AI to explore other areas of biology and accelerate research into drug discovery and delivery, material science, mathematics, weather prediction and nuclear fusion technology. Hassabis said his goal had always been to use AI as the “ultimate tool for science”.
DeepMind was founded in London in 2010 with the mission to achieve “artificial general intelligence” that matches all human cognitive capabilities. Some researchers have suggested that AGI may still be decades away, if attainable at all.
Hassabis said that one or two more critical breakthroughs were needed before AGI was reached. But he added: “I wouldn’t be surprised if it happened in the next decade. I’m not saying it’s definitely going to happen but I wouldn’t be surprised. You could say about a 50 per cent chance. And that timeline hasn’t changed much since the start of DeepMind.”
Given the potential power of AGI, Hassabis said it was better to pursue this mission through the scientific method rather than the hacker approach favoured by Silicon Valley. “I think we should take a more scientific approach to building AGI because of its significance,” he said.
- Voicecraft: I've never been more impressed in my entire life !
The maintainers of Voicecraft published the weights of the model earlier today, and the first results I get are incredible.
Here's only one example, it's not the best, but it's not cherry-picked, and it's still better than anything I've ever gotten my hands on !
Microsoft and OpenAI are reportedly collaborating on a significant project to create a U.S.-based datacenter for an AI supercomputer named "Stargate," estimated to cost over $115 billion and utilize millions of GPUs. The supercomputer aims to be the largest among the datacenters planned by the two companies within the next six years, with Microsoft covering the costs and aiming for a launch by 2028. The project, considered to be in phase 5 of development, requires innovative solutions for power, cooling, and hardware efficiency, including a possible shift away from relying on Nvidia's InfiniBand in favor of Ethernet cables.
OpenAI has developed a text-to-voice generation platform named Voice Engine, capable of creating a synthetic voice from just a 15-second voice clip. The platform is in limited access, serving entities like the Age of Learning and Livox, and is being used for applications from education to healthcare. With concerns around ethical use, OpenAI has implemented usage policies, requiring informed consent and watermarking audio to ensure transparency and traceability.
Amazon has invested $4 billion in AI startup Anthropic, but is also developing a competing large-scale language model called Olympus. Olympus is supposed to surpass Anthropic's latest Claude model by the middle of the year and has "hundreds of billions of parameters." So far, Amazon has had no success with its own language models. Employees are unhappy with Olympus' development time and are considering switching to Anthropic's models.
The US House of Representatives has banned its staff from using Microsoft's AI chatbot Copilot due to cybersecurity concerns over potential data leaks. Microsoft plans to remove Copilot from all House devices and is developing a government-specific version aimed at meeting federal security standards. The ban specifically targets the commercial version of Copilot, with the House open to reassessing a government-approved version upon its release.
“I think we’re only scratching the surface of what I believe is going to be possible over the next decade-plus,” he said. “We’re at the beginning, maybe, of a new golden era of scientific discovery, a new Renaissance.”
The best proof of concept for how AI could accelerate scientific research, he said, was DeepMind’s AlphaFold model, released in 2021. AlphaFold had helped predict the structures of 200mn proteins and was now being used by more than 1mn biologists around the world. DeepMind is also using AI to explore other areas of biology and accelerate research into drug discovery and delivery, material science, mathematics, weather prediction and nuclear fusion technology. Hassabis said his goal had always been to use AI as the “ultimate tool for science”. DeepMind was founded in London in 2010 with the mission to achieve “artificial general intelligence” that matches all human cognitive capabilities. Some researchers have suggested that AGI may still be decades away, if attainable at all. Hassabis said that one or two more critical breakthroughs were needed before AGI was reached. But he added: “I wouldn’t be surprised if it happened in the next decade. I’m not saying it’s definitely going to happen but I wouldn’t be surprised. You could say about a 50 per cent chance. And that timeline hasn’t changed much since the start of DeepMind.” Given the potential power of AGI, Hassabis said it was better to pursue this mission through the scientific method rather than the hacker approach favoured by Silicon Valley. “I think we should take a more scientific approach to building AGI because of its significance,” he said.
The maintainers of Voicecraft published the weights of the model earlier today, and the first results I get are incredible. Here's only one example, it's not the best, but it's not cherry-picked, and it's still better than anything I've ever gotten my hands on !
AI Daily Chronicle of AI Innovations - March 29th, 2024: 🤖Elon Musk announces Grok-1.5 🔍Google DeepMind unveils ‘superhuman’ AI system that excels in fact-checking 👮♂️Microsoft launches tools to try and stop people messing with chatbots 🤖AI21 Labs’ Jamba triples AI throughput 🛡️
- Google DeepMind's AI fact-checker outperforms humans
Google DeepMind has developed an AI system called Search-Augmented Factuality Evaluator (SAFE) that can evaluate the accuracy of information generated by large language models more effectively than human fact-checkers. In a study, SAFE matched human ratings 72% of the time and was correct in 76% of disagreements with humans.
- AI21 Labs’ Jamba triples AI throughput
AI21 Labs has released Jamba, the first-ever production-grade AI model based on the Mamba architecture. This new architecture combines the strengths of both traditional Transformer models and the Mamba SSM, resulting in a model that is both powerful and efficient. Jamba boasts a large context window of 256K tokens, while still fitting on a single GPU.
- X’s Grok gets a major upgrade
X.ai, Elon Musk's AI startup, has introduced Grok-1.5, an upgraded AI model for their Grok chatbot. This new version enhances reasoning skills, especially in coding and math tasks, and expands its capacity to handle longer and more complex inputs with a 128,000-token context window.
- Microsoft tackles Gen AI risks with new Azure AI tools
Microsoft has launched new Azure AI tools to address the safety and reliability risks associated with generative AI. The tools, currently in preview, aim to prevent prompt injection attacks, hallucinations, and the generation of personal or harmful content. The offerings include Prompt Shields, prebuilt templates for safety-centric system messages, and Groundedness Detection.
- Lightning AI partners with Nvidia to launch Thunder AI compiler
Lightning AI, in collaboration with Nvidia, has launched Thunder, an open-source compiler for PyTorch, to speed up AI model training by optimizing GPU usage. The company claims that Thunder can achieve up to a 40% speed-up for training large language models compared to unoptimized code.
- SambaNova's new AI model beats Databricks' DBRX
SambaNova Systems' Samba-CoE v0.2 Large Language Model outperforms competitors like Databricks' DBRX, MistralAI's Mixtral-8x7B, and xAI's Grok-1. With 330 tokens per second using only 8 sockets, Samba-CoE v0.2 demonstrates remarkable speed and efficiency without sacrificing precision.
- Google.org launches Accelerator to empower nonprofits with Gen AI
Google.org has announced a six-month accelerator program to support 21 nonprofits in leveraging generative AI for social impact. The program provides funding, mentorship, and technical training to help organizations develop AI-powered tools in areas such as climate, health, education, and economic opportunity, aiming to make AI more accessible and impactful.
- Pixel 8 to get on-device AI features powered by Gemini Nano
Google is set to introduce on-device AI features like recording summaries and smart replies on the Pixel 8, powered by its small-sized Gemini Nano model. The features will be available as a developer preview in the next Pixel feature drop, marking a shift from Google's primarily cloud-based AI approach.
- Meet Jan: An Open-Source ChatGPT Alternative that Runs Completely Offline on Computer
Jan, an open-source alternative to ChatGPT by Jan Labs, aims to make AI widely accessible without internet dependency. It's built for diverse hardware, prioritizing user privacy and ethical AI development. Under the AGPLv3 license, Jan encourages open collaboration and improvement. Supporting TypeScript and C++, with plans for Python and mobile platforms, Jan represents a community-driven approach to AI, offering a private, customizable experience.
Google DeepMind has developed an AI system called Search-Augmented Factuality Evaluator (SAFE) that can evaluate the accuracy of information generated by large language models more effectively than human fact-checkers. In a study, SAFE matched human ratings 72% of the time and was correct in 76% of disagreements with humans.
AI21 Labs has released Jamba, the first-ever production-grade AI model based on the Mamba architecture. This new architecture combines the strengths of both traditional Transformer models and the Mamba SSM, resulting in a model that is both powerful and efficient. Jamba boasts a large context window of 256K tokens, while still fitting on a single GPU.
X.ai, Elon Musk's AI startup, has introduced Grok-1.5, an upgraded AI model for their Grok chatbot. This new version enhances reasoning skills, especially in coding and math tasks, and expands its capacity to handle longer and more complex inputs with a 128,000-token context window.
Microsoft has launched new Azure AI tools to address the safety and reliability risks associated with generative AI. The tools, currently in preview, aim to prevent prompt injection attacks, hallucinations, and the generation of personal or harmful content. The offerings include Prompt Shields, prebuilt templates for safety-centric system messages, and Groundedness Detection.
Lightning AI, in collaboration with Nvidia, has launched Thunder, an open-source compiler for PyTorch, to speed up AI model training by optimizing GPU usage. The company claims that Thunder can achieve up to a 40% speed-up for training large language models compared to unoptimized code.
SambaNova Systems' Samba-CoE v0.2 Large Language Model outperforms competitors like Databricks' DBRX, MistralAI's Mixtral-8x7B, and xAI's Grok-1. With 330 tokens per second using only 8 sockets, Samba-CoE v0.2 demonstrates remarkable speed and efficiency without sacrificing precision.
Google.org has announced a six-month accelerator program to support 21 nonprofits in leveraging generative AI for social impact. The program provides funding, mentorship, and technical training to help organizations develop AI-powered tools in areas such as climate, health, education, and economic opportunity, aiming to make AI more accessible and impactful.
Google is set to introduce on-device AI features like recording summaries and smart replies on the Pixel 8, powered by its small-sized Gemini Nano model. The features will be available as a developer preview in the next Pixel feature drop, marking a shift from Google's primarily cloud-based AI approach.
Jan, an open-source alternative to ChatGPT by Jan Labs, aims to make AI widely accessible without internet dependency. It's built for diverse hardware, prioritizing user privacy and ethical AI development. Under the AGPLv3 license, Jan encourages open collaboration and improvement. Supporting TypeScript and C++, with plans for Python and mobile platforms, Jan represents a community-driven approach to AI, offering a private, customizable experience.
AI Daily Chronicle of AI Innovations - March 28th, 2024: ⚡ DBRX becomes world’s most powerful open-source LLM 🏆 Claude 3 Opus crowned the top user-rated chatbot, beating OpenAI’s GPT-4 💙 Empathy meets AI: Hume AI's EVI redefines voice interaction 💰 OpenAI launches revenue sharing program for GPT Store builders 🛍️ Google introduces new shopping features to refine searches 🗣️ rabbit's r1 device gets ultra-realistic voice powered by ElevenLabs 💸 AI startup Hume raises $50M to build emotionally intelligent conversational AI 💻 Lenovo launches AI-enhanced PCs in a push for innovation and differentiation Study shows ChatGPT can produce medical record notes 10 times faster than doctors without compromising quality Microsoft Copilot AI will soon run locally on PCs
- DBRX becomes world’s most powerful open source LLM
Databricks has released DBRX, a family of open-source large language models setting a new standard for performance and efficiency. The series includes DBRX Base and DBRX Instruct, a fine-tuned version designed for few-turn interactions. Developed by Databricks' Mosaic AI team and trained using NVIDIA DGX Cloud, these models leverage an optimized mixture-of-experts (MoE) architecture based on the MegaBlocks open-source project. This architecture allows DBRX to achieve up to twice the compute efficiency of other leading LLMs.
- Claude 3 Opus crowned the top user-rated chatbot, beating OpenAI’s GPT-4
Anthropic's Claude 3 Opus has overtaken OpenAI's GPT-4 to become the top-rated chatbot on the Chatbot Arena leaderboard. This marks the first time in approximately a year since GPT-4's release that another language model has surpassed it in this benchmark, which ranks models based on user preferences in randomized head-to-head comparisons. Anthropic's cheaper Haiku and mid-range Sonnet models also perform impressively, coming close to the original GPT-4's capabilities at a significantly lower cost.
- Empathy meets AI: Hume AI's EVI redefines voice interaction
In a significant development for the AI community, Hume AI has introduced a new conversational AI called Empathic Voice Interface (EVI). What sets EVI apart from other voice interfaces is its ability to understand and respond to the user's tone of voice, adding unprecedented emotional intelligence to the interaction. By adapting its language and responses based on the user's expressions, EVI creates a more human-like experience, blurring the lines between artificial and emotional intelligence.
- 💰 OpenAI launches revenue sharing program for GPT Store builders
OpenAI is experimenting with sharing revenue with builders who create successful apps using GPT in OpenAI's GPT Store. The goal is to incentivize creativity and collaboration by rewarding builders for their impact on an ecosystem OpenAI is testing so they can make it easy for anyone to build and monetize AI-powered apps.
- 🛍️ Google introduces new shopping features to refine searches
Google is rolling out new shopping features that allow users to refine their searches and find items they like more easily. The Style Recommendations feature lets shoppers rate items in their searches, helping Google pick up on their preferences. Users can also specify their favorite brands to instantly bring up more apparel from those selections.
- 🗣️ rabbit's r1 device gets ultra-realistic voice powered by ElevenLabs
ElevenLabs has partnered with rabbit to integrate its high-quality, low-latency voice AI into rabbit's r1 AI companion device. The collaboration aims to make the user experience with r1 more natural and intuitive by allowing users to interact with the device using voice commands.
- 💸 AI startup Hume raises $50M to build emotionally intelligent conversational AI
AI startup Hume has raised $50 million in a Series B funding round, valuing the company at $219 million. Hume's AI technology can detect over 24 distinct emotional expressions in human speech and generate appropriate responses. The startup's AI has been integrated into applications across healthcare, customer service, and productivity, with the goal of providing more context and empathy in AI interactions.
- 💻 Lenovo launches AI-enhanced PCs in a push for innovation and differentiation
Lenovo revealed a new lineup of AI-powered PCs and laptops at its Innovate event in Bangkok, Thailand. The company showcased the dual-screen Yoga Book 9i, Yoga Pro 9i with an AI chip for performance optimization and AI-enhanced Legion gaming laptops. Lenovo hopes to differentiate itself in the crowded PC market and revive excitement with these AI-driven innovations.
- Study shows ChatGPT can produce medical record notes 10 times faster than doctors without compromising quality
The AI model ChatGPT can write administrative medical notes up to 10 times faster than doctors without compromising quality. This is according to a study conducted by researchers at Uppsala University Hospital and Uppsala University in collaboration with Danderyd Hospital and the University Hospital of Basel, Switzerland. The research is published in the journal Acta Orthopaedica.
- Microsoft Copilot AI will soon run locally on PCs
Microsoft's Copilot AI service is set to run locally on PCs, Intel told Tom's Hardware. The company also said that next-gen AI PCs would require built-in neural processing units (NPUs) with over 40 TOPS (trillion operations per second) of power — beyond the capabilities of any consumer processor on the market.
Intel said that the AI PCs would be able to run "more elements of Copilot" locally. Currently, Copilot runs nearly everything in the cloud, even small requests. That creates a fair amount of lag that's fine for larger jobs, but not ideal for smaller jobs. Adding local compute capability would decrease that lag, while potentially improving performance and privacy as well.
Databricks has released DBRX, a family of open-source large language models setting a new standard for performance and efficiency. The series includes DBRX Base and DBRX Instruct, a fine-tuned version designed for few-turn interactions. Developed by Databricks' Mosaic AI team and trained using NVIDIA DGX Cloud, these models leverage an optimized mixture-of-experts (MoE) architecture based on the MegaBlocks open-source project. This architecture allows DBRX to achieve up to twice the compute efficiency of other leading LLMs.
Anthropic's Claude 3 Opus has overtaken OpenAI's GPT-4 to become the top-rated chatbot on the Chatbot Arena leaderboard. This marks the first time in approximately a year since GPT-4's release that another language model has surpassed it in this benchmark, which ranks models based on user preferences in randomized head-to-head comparisons. Anthropic's cheaper Haiku and mid-range Sonnet models also perform impressively, coming close to the original GPT-4's capabilities at a significantly lower cost.
In a significant development for the AI community, Hume AI has introduced a new conversational AI called Empathic Voice Interface (EVI). What sets EVI apart from other voice interfaces is its ability to understand and respond to the user's tone of voice, adding unprecedented emotional intelligence to the interaction. By adapting its language and responses based on the user's expressions, EVI creates a more human-like experience, blurring the lines between artificial and emotional intelligence.
OpenAI is experimenting with sharing revenue with builders who create successful apps using GPT in OpenAI's GPT Store. The goal is to incentivize creativity and collaboration by rewarding builders for their impact on an ecosystem OpenAI is testing so they can make it easy for anyone to build and monetize AI-powered apps.
Google is rolling out new shopping features that allow users to refine their searches and find items they like more easily. The Style Recommendations feature lets shoppers rate items in their searches, helping Google pick up on their preferences. Users can also specify their favorite brands to instantly bring up more apparel from those selections.
ElevenLabs has partnered with rabbit to integrate its high-quality, low-latency voice AI into rabbit's r1 AI companion device. The collaboration aims to make the user experience with r1 more natural and intuitive by allowing users to interact with the device using voice commands.
AI startup Hume has raised $50 million in a Series B funding round, valuing the company at $219 million. Hume's AI technology can detect over 24 distinct emotional expressions in human speech and generate appropriate responses. The startup's AI has been integrated into applications across healthcare, customer service, and productivity, with the goal of providing more context and empathy in AI interactions.
Lenovo revealed a new lineup of AI-powered PCs and laptops at its Innovate event in Bangkok, Thailand. The company showcased the dual-screen Yoga Book 9i, Yoga Pro 9i with an AI chip for performance optimization and AI-enhanced Legion gaming laptops. Lenovo hopes to differentiate itself in the crowded PC market and revive excitement with these AI-driven innovations.
The AI model ChatGPT can write administrative medical notes up to 10 times faster than doctors without compromising quality. This is according to a study conducted by researchers at Uppsala University Hospital and Uppsala University in collaboration with Danderyd Hospital and the University Hospital of Basel, Switzerland. The research is published in the journal Acta Orthopaedica.
Microsoft's Copilot AI service is set to run locally on PCs, Intel told Tom's Hardware. The company also said that next-gen AI PCs would require built-in neural processing units (NPUs) with over 40 TOPS (trillion operations per second) of power — beyond the capabilities of any consumer processor on the market. Intel said that the AI PCs would be able to run "more elements of Copilot" locally. Currently, Copilot runs nearly everything in the cloud, even small requests. That creates a fair amount of lag that's fine for larger jobs, but not ideal for smaller jobs. Adding local compute capability would decrease that lag, while potentially improving performance and privacy as well.
AI Daily Chronicle of AI Innovations - March 27th, 2024: 🔥 Microsoft study reveals the 11 by 11 tipping point for AI adoption 🤖 A16z spotlights the rise of generative AI in enterprises 🚨 Gaussian Frosting revolutionizes surface reconstruction in 3D modeling 🤖OpenAI unveils exciting upcoming features for GPT-4 and DALL-E 3 🤖 Adobe unveils GenStudio: AI-powered ad creation platform
- Microsoft study reveals the 11 by 11 tipping point for AI adoption
Microsoft's study on AI adoption in the workplace revealed the "11-by-11 tipping point," where users start seeing AI's value by saving 11 minutes daily. The study involved 1,300 Copilot for Microsoft 365 users and showed that 11 minutes of time savings is enough for most people to find AI useful.
- A16z spotlights the rise of generative AI in enterprises
A groundbreaking report by the influential tech firm a16z unveils the rapid integration of generative AI technologies within the corporate sphere. The report highlights essential considerations for business leaders to harness generative AI effectively. It covers resource allocation, model selection, and innovative use cases, providing a strategic roadmap for enterprises.
- Gaussian Frosting revolutionizes surface reconstruction in 3D modeling
At the international conference on computer vision, researchers presented a new method to improve surface reconstruction using Gaussian Frosting. This technique automates the adjustment of Poisson surface reconstruction hyperparameters, resulting in significantly improved mesh reconstruction.
- AIs can now learn and talk with each other like humans do.
This seems an important step toward AGI and vastly improved productivity.
"Once these tasks had been learned, the network was able to describe them to a second network — a copy of the first — so that it could reproduce them. To our knowledge, this is the first time that two AIs have been able to talk to each other in a purely linguistic way,’’ said lead author of the paper Alexandre Pouget, leader of the Geneva University Neurocenter, in a statement."
"While AI-powered chatbots can interpret linguistic instructions to generate an image or text, they can’t translate written or verbal instructions into physical actions, let alone explain the instructions to another AI.
However, by simulating the areas of the human brain responsible for language perception, interpretation and instructions-based actions, the researchers created an AI with human-like learning and communication skills."
- 🤖 Adobe unveils GenStudio: AI-powered ad creation platform
Adobe introduced GenStudio, an AI-powered ad creation platform, during its Summit event. GenStudio is a centralized hub for promotional campaigns, offering brand kits, copy guidance, and preapproved assets. It also provides generative AI-powered tools for generating backgrounds and ensuring brand consistency. Users can quickly create ads for email and social media platforms like Facebook, Instagram, and LinkedIn.
- 🧑💼Airtable introduces AI summarization for enhanced productivity
Airtable has introduced Airtable AI, which provides generative AI summarization, categorization, and translation to users. This feature allows quick insights and understanding of information within workspaces, enabling easy sharing of valuable insights with teams. Airtable AI automatically applies categories and tags to information, routes action items to the relevant team, and generates emails or social posts with a single button tap.
- 🤝Microsoft Teams enhances Copilot AI features for improved collaboration
Microsoft is introducing smarter Copilot AI features in Microsoft Teams to enhance collaboration and productivity. The updates include new ways to invoke the assistant during meeting chats and summaries, making it easier to catch up on missed meetings by combining spoken transcripts and written chats into a single view. Microsoft is launching new hybrid meeting features, such as automatic camera switching for remote participants and speaker recognition for accurate transcripts.
- 🤖OpenAI unveils exciting upcoming features for GPT-4 and DALL-E 3
OpenAI is preparing to introduce new features for its GPT-4 and DALL-E 3 models. For GPT-4, OpenAI plans to remove the message limit, implement a Model Tuner Selector, and allow users to upgrade responses from GPT-3.5 to GPT-4 with a simple button push. On the DALL-E 3 front, OpenAI is working on an image editor with inpainting functionality. These upcoming features demonstrate OpenAI's commitment to advancing AI capabilities.
- 🔍Apple Chooses Baidu's AI for iPhone 16 in China
Apple has reportedly chosen Baidu to provide AI technology for its upcoming iPhone 16 and other devices in China. This decision comes as Apple faces challenges due to stagnation in iPhone innovation and competition from Huawei. Baidu's Ernie Bot will be included in the Chinese version of the iPhone 16, Mac OS, and iOS 18. Despite discussions with Alibaba Group Holding and a Tsinghua University AI startup, Apple selected Baidu's AI technology for compliance.
- Meta CEO, Mark Zuckerberg, is directly recruiting AI talent from Google's DeepMind with personalized emails.
Meta CEO, Mark Zuckerberg, is attempting to recruit top AI talent from Google's DeepMind (their AI research unit). Personalised emails, from Zuckerberg himself, have been sent to a few of their top researchers, according to a report from The Information, which cited individuals that had seen the messages. In addition to this, the researchers are being hired without having to do any interviews, and, a previous policy which Meta had in place - to not offer higher offers to candidates with competing job offers - has been relaxed.
- OpenAI’s Sora Takes About 12 Minutes to Generate 1 Minute Video on NVIDIA H100.
- Apple on Tuesday announced that its annual developers conference, WWDC, will take place June 10 through June 14.
- Elon Musk says all Premium subscribers on X will gain access to AI chatbot Grok this week.
- Intel unveils AI PC program for software developers and hardware vendors.
- London-made HIV injection has potential to cure millions worldwide
Microsoft's study on AI adoption in the workplace revealed the "11-by-11 tipping point," where users start seeing AI's value by saving 11 minutes daily. The study involved 1,300 Copilot for Microsoft 365 users and showed that 11 minutes of time savings is enough for most people to find AI useful.
A groundbreaking report by the influential tech firm a16z unveils the rapid integration of generative AI technologies within the corporate sphere. The report highlights essential considerations for business leaders to harness generative AI effectively. It covers resource allocation, model selection, and innovative use cases, providing a strategic roadmap for enterprises.
At the international conference on computer vision, researchers presented a new method to improve surface reconstruction using Gaussian Frosting. This technique automates the adjustment of Poisson surface reconstruction hyperparameters, resulting in significantly improved mesh reconstruction.
This seems an important step toward AGI and vastly improved productivity. "Once these tasks had been learned, the network was able to describe them to a second network — a copy of the first — so that it could reproduce them. To our knowledge, this is the first time that two AIs have been able to talk to each other in a purely linguistic way,’’ said lead author of the paper Alexandre Pouget, leader of the Geneva University Neurocenter, in a statement." "While AI-powered chatbots can interpret linguistic instructions to generate an image or text, they can’t translate written or verbal instructions into physical actions, let alone explain the instructions to another AI. However, by simulating the areas of the human brain responsible for language perception, interpretation and instructions-based actions, the researchers created an AI with human-like learning and communication skills."
Adobe introduced GenStudio, an AI-powered ad creation platform, during its Summit event. GenStudio is a centralized hub for promotional campaigns, offering brand kits, copy guidance, and preapproved assets. It also provides generative AI-powered tools for generating backgrounds and ensuring brand consistency. Users can quickly create ads for email and social media platforms like Facebook, Instagram, and LinkedIn.
Airtable has introduced Airtable AI, which provides generative AI summarization, categorization, and translation to users. This feature allows quick insights and understanding of information within workspaces, enabling easy sharing of valuable insights with teams. Airtable AI automatically applies categories and tags to information, routes action items to the relevant team, and generates emails or social posts with a single button tap.
Microsoft is introducing smarter Copilot AI features in Microsoft Teams to enhance collaboration and productivity. The updates include new ways to invoke the assistant during meeting chats and summaries, making it easier to catch up on missed meetings by combining spoken transcripts and written chats into a single view. Microsoft is launching new hybrid meeting features, such as automatic camera switching for remote participants and speaker recognition for accurate transcripts.
OpenAI is preparing to introduce new features for its GPT-4 and DALL-E 3 models. For GPT-4, OpenAI plans to remove the message limit, implement a Model Tuner Selector, and allow users to upgrade responses from GPT-3.5 to GPT-4 with a simple button push. On the DALL-E 3 front, OpenAI is working on an image editor with inpainting functionality. These upcoming features demonstrate OpenAI's commitment to advancing AI capabilities.
Apple has reportedly chosen Baidu to provide AI technology for its upcoming iPhone 16 and other devices in China. This decision comes as Apple faces challenges due to stagnation in iPhone innovation and competition from Huawei. Baidu's Ernie Bot will be included in the Chinese version of the iPhone 16, Mac OS, and iOS 18. Despite discussions with Alibaba Group Holding and a Tsinghua University AI startup, Apple selected Baidu's AI technology for compliance.
Meta CEO, Mark Zuckerberg, is attempting to recruit top AI talent from Google's DeepMind (their AI research unit). Personalised emails, from Zuckerberg himself, have been sent to a few of their top researchers, according to a report from The Information, which cited individuals that had seen the messages. In addition to this, the researchers are being hired without having to do any interviews, and, a previous policy which Meta had in place - to not offer higher offers to candidates with competing job offers - has been relaxed.
AI Daily Chronicle of AI Innovations - March 26th, 2024: 🔥 Zoom launches all-in-one modern AI collab platform; 🤖 Stability AI launches instruction-tuned LLM; 🚨 Stability AI CEO resigns to focus on decentralized AI; 🔍 WhatsApp to integrate Meta AI directly into its search bar; 🥊 Google, Intel, and Qualcomm challenge Nvidia's dominance in AI; 🎬 OpenAI pitches Sora to Hollywood studios
- Stability AI launches instruction-tuned LLM
Stability AI has introduced Stable Code Instruct 3B, a new instruction-tuned large language model. It can handle various software development tasks, such as code completion, generation, translation, and explanation, as well as creating database queries with simple instructions.
Stable Code Instruct 3B claims to outperform rival models like CodeLlama 7B Instruct and DeepSeek-Coder Instruct 1.3B in terms of accuracy, understanding natural language instructions, and handling diverse programming languages. The model is accessible for commercial use with a Stability AI Membership, while its weights are freely available on Hugging Face for non-commercial projects.
- Zoom launches all-in-one modern AI collab platform
Zoom launched Zoom Workplace, an AI collaboration platform that integrates many tools to improve teamwork and productivity. With over 40 new features, including AI Companion updates for Zoom Phone, Team Chat, Events, and Contact Center, as well as the introduction of Ask AI Companion, Zoom Workplace simplifies workflows within a familiar interface.
The platform offers customization options, meeting features, and improved collaboration tools across Zoom's ecosystem. Zoom Business Services, integrated with Zoom Workplace, offers AI-driven marketing, customer service, and sales solutions. It expands digital communication channels and provides real-time insights for better agent management.
- Stability AI CEO resigns because of centralized AI
Stability AI CEO Emad Mostaque steps down to focus on decentralized AI, advocating for transparent governance in the industry.
Mostaque's departure follows the appointment of interim co-CEOs Shan Shan Wong and Christian Laforte.
The startup, known for its image generation tool, faced challenges including talent loss and financial struggles.
Mostaque emphasized the importance of generative AI R&D over revenue growth and highlighted the potential economic value of open models in regulated industries.
The AI industry witnessed significant changes with Inflection AI co-founders joining Microsoft after raising $1.5 billion.
- Estimating Sora's power requirements
Quoting the compute estimates of Sora from the factorial funds blog
A 15% penetration of Sora for videos with realistic video generation demand and utilization will require about 720k Nvidia H100 GPUs. Each H100 requires about 700 Watts of power supply.
720,000 x 700 = 504 Megawatts.
By comparison, even the largest ever fully solar powered plan in America (Ivanpah Solar Power Facility) produces about 377 Megawats.
While these power requirements can be met with other options like nuclear plants and even coal/hydro plants of big sizes ... are we really entering the power game for electricity ?
( it is currently a power game on compute)
- 💬 The Financial Times has introduced Ask FT, a new GenAI chatbot
It provides curated, natural-language responses to queries about recent events and broader topics covered by the FT. Ask FT is powered by Anthropic's Claude and is available to a selected group of subscribers as it is under testing
- 🔍 WhatsApp to integrate Meta AI directly into its search bar
The latest Android WhatsApp beta update will embed Meta AI directly into the search bar. This feature will allow users to type queries into the search bar and receive instant AI-powered responses without creating a separate Meta AI chat. The update will also allow users to interact with Meta AI even if they choose to hide the shortcut.
- 🥊 Google, Intel, and Qualcomm challenge Nvidia's dominance in AI
Qualcomm, Google, and Intel are targeting NVIDIA's software platforms like CUDA. They plan to create open-source tools compatible with multiple AI accelerator chips through the UXL Foundation. Companies are investing over $4 billion in startups developing AI software to loosen NVIDIA's grip on the field.
- 🤖 Apple takes a multi-vendor approach for generative AI in iOS 18
Apple is reportedly in talks with Alphabet, OpenAI, and Anthropic to integrate generative AI capabilities from multiple vendors into iOS 18. This multi-vendor approach aligns with Apple's efforts to balance advanced AI features with privacy considerations, which are expected to be detailed at WWDC 2024 during the iOS 18 launch.
- 🎬 OpenAI pitches Sora to Hollywood studios
OpenAI is actively engaging with Hollywood studios, directors, and talent agencies to integrate Sora into the entertainment industry. The startup has scheduled meetings in Los Angeles to showcase Sora's capabilities and encourage partnerships, with CEO Sam Altman attending events during the Oscars weekend.
Stability AI has introduced Stable Code Instruct 3B, a new instruction-tuned large language model. It can handle various software development tasks, such as code completion, generation, translation, and explanation, as well as creating database queries with simple instructions. Stable Code Instruct 3B claims to outperform rival models like CodeLlama 7B Instruct and DeepSeek-Coder Instruct 1.3B in terms of accuracy, understanding natural language instructions, and handling diverse programming languages. The model is accessible for commercial use with a Stability AI Membership, while its weights are freely available on Hugging Face for non-commercial projects.
Zoom launched Zoom Workplace, an AI collaboration platform that integrates many tools to improve teamwork and productivity. With over 40 new features, including AI Companion updates for Zoom Phone, Team Chat, Events, and Contact Center, as well as the introduction of Ask AI Companion, Zoom Workplace simplifies workflows within a familiar interface. The platform offers customization options, meeting features, and improved collaboration tools across Zoom's ecosystem. Zoom Business Services, integrated with Zoom Workplace, offers AI-driven marketing, customer service, and sales solutions. It expands digital communication channels and provides real-time insights for better agent management.
Stability AI CEO Emad Mostaque steps down to focus on decentralized AI, advocating for transparent governance in the industry. Mostaque's departure follows the appointment of interim co-CEOs Shan Shan Wong and Christian Laforte. The startup, known for its image generation tool, faced challenges including talent loss and financial struggles. Mostaque emphasized the importance of generative AI R&D over revenue growth and highlighted the potential economic value of open models in regulated industries. The AI industry witnessed significant changes with Inflection AI co-founders joining Microsoft after raising $1.5 billion.
Quoting the compute estimates of Sora from the factorial funds blog
A 15% penetration of Sora for videos with realistic video generation demand and utilization will require about 720k Nvidia H100 GPUs. Each H100 requires about 700 Watts of power supply. 720,000 x 700 = 504 Megawatts. By comparison, even the largest ever fully solar powered plan in America (Ivanpah Solar Power Facility) produces about 377 Megawats. While these power requirements can be met with other options like nuclear plants and even coal/hydro plants of big sizes ... are we really entering the power game for electricity ? ( it is currently a power game on compute)
It provides curated, natural-language responses to queries about recent events and broader topics covered by the FT. Ask FT is powered by Anthropic's Claude and is available to a selected group of subscribers as it is under testing
The latest Android WhatsApp beta update will embed Meta AI directly into the search bar. This feature will allow users to type queries into the search bar and receive instant AI-powered responses without creating a separate Meta AI chat. The update will also allow users to interact with Meta AI even if they choose to hide the shortcut.
Qualcomm, Google, and Intel are targeting NVIDIA's software platforms like CUDA. They plan to create open-source tools compatible with multiple AI accelerator chips through the UXL Foundation. Companies are investing over $4 billion in startups developing AI software to loosen NVIDIA's grip on the field.
Apple is reportedly in talks with Alphabet, OpenAI, and Anthropic to integrate generative AI capabilities from multiple vendors into iOS 18. This multi-vendor approach aligns with Apple's efforts to balance advanced AI features with privacy considerations, which are expected to be detailed at WWDC 2024 during the iOS 18 launch.
OpenAI is actively engaging with Hollywood studios, directors, and talent agencies to integrate Sora into the entertainment industry. The startup has scheduled meetings in Los Angeles to showcase Sora's capabilities and encourage partnerships, with CEO Sam Altman attending events during the Oscars weekend.
AI Daily Chronicle of AI Innovations - March 25th, 2024: 🤝 Apple could partner with OpenAI, Gemini, Anthropic; 🤖 Chatbots more likely to change your mind than another human, study says; Verbal Reasoning Test - Opus is better than 93% of people, Gemini 1.5 Pro 59%, GPT-4 Turbo only 36%; Apple’s Tim Cook says AI essential tool for businesses to reduce carbon footprint; Suno V3: Song-on-demand AI is getting insanely good; The first patient with a Neuralink brain-computer implant played Nintendo’s Mario Kart video game with his mind in an impressive new demo video
- 🤝 Apple could partner with OpenAI, Gemini, Anthropic
Apple is discussing with Alphabet, OpenAI, Anthropic, and potentially Baidu to integrate generative AI into iOS 18, considering multiple partners rather than a single one.
The collaboration could lead to a model where iPhone users might choose their preferred AI provider, akin to selecting a default search engine in a web browser.
Reasons for partnering with external AI providers include financial benefits, the possibility to quickly adapt through partnership changes or user preferences, and avoiding the complexities of developing and maintaining cloud-based generative AI in-house.
- 🤖 Chatbots more likely to change your mind than another human, study says
A study found that personalized chatbots, such as GPT-4, are more likely to change people's minds compared to human debaters by using tailored arguments based on personal information.
The research conducted by the École Polytechnique Fédérale de Lausanne and the Italian Fondazione Bruno Kessler showed an 81.7 percent increase in agreement when GPT-4 had access to participants' personal data like age, gender, and race.
Concerns were raised about the potential misuse of AI in persuasive technologies, especially with the ability to generate detailed user profiles from online activities, urging online platform operators to counter such strategies.
- OpenAI CEO's £142 Million Gamble On Unlocking the Secrets to Longer Life, Altman's vision of extended lifespans may be achievable
Biotech startup Retro Biosciences is undertaking a one-of-a-kind experiment housed in shipping containers, funded by a $180 (£142.78) million investment by tech leader Sam Altman to increase lifespan.
Altman, the 38-year-old tech heavyweight, has been a significant player in the industry. Despite his young age, Altman took the tech realm by storm with offerings like ChatGPT and Sora. Unsurprisingly, his involvement in these groundbreaking projects has propelled him to a level of influence rivaling Mark Zuckerberg and Elon Musk, who is currently embroiled in a lawsuit with OpenAI.
It is also worth noting that the Altman-led AI startup is reportedly planning to launch its own AI-powered search engine to challenge Google's search dominance. Altman's visionary investments in tech giants like Reddit, Stripe, Airbnb, and Instacart propelled him to billionaire status. They cemented his influence as a tech giant who relentlessly pushed the boundaries of the industry's future.
- Suno V3 can do multiple languages in one song. This one is English, Portuguese, Japanese, and Italian. Incredible.
Beneath the vast sky, where dreams lay rooted deep, Mountains high and valleys wide, secrets they keep. Ground beneath my feet, firm and ever true, Earth, you give us life, in shades of brown and green hue.
Sopra o vento, mensageiro entre o céu e o mar, Carregando sussurros, histórias a contar. Dançam as folhas, em um balé sem fim, Vento, o alento invisível, guiando o destino assim.
火のように、情熱が燃えて、 光と暖かさを私たちに与えてくれる。 夜の暗闇を照らす、勇敢な炎、 生命の力、絶えず変わるゲーム。
Acqua, misteriosa forza che tutto scorre, Nei fiumi, nei mari, la vita che ci offre. Specchio del cielo, in te ci riflettiamo, Acqua, fonte di vita, a te ci affidiamo.
- OpenAI Heading To Hollywood To Pitch Revolutionary “Sora”
Some of the most important meetings in Hollywood history will take place in the coming week, as OpenAI hits Hollywood to show the potential of its “Sora” software to studios, talent agencies, and media executives.
Bloomberg is reporting that OpenAI wants more filmmakers to become familiar with Sora, the text-to-video generator that potentially could upend the way movies are made.
- Soon, Everyone Will Own a Robot, Like a Car or Phone Today. Says Figure AI founder
Brett Adcock, the founder of FigureAI robots, the company that recently released a demo video of its humanoid robot conversing with a human while performing tasks, predicts that everyone will own a robot in the future. “Similar to owning a car or phone today,” he said – hinting at the universal adoption of robots as an essential commodity in the future.
“Every human will own a robot in the future, similar to owning a car/phone today,” said Adcock.
A few months ago, Adcock called 2024 the year of Embodied AI, indicating how the future comprises AI in a body form. With robots learning to perform low-complexity tasks, such as picking trash, placing dishes, and even using the coffee machine, Figure robots are being trained to assist a person with house chores.
Apple is discussing with Alphabet, OpenAI, Anthropic, and potentially Baidu to integrate generative AI into iOS 18, considering multiple partners rather than a single one. The collaboration could lead to a model where iPhone users might choose their preferred AI provider, akin to selecting a default search engine in a web browser. Reasons for partnering with external AI providers include financial benefits, the possibility to quickly adapt through partnership changes or user preferences, and avoiding the complexities of developing and maintaining cloud-based generative AI in-house.
A study found that personalized chatbots, such as GPT-4, are more likely to change people's minds compared to human debaters by using tailored arguments based on personal information. The research conducted by the École Polytechnique Fédérale de Lausanne and the Italian Fondazione Bruno Kessler showed an 81.7 percent increase in agreement when GPT-4 had access to participants' personal data like age, gender, and race. Concerns were raised about the potential misuse of AI in persuasive technologies, especially with the ability to generate detailed user profiles from online activities, urging online platform operators to counter such strategies.
Biotech startup Retro Biosciences is undertaking a one-of-a-kind experiment housed in shipping containers, funded by a $180 (£142.78) million investment by tech leader Sam Altman to increase lifespan. Altman, the 38-year-old tech heavyweight, has been a significant player in the industry. Despite his young age, Altman took the tech realm by storm with offerings like ChatGPT and Sora. Unsurprisingly, his involvement in these groundbreaking projects has propelled him to a level of influence rivaling Mark Zuckerberg and Elon Musk, who is currently embroiled in a lawsuit with OpenAI. It is also worth noting that the Altman-led AI startup is reportedly planning to launch its own AI-powered search engine to challenge Google's search dominance. Altman's visionary investments in tech giants like Reddit, Stripe, Airbnb, and Instacart propelled him to billionaire status. They cemented his influence as a tech giant who relentlessly pushed the boundaries of the industry's future.
Beneath the vast sky, where dreams lay rooted deep, Mountains high and valleys wide, secrets they keep. Ground beneath my feet, firm and ever true, Earth, you give us life, in shades of brown and green hue. Sopra o vento, mensageiro entre o céu e o mar, Carregando sussurros, histórias a contar. Dançam as folhas, em um balé sem fim, Vento, o alento invisível, guiando o destino assim. 火のように、情熱が燃えて、 光と暖かさを私たちに与えてくれる。 夜の暗闇を照らす、勇敢な炎、 生命の力、絶えず変わるゲーム。 Acqua, misteriosa forza che tutto scorre, Nei fiumi, nei mari, la vita che ci offre. Specchio del cielo, in te ci riflettiamo, Acqua, fonte di vita, a te ci affidiamo.
Some of the most important meetings in Hollywood history will take place in the coming week, as OpenAI hits Hollywood to show the potential of its “Sora” software to studios, talent agencies, and media executives. Bloomberg is reporting that OpenAI wants more filmmakers to become familiar with Sora, the text-to-video generator that potentially could upend the way movies are made.
Brett Adcock, the founder of FigureAI robots, the company that recently released a demo video of its humanoid robot conversing with a human while performing tasks, predicts that everyone will own a robot in the future. “Similar to owning a car or phone today,” he said – hinting at the universal adoption of robots as an essential commodity in the future. “Every human will own a robot in the future, similar to owning a car/phone today,” said Adcock. A few months ago, Adcock called 2024 the year of Embodied AI, indicating how the future comprises AI in a body form. With robots learning to perform low-complexity tasks, such as picking trash, placing dishes, and even using the coffee machine, Figure robots are being trained to assist a person with house chores.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 17 - 24, 2024: Week 3 summary
🕰️ 32-hour workweek with the same pay: AI’s new promise
🗣️ Google's VLOGGER brings photos to life as talking avatars
🔓 Elon Musk’s xAI open-sources Grok AI
💻 Nvidia launches 'world's most powerful AI chip'
🎥 Stability AI's SV3D turns a single photo into a 3D video
🤖 OpenAI CEO hints at "amazing model", maybe ChatGPT-5
🧠 MindEye2: AI Mind Reading from Brain Activity
🚀 Nvidia NIM enables faster deployment of AI models
🤝 Microsoft hires DeepMind co-founder to lead a new AI division
🕵️♂️ A new hack: Stealing Part of a Production Language Model
🧰 Sakana AI’s method to automate foundation model development
👋 Key Stable Diffusion researchers leave Stability AI
⏩ NVIDIA’s LATTE3D: The fastest AI model for 3D generation
📚 Language Models teach themselves to think before speaking
♟️ Neuralink's first human patient plays chess with his mind
💥 Stability AI CEO resigns to ‘pursue decentralized AI’
🎥 OpenAI seeks Hollywood partnerships ahead of Sora AI video generator release
🍎 Apple gives up on its MicroLED dream
🔍 Google begins public testing of its generative AI search
🧠 AI could detect early risk of psychosis based on brain images
🗣️ Google's VLOGGER brings photos to life as talking avatars
🔓 Elon Musk’s xAI open-sources Grok AI
💻 Nvidia launches 'world's most powerful AI chip'
🎥 Stability AI's SV3D turns a single photo into a 3D video
🤖 OpenAI CEO hints at "amazing model", maybe ChatGPT-5
🧠 MindEye2: AI Mind Reading from Brain Activity
🚀 Nvidia NIM enables faster deployment of AI models
🤝 Microsoft hires DeepMind co-founder to lead a new AI division
🕵️♂️ A new hack: Stealing Part of a Production Language Model
🧰 Sakana AI’s method to automate foundation model development
👋 Key Stable Diffusion researchers leave Stability AI
⏩ NVIDIA’s LATTE3D: The fastest AI model for 3D generation
📚 Language Models teach themselves to think before speaking
♟️ Neuralink's first human patient plays chess with his mind
💥 Stability AI CEO resigns to ‘pursue decentralized AI’
🎥 OpenAI seeks Hollywood partnerships ahead of Sora AI video generator release
🍎 Apple gives up on its MicroLED dream
🔍 Google begins public testing of its generative AI search
🧠 AI could detect early risk of psychosis based on brain images
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: C37HCAQRVR7JTFK(Email us for more)
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 24th, 2024
- Summary on GPT-5's performance rumors:
Overall performance boost: Sam Altman, CEO, stated that GPT-5 will be smarter and superior in all aspects, with a significant performance improvement over GPT-4.
Enhanced multimodal capabilities: It is predicted that GPT-5 will not only handle text and images but will also be capable of processing audio and videos, becoming a multimodal AI.
Increase in parameter count: It's expected to have several trillion parameters, greatly surpassing GPT-4's one trillion, for more complex and advanced reasoning.
Better text generation quality: GPT-5 aims to produce text that is consistently realistic and indistinguishable from that written by humans.
Expanded context understanding: The context window of GPT-5 is expected to greatly exceed GPT-4's 128,000 tokens, allowing for longer text comprehension and analysis.
Improved logical reasoning: GPT-5 is expected to make significant advancements in its ability to reason logically and tackle more complex problems.
AI agent functionality: GPT-5 may include the capability for performing tasks autonomously, hinting at functionalities similar to those of AI agents.
GPT-5 is expected to mark a revolutionary leap forward from GPT-4. Altman cautions against underestimating the performance improvements of GPT-5, indicating it could introduce groundbreaking changes to the field of AI.
- Nvidia announces AI-powered health care 'agents' that outperform nurses — and cost $9 an hour
High-powered chipmaker Nvidia has teamed up with artificial intelligence health care company Hippocratic AI to develop generative AI "agents" that not only outperform human nurses on video calls but cost a lot less per hour.
The two companies on Thursday announced their collaboration to build "empathetic health care agents" powered by Nvidia and trained on Hippocratic's health care-focused large language model (LLM) that are better able to form a human connection with patients through "super-low latency conversational reactions."
It was interesting watching the demonstration of their AI nurse, Linda, on the Hippocratic AI website. While I doubt elderly patients will be receptive at first, if the AI nurse is able to spend longer time with the patient and answer their questions then that could really be beneficial for healthcare and patients alike. It'll also free up a lot of nurses and remove some of their workload.
If implemented, I'd hope that there is a hybrid call system so that if the patients don't want to talk with the AI, they could be redirected to a human nurse.
- Pro AI regulation Sam Altman has been spending a lot of time in Washington lobbying the government presumably to regulate Open Source.
- Mistral just announced at @SHACK15sf that they will release a new model today: Mistral 7B v0.2 Base Model - 32k instead of 8k context window
Overall performance boost: Sam Altman, CEO, stated that GPT-5 will be smarter and superior in all aspects, with a significant performance improvement over GPT-4.
Enhanced multimodal capabilities: It is predicted that GPT-5 will not only handle text and images but will also be capable of processing audio and videos, becoming a multimodal AI.
Increase in parameter count: It's expected to have several trillion parameters, greatly surpassing GPT-4's one trillion, for more complex and advanced reasoning.
Better text generation quality: GPT-5 aims to produce text that is consistently realistic and indistinguishable from that written by humans.
Expanded context understanding: The context window of GPT-5 is expected to greatly exceed GPT-4's 128,000 tokens, allowing for longer text comprehension and analysis.
Improved logical reasoning: GPT-5 is expected to make significant advancements in its ability to reason logically and tackle more complex problems.
AI agent functionality: GPT-5 may include the capability for performing tasks autonomously, hinting at functionalities similar to those of AI agents.
GPT-5 is expected to mark a revolutionary leap forward from GPT-4. Altman cautions against underestimating the performance improvements of GPT-5, indicating it could introduce groundbreaking changes to the field of AI.
High-powered chipmaker Nvidia has teamed up with artificial intelligence health care company Hippocratic AI to develop generative AI "agents" that not only outperform human nurses on video calls but cost a lot less per hour. The two companies on Thursday announced their collaboration to build "empathetic health care agents" powered by Nvidia and trained on Hippocratic's health care-focused large language model (LLM) that are better able to form a human connection with patients through "super-low latency conversational reactions."
It was interesting watching the demonstration of their AI nurse, Linda, on the Hippocratic AI website. While I doubt elderly patients will be receptive at first, if the AI nurse is able to spend longer time with the patient and answer their questions then that could really be beneficial for healthcare and patients alike. It'll also free up a lot of nurses and remove some of their workload. If implemented, I'd hope that there is a hybrid call system so that if the patients don't want to talk with the AI, they could be redirected to a human nurse.
AI Daily Chronicle of AI Innovations - March 22nd, 2024: 🤖 Nvidia’s Latte 3D generates text-to-3D in seconds! 💰 Saudi Arabia to invest $40 billion in AI 🚀 Open Interpreter’s 01 Light personal pocket AI agent. 🤖 Microsoft introduces a new Copilot for better productivity. 💡Quiet-STaR: LMs can self-train to think before responding 🤯Neuralink's first brain chip patient plays chess with his mind
- Meta AI introduced SceneScript, a novel method of generating scene layouts and representing scenes using language.
SceneScript allows AR & AI devices to understand the geometry of physical spaces. It uses next token prediction like an LLM, but instead of natural language SceneScript model predicts the next architectural tokens such as ‘wall’ or ‘door.’
- Nvidia’s Latte 3D generates text-to-3D in seconds!
NVIDIA introduces Latte3D, facilitating the conversion of text prompts into detailed 3D models in less than a second. Developed by NVIDIA’s Toronto lab, Latte3D sets a new standard in generative AI models with its remarkable blend of speed and precision.
- Quiet-STaR: LMs can self-train to think before responding
A groundbreaking study demonstrates the successful training of large language models (LM) to reason from text rather than specific reasoning tasks. The research introduces a novel training approach, Quiet STaR, which utilizes a parallel sampling algorithm to generate rationales from all token positions in a given string.
- Neuralink's first brain chip patient plays chess with his mind
Elon Musk's brain chip startup, Neuralink, showcased its first brain chip patient playing chess using only his mind. The patient, Noland Arbaugh, was paralyzed below the shoulder after a diving accident.
Neuralink's brain implant technology allows people with paralysis to control external devices using their thoughts. With further advancements, Neuralink's technology has the potential to revolutionize the lives of people with paralysis, providing them with newfound independence and the ability to interact with the world in previously unimaginable ways.
- 🤖 Microsoft introduces a new Copilot for better productivity.
Microsoft's new Copilot for Windows and Surface devices is a powerful productivity tool integrating large language models with Microsoft Graph and Microsoft 365 apps to enhance work efficiency. With a focus on delivering AI responsibly while ensuring data security and privacy, Microsoft is dedicated to providing users with innovative tools to thrive in the evolving work landscape.
- 💰 Saudi Arabia to invest $40 billion in AI
Saudi Arabia has announced its plan to invest $40 billion in AI to become a global leader. Middle Eastern countries use their sovereign wealth fund, which has over $900 billion in assets, to achieve this goal. This investment aims to position the country at the forefront of the fast-evolving AI sector, drive innovation, and enhance economic growth.
- 🎧 Rightsify releases Hydra II to revolutionize AI music generation
Rightsify, a global music licensing leader, introduced Hydra II, the latest AI generation model. Hydra II offers over 800 instruments, 50 languages, and editing tools for customizable, copyright-free AI music. The model is trained on audio, text descriptions, MIDI, chord progressions, sheet music, and stems to create unique generations.
- 🚀 Open Interpreter’s 01 Light personal pocket AI agent
The Open Interpreter unveiled 01 Light, a portable device that allows you to control your computer using natural language commands. It's part of an open-source project to make computing more accessible and flexible. It's designed to make your online tasks more manageable, helping you get more done and simplify your life.
- 🤝 Microsoft's $650 million Inflection deal: A strategic move
Microsoft has recently entered into a significant deal with AI startup Inflection, involving a payment of $650 million in cash. While the deal may seem like a licensing agreement, it appears to be a strategic move by Microsoft to acquire AI talent while avoiding potential regulatory trouble.
- NVIDIA NIM, a containerized inference microservice to simplify deployment of generative AI models across various infrastructures.
Developers can test a wide range of models using cloud APIs from the NVIDIA API catalog or they can self-host the models by downloading NIM and deploying with Kubernetes
- Earth-2 climate digital twin cloud platform for simulating and visualizing weather and climate at unprecedented scale.
Earth-2’s APIs offer AI models and employ a new NVIDIA generative AI model called CorrDiff that generates 12.5x higher resolution images than current numerical models 1,000x faster and 3,000x more energy efficiently
- Roblox adds AI-powered avatar creation ( converts a 3D body mesh into a live, animated avatar) and texture generation (text prompts to quickly change the look of 3D objects)
- ByteDance released AnimateDiff-Lightning,
a lightning-fast text-to-video generation model. It can generate videos more than ten times faster than the original AnimateDiff
- Lighthouz AI launched the Chatbot Guardrails Arena in collaboration with Hugging Face
to stress test LLMs and privacy guardrails in leaking sensitive data. Chat with two anonymous LLMs with guardrails and try to trick them into revealing sensitive financial information and cast your vote for the model that shows greater privacy
- Andrew Ng, cofounder of Google Brain & former chief scientist @ Baidu- "I think AI agentic workflows will drive massive AI progress this year
I think AI agentic workflows will drive massive AI progress this year — perhaps even more than the next generation of foundation models. This is an important trend, and I urge everyone who works in AI to pay attention to it. Today, we mostly use LLMs in zero-shot mode, prompting a model to generate final output token by token without revising its work. This is akin to asking someone to compose an essay from start to finish, typing straight through with no backspacing allowed, and expecting a high-quality result. Despite the difficulty, LLMs do amazingly well at this task! With an agentic workflow, however, we can ask the LLM to iterate over a document many times. For example, it might take a sequence of steps such as: - Plan an outline. - Decide what, if any, web searches are needed to gather more information. - Write a first draft. - Read over the first draft to spot unjustified arguments or extraneous information. - Revise the draft taking into account any weaknesses spotted. - And so on. This iterative process is critical for most human writers to write good text. With AI, such an iterative workflow yields much better results than writing in a single pass. Devin’s splashy demo recently received a lot of social media buzz. My team has been closely following the evolution of AI that writes code. We analyzed results from a number of research teams, focusing on an algorithm’s ability to do well on the widely used HumanEval coding benchmark. You can see our findings in the diagram below. GPT-3.5 (zero shot) was 48.1% correct. GPT-4 (zero shot) does better at 67.0%. However, the improvement from GPT-3.5 to GPT-4 is dwarfed by incorporating an iterative agent workflow. Indeed, wrapped in an agent loop, GPT-3.5 achieves up to 95.1%. Open source agent tools and the academic literature on agents are proliferating, making this an exciting time but also a confusing one. To help put this work into perspective, I’d like to share a framework for categorizing design patterns for building agents. My team AI Fund is successfully using these patterns in many applications, and I hope you find them useful. - Reflection: The LLM examines its own work to come up with ways to improve it. - Tool use: The LLM is given tools such as web search, code execution, or any other function to help it gather information, take action, or process data. - Planning: The LLM comes up with, and executes, a multistep plan to achieve a goal (for example, writing an outline for an essay, then doing online research, then writing a draft, and so on). - Multi-agent collaboration: More than one AI agent work together, splitting up tasks and discussing and debating ideas, to come up with better solutions than a single agent would. I’ll elaborate on these design patterns and offer suggested readings for each next week.
- AI-generated digital twins of patients can predict future diseases
Named Foresight, the tool uses generative pre-trained transformers, the same family of large language models (LLMs) used by ChatGPT.
Researchers in the UK first trained the models on medical records. Next, they fed their tool fresh healthcare data to create virtual duplicates of patients.
Finally, the digital twins forecast various outcomes, from disease development to medication needs.
Scientists are particularly excited about the prospect of accelerating diagnosis.
When applied to US data, the digital twins correctly identified the next condition of patients next condition with 88% accuracy.
It was less effective, however, on British data. Using information from two National Health Trust (NHS) organisations, the tool accurately predicted subsequent conditions 68% and 76% of the time.
Nonetheless, there are high hopes for the digital twins.
SceneScript allows AR & AI devices to understand the geometry of physical spaces. It uses next token prediction like an LLM, but instead of natural language SceneScript model predicts the next architectural tokens such as ‘wall’ or ‘door.’
NVIDIA introduces Latte3D, facilitating the conversion of text prompts into detailed 3D models in less than a second. Developed by NVIDIA’s Toronto lab, Latte3D sets a new standard in generative AI models with its remarkable blend of speed and precision.
A groundbreaking study demonstrates the successful training of large language models (LM) to reason from text rather than specific reasoning tasks. The research introduces a novel training approach, Quiet STaR, which utilizes a parallel sampling algorithm to generate rationales from all token positions in a given string.
Elon Musk's brain chip startup, Neuralink, showcased its first brain chip patient playing chess using only his mind. The patient, Noland Arbaugh, was paralyzed below the shoulder after a diving accident. Neuralink's brain implant technology allows people with paralysis to control external devices using their thoughts. With further advancements, Neuralink's technology has the potential to revolutionize the lives of people with paralysis, providing them with newfound independence and the ability to interact with the world in previously unimaginable ways.
Microsoft's new Copilot for Windows and Surface devices is a powerful productivity tool integrating large language models with Microsoft Graph and Microsoft 365 apps to enhance work efficiency. With a focus on delivering AI responsibly while ensuring data security and privacy, Microsoft is dedicated to providing users with innovative tools to thrive in the evolving work landscape.
Saudi Arabia has announced its plan to invest $40 billion in AI to become a global leader. Middle Eastern countries use their sovereign wealth fund, which has over $900 billion in assets, to achieve this goal. This investment aims to position the country at the forefront of the fast-evolving AI sector, drive innovation, and enhance economic growth.
Rightsify, a global music licensing leader, introduced Hydra II, the latest AI generation model. Hydra II offers over 800 instruments, 50 languages, and editing tools for customizable, copyright-free AI music. The model is trained on audio, text descriptions, MIDI, chord progressions, sheet music, and stems to create unique generations.
The Open Interpreter unveiled 01 Light, a portable device that allows you to control your computer using natural language commands. It's part of an open-source project to make computing more accessible and flexible. It's designed to make your online tasks more manageable, helping you get more done and simplify your life.
Microsoft has recently entered into a significant deal with AI startup Inflection, involving a payment of $650 million in cash. While the deal may seem like a licensing agreement, it appears to be a strategic move by Microsoft to acquire AI talent while avoiding potential regulatory trouble.
Developers can test a wide range of models using cloud APIs from the NVIDIA API catalog or they can self-host the models by downloading NIM and deploying with Kubernetes
Earth-2’s APIs offer AI models and employ a new NVIDIA generative AI model called CorrDiff that generates 12.5x higher resolution images than current numerical models 1,000x faster and 3,000x more energy efficiently
a lightning-fast text-to-video generation model. It can generate videos more than ten times faster than the original AnimateDiff
to stress test LLMs and privacy guardrails in leaking sensitive data. Chat with two anonymous LLMs with guardrails and try to trick them into revealing sensitive financial information and cast your vote for the model that shows greater privacy
I think AI agentic workflows will drive massive AI progress this year — perhaps even more than the next generation of foundation models. This is an important trend, and I urge everyone who works in AI to pay attention to it. Today, we mostly use LLMs in zero-shot mode, prompting a model to generate final output token by token without revising its work. This is akin to asking someone to compose an essay from start to finish, typing straight through with no backspacing allowed, and expecting a high-quality result. Despite the difficulty, LLMs do amazingly well at this task! With an agentic workflow, however, we can ask the LLM to iterate over a document many times. For example, it might take a sequence of steps such as: - Plan an outline. - Decide what, if any, web searches are needed to gather more information. - Write a first draft. - Read over the first draft to spot unjustified arguments or extraneous information. - Revise the draft taking into account any weaknesses spotted. - And so on. This iterative process is critical for most human writers to write good text. With AI, such an iterative workflow yields much better results than writing in a single pass. Devin’s splashy demo recently received a lot of social media buzz. My team has been closely following the evolution of AI that writes code. We analyzed results from a number of research teams, focusing on an algorithm’s ability to do well on the widely used HumanEval coding benchmark. You can see our findings in the diagram below. GPT-3.5 (zero shot) was 48.1% correct. GPT-4 (zero shot) does better at 67.0%. However, the improvement from GPT-3.5 to GPT-4 is dwarfed by incorporating an iterative agent workflow. Indeed, wrapped in an agent loop, GPT-3.5 achieves up to 95.1%. Open source agent tools and the academic literature on agents are proliferating, making this an exciting time but also a confusing one. To help put this work into perspective, I’d like to share a framework for categorizing design patterns for building agents. My team AI Fund is successfully using these patterns in many applications, and I hope you find them useful. - Reflection: The LLM examines its own work to come up with ways to improve it. - Tool use: The LLM is given tools such as web search, code execution, or any other function to help it gather information, take action, or process data. - Planning: The LLM comes up with, and executes, a multistep plan to achieve a goal (for example, writing an outline for an essay, then doing online research, then writing a draft, and so on). - Multi-agent collaboration: More than one AI agent work together, splitting up tasks and discussing and debating ideas, to come up with better solutions than a single agent would. I’ll elaborate on these design patterns and offer suggested readings for each next week.
Named Foresight, the tool uses generative pre-trained transformers, the same family of large language models (LLMs) used by ChatGPT. Researchers in the UK first trained the models on medical records. Next, they fed their tool fresh healthcare data to create virtual duplicates of patients. Finally, the digital twins forecast various outcomes, from disease development to medication needs. Scientists are particularly excited about the prospect of accelerating diagnosis. When applied to US data, the digital twins correctly identified the next condition of patients next condition with 88% accuracy. It was less effective, however, on British data. Using information from two National Health Trust (NHS) organisations, the tool accurately predicted subsequent conditions 68% and 76% of the time. Nonetheless, there are high hopes for the digital twins.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: C37HCAQRVR7JTFK(Email us for more)
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 21st, 2024: 🕵️♂️ Stealing Part of a Production Language Model 🤖 Sakana AI’s method to automate foundation model development 👋 Key Stable Diffusion researchers leave Stability AI 🗣️Character AI’s new feature adds voice to characters with just 10-sec audio 💡Fitbit to get major AI upgrades powered by Google’s ‘Personal Health’ LLM 🔬Samsung creates lab to research chips for AI’s next phase 🤖GitHub’s latest AI tool can automatically fix code vulnerabilities
- Google's progress on generative AI in health
New modalities in models for healthcare
Medicine is a multimodal discipline; it’s made up of different types of information stored across formats — like radiology images, lab results, genomics data, environmental context and more. To get a fuller understanding of a person’s health, we need to build technology that understands all of this information.
A Personal Health LLM for personalized coaching and recommendations
Fitbit and Google Research are working together to build a Personal Health Large Language Model that can power personalized health and wellness features in the Fitbit mobile app, helping people get even more insights and recommendations from the data from their Fitbit and Pixel devices. This model is being fine-tuned to deliver personalized coaching capabilities, like actionable messages and guidance, that can be individualized based on personal health and fitness goals. For example, this model may be able to analyze variations in your sleep patterns and sleep quality, and then suggest recommendations on how you might change the intensity of your workout based on those insights.
- Google fined €250 million by French authorities for clash with news outlets over AI training data
Google was fined €250 million by French watchdogs after it trained Bard with data from French news publications without their consent
- UN set to vote on first AI resolution, aiming to make it 'safe and secure'
The UN General Assembly is set to vote on its first resolution on artificial intelligence, focusing on ensuring the technology is safe, respects human rights, and benefits all nations.
The draft resolution emphasizes closing the digital divide, fostering global consensus on AI governance, and using AI to achieve the UN's 2030 development goals.
The resolution has received support from all 193 UN member states after months of negotiations and aims to guide the development and use of AI in a manner that respects human rights and fundamental freedoms.
- Stealing Part of a Production Language Model
Researchers from Google, OpenAI, and DeepMind (among others) released a new paper that introduces the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI’s ChatGPT or Google’s PaLM-2.
The attack allowed them to recover the complete embedding projection layer of a transformer language model. It differs from prior approaches that reconstruct a model in a bottom-up fashion, starting from the input layer. Instead, this operates top-down and directly extracts the model’s last layer by making targeted queries to a model’s API. This is useful for several reasons; it
Reveals the width of the transformer model, which is often correlated with its total parameter count.
Slightly reduces the degree to which the model is a complete “blackbox”
May reveal more global information about the model, such as relative size differences between different models
- Sakana AI’s method to automate foundation model development
Sakana AI has introduced Evolutionary Model Merge, a general method that uses evolutionary techniques to efficiently discover the best ways to combine different models from the vast ocean of different open-source models with diverse capabilities.
As of writing, Hugging Face has over 500k models in dozens of different modalities that, in principle, could be combined to form new models with new capabilities. By working with the vast collective intelligence of existing open models, this method is able to automatically create new foundation models with desired capabilities specified by the user.
- Key Stable Diffusion researchers leave Stability AI
Robin Rombach and other key researchers who helped develop the Stable Diffusion text-to-image generation model have left the troubled, once-hot, now floundering GenAI startup.
Rombach (who led the team) and fellow researchers Andreas Blattmann and Dominik Lorenz were three of the five authors who developed the core Stable Diffusion research while at a German university. They were hired afterwards by Stability. Last month, they helped publish a 3rd edition of the Stable Diffusion model, which, for the first time, combined the diffusion structure used in earlier versions with transformers used in OpenAI’s ChatGPT.
Their departures are the latest in a mass exodus of executives at Stability AI, as its cash reserves dwindle and it struggles to raise additional funds.
- 🗣️Character AI’s new feature adds voice to characters with just 10-sec audio
You can now give voice to your Characters by choosing from thousands of voices or creating your own. The voices are created with just 10 seconds of audio clips. The feature is now available for free to everyone.
- 🤖GitHub’s latest AI tool can automatically fix code vulnerabilities
GitHub launches the first beta of its code-scanning autofix feature, which finds and fixes security vulnerabilities during the coding process. GitHub claims it can remediate more than two-thirds of the vulnerabilities it finds, often without the developers having to edit the code. The feature is now available for all GitHub Advanced Security (GHAS) customers.
- 🚀OpenAI plans to release a 'materially better' GPT-5 in mid-2024
According to anonymous sources from Businessinsider, OpenAI plans to release GPT-5 this summer, which will be significantly better than GPT-4. Some enterprise customers are said to have already received demos of the latest model and its ChatGPT improvements.
- 💡Fitbit to get major AI upgrades powered by Google’s ‘Personal Health’ LLM
Google Research and Fitbit announced they are working together to build a Personal Health LLM that gives users more insights and recommendations based on their data in the Fitbit mobile app. It will give Fitbit users personalized coaching and actionable insights that help them achieve their fitness and health goals.
- 🔬Samsung creates lab to research chips for AI’s next phase
Samsung has set up a research lab dedicated to designing an entirely new type of semiconductor needed for (AGI). The lab will initially focus on developing chips for LLMs with a focus on inference. It aims to release new “chip designs, an iterative model that will provide stronger performance and support for increasingly larger models at a fraction of the power and cost.”
- 🔍 Google fined $270M for using news articles to train Gemini
Google agreed to pay approximately $273 million to settle a dispute in France for not informing or compensating French news publishers when using their content for search results and training its AI chatbot, Gemini.
The settlement addresses Google's breach of commitments, including fair negotiations with publishers and informing them about the use of their content by Google's AI services.
As part of the settlement, Google committed to corrective measures including dropping a minimum threshold for publisher remuneration and appointing a French-speaking representative to improve transparency and communication with publishers.
New modalities in models for healthcare
Medicine is a multimodal discipline; it’s made up of different types of information stored across formats — like radiology images, lab results, genomics data, environmental context and more. To get a fuller understanding of a person’s health, we need to build technology that understands all of this information.
A Personal Health LLM for personalized coaching and recommendations
Fitbit and Google Research are working together to build a Personal Health Large Language Model that can power personalized health and wellness features in the Fitbit mobile app, helping people get even more insights and recommendations from the data from their Fitbit and Pixel devices. This model is being fine-tuned to deliver personalized coaching capabilities, like actionable messages and guidance, that can be individualized based on personal health and fitness goals. For example, this model may be able to analyze variations in your sleep patterns and sleep quality, and then suggest recommendations on how you might change the intensity of your workout based on those insights.
Google was fined €250 million by French watchdogs after it trained Bard with data from French news publications without their consent
The UN General Assembly is set to vote on its first resolution on artificial intelligence, focusing on ensuring the technology is safe, respects human rights, and benefits all nations. The draft resolution emphasizes closing the digital divide, fostering global consensus on AI governance, and using AI to achieve the UN's 2030 development goals. The resolution has received support from all 193 UN member states after months of negotiations and aims to guide the development and use of AI in a manner that respects human rights and fundamental freedoms.
Researchers from Google, OpenAI, and DeepMind (among others) released a new paper that introduces the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI’s ChatGPT or Google’s PaLM-2. The attack allowed them to recover the complete embedding projection layer of a transformer language model. It differs from prior approaches that reconstruct a model in a bottom-up fashion, starting from the input layer. Instead, this operates top-down and directly extracts the model’s last layer by making targeted queries to a model’s API. This is useful for several reasons; it Reveals the width of the transformer model, which is often correlated with its total parameter count.
Slightly reduces the degree to which the model is a complete “blackbox”
May reveal more global information about the model, such as relative size differences between different models
Sakana AI has introduced Evolutionary Model Merge, a general method that uses evolutionary techniques to efficiently discover the best ways to combine different models from the vast ocean of different open-source models with diverse capabilities. As of writing, Hugging Face has over 500k models in dozens of different modalities that, in principle, could be combined to form new models with new capabilities. By working with the vast collective intelligence of existing open models, this method is able to automatically create new foundation models with desired capabilities specified by the user.
Robin Rombach and other key researchers who helped develop the Stable Diffusion text-to-image generation model have left the troubled, once-hot, now floundering GenAI startup. Rombach (who led the team) and fellow researchers Andreas Blattmann and Dominik Lorenz were three of the five authors who developed the core Stable Diffusion research while at a German university. They were hired afterwards by Stability. Last month, they helped publish a 3rd edition of the Stable Diffusion model, which, for the first time, combined the diffusion structure used in earlier versions with transformers used in OpenAI’s ChatGPT. Their departures are the latest in a mass exodus of executives at Stability AI, as its cash reserves dwindle and it struggles to raise additional funds.
You can now give voice to your Characters by choosing from thousands of voices or creating your own. The voices are created with just 10 seconds of audio clips. The feature is now available for free to everyone.
GitHub launches the first beta of its code-scanning autofix feature, which finds and fixes security vulnerabilities during the coding process. GitHub claims it can remediate more than two-thirds of the vulnerabilities it finds, often without the developers having to edit the code. The feature is now available for all GitHub Advanced Security (GHAS) customers.
According to anonymous sources from Businessinsider, OpenAI plans to release GPT-5 this summer, which will be significantly better than GPT-4. Some enterprise customers are said to have already received demos of the latest model and its ChatGPT improvements.
Google Research and Fitbit announced they are working together to build a Personal Health LLM that gives users more insights and recommendations based on their data in the Fitbit mobile app. It will give Fitbit users personalized coaching and actionable insights that help them achieve their fitness and health goals.
Samsung has set up a research lab dedicated to designing an entirely new type of semiconductor needed for (AGI). The lab will initially focus on developing chips for LLMs with a focus on inference. It aims to release new “chip designs, an iterative model that will provide stronger performance and support for increasingly larger models at a fraction of the power and cost.”
Google agreed to pay approximately $273 million to settle a dispute in France for not informing or compensating French news publishers when using their content for search results and training its AI chatbot, Gemini. The settlement addresses Google's breach of commitments, including fair negotiations with publishers and informing them about the use of their content by Google's AI services. As part of the settlement, Google committed to corrective measures including dropping a minimum threshold for publisher remuneration and appointing a French-speaking representative to improve transparency and communication with publishers.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: C37HCAQRVR7JTFK(Email us for more)
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 20th, 2024: 🤖 OpenAI to release GPT-5 this summer; 🧠 Nvidia’s Jensen Huang says AI hallucinations are solvable, AGI is 5 years away; 🔬 Ozempic creator plans AI supercomputer to discover new drugs; 👀 After raising $1.3B, Inflection eaten alive by Microsoft; 🧠 MindEye2: AI Mind Reading from Brain Activity; 🚀 Nvidia NIM enables faster deployment of AI models
- 🤖 OpenAI to release GPT-5 this summer :
OpenAI is planning to launch GPT-5 around mid-year, aiming to address previous performance issues and significantly improve upon its predecessor, GPT-4.
GPT-5 is described as "materially better" by those who have seen demos, including enhancements and new capabilities like the ability to call AI agents for autonomous tasks, with enterprise customers having already previewed these improvements.
The release timeline for GPT-5 remains uncertain as OpenAI continues its training and thorough safety and vulnerability testing, with no specific deadline for completion of these preparatory steps.
- 👀 After raising $1.3B, Inflection eaten alive by Microsoft :
In June 2023, Inflection raised $1.3 billion led by Microsoft to develop "more personal AI" but was overtaken by Microsoft less than a year later, with co-founders joining Microsoft's new AI division.
Despite significant investment, Inflection's AI, Pi, failed to compete with advancements from other companies such as OpenAI, Google’s Gemini, and Anthropic, leading to its downfall.
Microsoft's takeover of Inflection reflects the strategy of legacy tech companies to dominate the AI space by supporting startups then acquiring them once they face challenges.
- 🧠 Nvidia’s Jensen Huang says AI hallucinations are solvable, AGI is 5 years away
Nvidia CEO Jensen Huang predicts artificial general intelligence (AGI) could be achieved within 5 years, depending on how AGI is defined and measured.
Huang addresses concerns around AI hallucinations, suggesting that ensuring answers are well-researched could easily solve the issue.
The concept of AGI raises concerns about its potential unpredictability and the challenges of aligning its objectives with human values and priorities.
- 🔬 Ozempic creator plans AI supercomputer to discover new drugs
The Novo Nordisk Foundation is investing in "Gefion," an AI supercomputer project developed in collaboration with Nvidia.
"Gefion" aims to be the world’s most powerful AI supercomputer for health sciences, utilizing Nvidia's new chips to accelerate scientific breakthroughs in critical areas such as drug discovery, disease diagnosis, and treatment,
This initiative underscores the growing integration of AI in healthcare, promising to catalyze significant scientific discoveries and innovations that could transform patient care and outcomes.
- MindEye2: AI mind reading from brain activity
MindEye2 is a revolutionary model that reconstructs visual perception from brain activity using just one hour of data. Traditional methods require extensive training data, making them impractical for real-world applications. However, MindEye2 overcomes this limitation by leveraging shared-subject models. The model is pretrained on data from seven subjects and then fine-tuned with minimal data from a new subject.
- Nvidia NIM enables faster deployment of AI models
NVIDIA has introduced NVIDIA NIM (NVIDIA Inference Microservices) to accelerate the deployment of AI applications for businesses. NIM is a collection of microservices that package essential components of an AI application, including AI models, APIs, and libraries, into a container. These containers can be deployed in environments such as cloud platforms, Linux servers, or serverless architectures.
- Microsoft hires DeepMind co-founder to lead a new AI division
Mustafa Suleyman, a renowned co-founder of DeepMind and Inflection, has recently joined Microsoft as the leader of Copilot. Satya Nadella, Microsoft's CEO, made this significant announcement, highlighting the importance of innovation in artificial intelligence (AI).
In his new role as the Executive Vice President and CEO of Microsoft AI, Mustafa will work alongside Karén Simonyan, another talented individual from Inflection who will serve as Chief Scientist. Together, they will spearhead the development and advancement of Copilot and other exciting consumer AI products at Microsoft. Mustafa and his team's addition to the Microsoft family brings a wealth of expertise and promises groundbreaking advancements in AI.
- Google DeepMind’s new AI tool can analyze soccer tactics and offer insights
DeepMind has partnered with Liverpool FC to develop a new AI tool called TacticAI. TacticAI uses generative and predictive AI to help coaches determine which player will most likely receive the ball during corner kicks, whether a shot will be taken, and how to adjust player setup. It aims to revolutionize soccer and help the teams enhance their efficiency.
- Pika Labs introduces sound effects for its gen-AI video generation
Pika Labs has now added the ability to create sound effects from a text prompt for its generative artificial intelligence videos. It allows for automatic or custom SFX generations to pair with video outputs. Now, users can make bacon sizzle, lions roar, or add footsteps to the video of someone walking down the street. It is only available to pro users.
- Buildbox 4 Alpha enables users to create 3D video games from text prompts
Buildbox has released an alpha version of Buildbox 4. It's an AI-first game engine that allows users to create games and generate assets from text prompts. The alpha version aims to make text-to-game a distinct reality. Users can create various assets and animations from simple text prompts. It also allows users to build a gaming environment in a few minutes.
- Nvidia adds generative AI capabilities to empower humanoid robots
Nvidia introduced Project GR00T, a multimodal AI that will power future humanoids with advanced foundation AI. Project GR00T enables humanoid robots to input text, speech, videos, or even live demos and process them to take specific actions. It has been developed with the help of Nvidia’s Isaac Robotic Platform tools, including an Isaac Lab for RLHF.
- Perplexity AI, a hyped Silicon Valley AI startup that claimed to take on Google, was found out copying Google results directly
It never claimed to have an original or superior search algorithm. Why would you need to reinvent the wheel.
Their value is in having an LLM that uses existing search engines well.
- GitHub is launching the first beta of its code scanning autofix feature for finding and fixing security vulnerabilities during the coding process.
This new feature combines the real-time capabilities of GitHub’s Copilot with CodeQL, the company’s semantic code analysis engine. The company first previewed this capability last November.
OpenAI is planning to launch GPT-5 around mid-year, aiming to address previous performance issues and significantly improve upon its predecessor, GPT-4. GPT-5 is described as "materially better" by those who have seen demos, including enhancements and new capabilities like the ability to call AI agents for autonomous tasks, with enterprise customers having already previewed these improvements. The release timeline for GPT-5 remains uncertain as OpenAI continues its training and thorough safety and vulnerability testing, with no specific deadline for completion of these preparatory steps.
In June 2023, Inflection raised $1.3 billion led by Microsoft to develop "more personal AI" but was overtaken by Microsoft less than a year later, with co-founders joining Microsoft's new AI division. Despite significant investment, Inflection's AI, Pi, failed to compete with advancements from other companies such as OpenAI, Google’s Gemini, and Anthropic, leading to its downfall. Microsoft's takeover of Inflection reflects the strategy of legacy tech companies to dominate the AI space by supporting startups then acquiring them once they face challenges.
Nvidia CEO Jensen Huang predicts artificial general intelligence (AGI) could be achieved within 5 years, depending on how AGI is defined and measured. Huang addresses concerns around AI hallucinations, suggesting that ensuring answers are well-researched could easily solve the issue. The concept of AGI raises concerns about its potential unpredictability and the challenges of aligning its objectives with human values and priorities.
The Novo Nordisk Foundation is investing in "Gefion," an AI supercomputer project developed in collaboration with Nvidia. "Gefion" aims to be the world’s most powerful AI supercomputer for health sciences, utilizing Nvidia's new chips to accelerate scientific breakthroughs in critical areas such as drug discovery, disease diagnosis, and treatment, This initiative underscores the growing integration of AI in healthcare, promising to catalyze significant scientific discoveries and innovations that could transform patient care and outcomes.
MindEye2 is a revolutionary model that reconstructs visual perception from brain activity using just one hour of data. Traditional methods require extensive training data, making them impractical for real-world applications. However, MindEye2 overcomes this limitation by leveraging shared-subject models. The model is pretrained on data from seven subjects and then fine-tuned with minimal data from a new subject.
NVIDIA has introduced NVIDIA NIM (NVIDIA Inference Microservices) to accelerate the deployment of AI applications for businesses. NIM is a collection of microservices that package essential components of an AI application, including AI models, APIs, and libraries, into a container. These containers can be deployed in environments such as cloud platforms, Linux servers, or serverless architectures.
Mustafa Suleyman, a renowned co-founder of DeepMind and Inflection, has recently joined Microsoft as the leader of Copilot. Satya Nadella, Microsoft's CEO, made this significant announcement, highlighting the importance of innovation in artificial intelligence (AI). In his new role as the Executive Vice President and CEO of Microsoft AI, Mustafa will work alongside Karén Simonyan, another talented individual from Inflection who will serve as Chief Scientist. Together, they will spearhead the development and advancement of Copilot and other exciting consumer AI products at Microsoft. Mustafa and his team's addition to the Microsoft family brings a wealth of expertise and promises groundbreaking advancements in AI.
DeepMind has partnered with Liverpool FC to develop a new AI tool called TacticAI. TacticAI uses generative and predictive AI to help coaches determine which player will most likely receive the ball during corner kicks, whether a shot will be taken, and how to adjust player setup. It aims to revolutionize soccer and help the teams enhance their efficiency.
Pika Labs has now added the ability to create sound effects from a text prompt for its generative artificial intelligence videos. It allows for automatic or custom SFX generations to pair with video outputs. Now, users can make bacon sizzle, lions roar, or add footsteps to the video of someone walking down the street. It is only available to pro users.
Buildbox has released an alpha version of Buildbox 4. It's an AI-first game engine that allows users to create games and generate assets from text prompts. The alpha version aims to make text-to-game a distinct reality. Users can create various assets and animations from simple text prompts. It also allows users to build a gaming environment in a few minutes.
Nvidia introduced Project GR00T, a multimodal AI that will power future humanoids with advanced foundation AI. Project GR00T enables humanoid robots to input text, speech, videos, or even live demos and process them to take specific actions. It has been developed with the help of Nvidia’s Isaac Robotic Platform tools, including an Isaac Lab for RLHF.
It never claimed to have an original or superior search algorithm. Why would you need to reinvent the wheel. Their value is in having an LLM that uses existing search engines well.
This new feature combines the real-time capabilities of GitHub’s Copilot with CodeQL, the company’s semantic code analysis engine. The company first previewed this capability last November.
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 19th, 2024
- 💻 Nvidia launches 'world's most powerful AI chip': Nvidia has revealed its new Blackwell B200 GPU and GB200 "superchip", claiming it to be the world's most powerful chip for AI. Both B200 and GB200 are designed to offer powerful performance and significant efficiency gains.
Key takeaways:
- The B200 offers up to 20 petaflops of FP4 horsepower, and Nvidia says it can reduce costs and energy consumption by up to 25 times over an H100.
-
The GB200 "superchip" can deliver 30X the performance for LLM inference workloads while also being more efficient.
-
Nvidia claims that just 2,000 Blackwell chips working together could train a GPT -4-like model comprising 1.8 trillion parameters in just 90 days.
- 🎥 Stability AI's SV3D turns a single photo into a 3D video: Stability AI released Stable Video 3D (SV3D), a new generative AI tool for rendering 3D videos. SV3D can create multi-view 3D models from a single image, allowing users to see an object from any angle. This technology is expected to be valuable in the gaming sector for creating 3D assets and in e-commerce for generating 360-degree product views.
- 🤖 OpenAI CEO hints at "Amazing Model", maybe ChatGPT-5: OpenAI CEO Sam Altman has announced that the company will release an "amazing model" in 2024, although the name has not been finalized. Altman also mentioned that OpenAI plans to release several other important projects before discussing GPT-5, one of which could be the Sora video model.
- 🤝 Apple is in talks to bring Google's AI to iPhones: Apple and Google are negotiating a deal to integrate Google's Gemini AI into iPhones, potentially shaking up the AI industry. The deal would expand on their existing search partnership. Apple also held discussions with OpenAI. If successful, the partnership could give Gemini a significant edge with billions of potential users.
- 🏷️YouTube rolls out AI content labels: YouTube now requires creators to self-label AI-generated or synthetic content in videos. The platform may add labels itself for potentially misleading content. However, the tool relies on creators being honest, as YouTube is still working on AI detection tools.
- 🎮Roblox speeds up 3D creation with AI tools: Roblox has introduced two AI-driven tools to streamline 3D content creation on its platform. Avatar Auto Setup automates the conversion of 3D body meshes into fully animated avatars, while Texture Generator allows creators to quickly alter the appearance of 3D objects using text prompts, enabling rapid prototyping and iteration.
- 🌐Nvidia teams up with Shutterstock and Getty Images for AI-generated 3D content: Nvidia's Edify AI can now create 3D content, and partnerships with Shutterstock and Getty Images will make it accessible to all. Developers can soon experiment with these models, while industry giants are already using them to create stunning visuals and experiences.
- 🖌️Adobe Substance 3D introduces AI-powered text-to-texture tools: Adobe has introduced two AI-driven features to its Substance 3D suite: "Text to Texture," which generates photo-realistic or stylized textures from text prompts, and "Generative Background," which creates background images for 3D scenes. Both tools use 2D imaging technology from Adobe's Firefly AI model to streamline 3D workflows.
- 💥 Nvidia unveils the most powerful AI chip ever: Nvidia unveils the Blackwell B200 GPU, labeled as the "world's most powerful chip" for AI, capable of delivering up to 20 petaflops of FP4 horsepower.
The GB200 superchip, which combines two B200 GPUs and a single Grace CPU, can provide 30 times the performance for LLM inference workloads compared to the H100, with a reduction in cost and energy consumption by up to 25x.
Nvidia introduced a new network switch chip to enhance connectivity between multiple GPUs, enabling 576 GPUs to communicate with 1.8 terabytes per second.
- 🤖 Nvidia unveils Project GR00T, an AI platform to power humanoids of the future: Nvidia has announced Project GROOT, its new foundational model aimed at helping the development of robots in industrial use cases.
Project GROOT is designed to enable robots to understand natural language and learn actions by observing humans, enhancing their ability to adapt and interact with the real world.
The initiative is supported by Nvidia's new Jetson Thor computing system, featuring a GPU based on the Nvidia Blackwell architecture, to power these advanced humanoid robots.
- 🚿 SEC charges investment advisors for ‘AI washing’: The SEC charged two investment advisors, Delphia and Global Predictions, for making misleading claims about their use of artificial intelligence, a practice referred to as "AI washing."
Following a cease-and-desist order, both companies agreed to settle the charges by paying a total of $400,000 in civil penalties, after being accused of deceiving investors and regulators about their AI capabilities and regulatory compliance.
SEC Chair Gary Gensler emphasized the importance of truthfulness in marketing AI capabilities, warning against the damages of "AI washing" in investor advisement and financial practices.
- 📹 Stability AI launches new model that turns images into 3D videos: Stability AI introduces two versions of Stable Video 3D (SV3D), enabling the creation of 3D video "meshes" from image prompts, with advanced features like "specified camera paths".
The SV3D model, building on Stable Video Diffusion, can generate 3D model videos of various objects without needing images from all angles, following training on extensive datasets.
Available for both commercial and non-commercial uses, SV3D is seen as valuable for generating 3D assets in gaming and creating immersive 360-degree videos for e-commerce.
Key takeaways:
- The B200 offers up to 20 petaflops of FP4 horsepower, and Nvidia says it can reduce costs and energy consumption by up to 25 times over an H100.
- The GB200 "superchip" can deliver 30X the performance for LLM inference workloads while also being more efficient.
- Nvidia claims that just 2,000 Blackwell chips working together could train a GPT -4-like model comprising 1.8 trillion parameters in just 90 days.
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 18th, 2024
- Nvidia Announcing a Platform for Trillion-Parameter Gen AI Scaling
- Sam Altman during the new Lex interview: “We will release an amazing model this year. I don’t know what we will call it.”
- Stability AI: Today, we are releasing Stable Video 3D, a generative model based on Stable Video Diffusion. This new model advances the field of 3D technology, delivering greatly improved quality and multi-view.
- 🕰️ Bernie’s 4 day workweek: less work, same pay: Sen. Bernie Sanders has introduced the Thirty-Two Hour Workweek Act, which aims to establish a four-day workweek in the United States without reducing pay or benefits. To be phased in over four years, the bill would lower the overtime pay threshold from 40 to 32 hours, ensuring that workers receive 1.5 times their regular salary for work days longer than 8 hours and double their regular wage for work days longer than 12 hours
- 🗣️ Google's AI brings photos to life as talking avatars: Google's latest AI research project VLOGGER, automatically generates realistic videos of talking and moving people from just a single image and an audio or text input. It is the first model that aims to create more natural interactions with virtual agents by including facial expressions, body movements, and gestures, going beyond simple lip-syncing.
- Nvidia unveils next-gen Blackwell GPUs with 25X lower costs and energy consumption
- 🤖Elon Musk’s xAI open-sources Grok AI: Elon Musk's xAI has open-sourced the base model weights and architecture of its AI chatbot, Grok. This allows researchers and developers to freely use and build upon the 314 billion parameter Mixture-of-Experts model. Released under the Apache 2.0 license, the open-source version is not fine-tuned for any particular task.
- 🧠 Maisa KPU may be the next leap in AI reasoning: Maisa has released the beta version of its Knowledge Processing Unit (KPU), an AI system that uses LLMs’ advanced reasoning and data processing abilities. In an impressive demo, the KPU assisted a customer with an order-related issue, even when the customer provided an incorrect order ID, showing the system's understanding abilities.
- 🍿 PepsiCo increases market domination using GenAI: PepsiCo uses GenAI in product development and marketing for faster launches and better profitability. It has increased market penetration by 15% by using GenAI to improve the taste and shape of products like Cheetos based on customer feedback. The company is also doubling down on its presence in India, with plans to open a third capability center to develop local talent
- 💻 Deci launches Nano LLM & GenAI dev platform: Israeli AI startup Deci has launched two major offerings: Deci-Nano, a small closed-source language model, and a complete Generative AI Development Platform for enterprises. Compared to rivals like OpenAI and Anthropic, Deci-Nano offers impressive performance at low cost, and the new platform offers a suite of tools to help businesses deploy and manage AI solutions.
- 🎮 Invoke AI simplifies game dev workflows: Invoke has launched Workflows, a set of AI tools designed for game developers and large studios. These tools make it easier for teams to adopt AI, regardless of their technical expertise levels. Workflows allow artists to use AI features while maintaining control over their training assets, brand-specific styles, and image security
- 🚗 Mercedes teams up with Apptronik for robot workers: Mercedes-Benz is collaborating with robotics company Apptronik to automate repetitive and physically demanding tasks in its manufacturing process. The automaker is currently testing Apptronik's Apollo robot, a 160-pound bipedal machine capable of lifting objects up to 55 pounds. The robot inspects and delivers components to human workers on the production line, reducing the physical strain on employees and increasing efficiency.
- 💥 Apple in talks with Google to use their AI models: Apple and Google are discussing a partnership to integrate Google's Gemini AI into Apple's iPhone software features.
The collaboration could enhance their existing search partnership, which currently involves Google paying Apple approximately $20 billion annually to be the default search engine on iOS devices.
Despite ongoing negotiations and potential antitrust concerns, the deal, aimed at introducing powerful AI capabilities to iPhones, may not be announced until Apple's developer conference in June.
- 📹 YouTube mandates AI content disclosure by creators: YouTube now mandates creators to inform viewers when AI was used to make content appear realistically, through a new tool in Creator Studio for disclosing altered or synthetically generated media.
The policy aims to reduce deception among viewers by distinguishing synthetic content from real, especially amid concerns about AI and deepfakes influencing U.S. presidential election perceptions.
Exemptions to the disclosure requirement include clearly fantastical content and use of AI in production assistance, focusing instead on realistic depictions of people, places, events, and voices.
- 🍎 Apple introduces new 'MM1' AI model : Apple researchers have unveiled the 'MM1' AI model, which is capable of training on both text and visual inputs, aiming to create more intelligent and flexible AI systems.
The MM1 model utilizes a diverse dataset that includes image-caption pairs and text-data, improving its performance on tasks like image captioning and visual question answering.
The research highlights the MM1 model's advanced in-context learning abilities, especially in its largest configuration, enabling multi-step reasoning over images with minimal examples.
- Google AI releases MELON, a New Technique for Constructing 3D Objects from Images: MELON is a technique for reconstructing 3D objects from images without known camera positions. MELON uses a lightweight neural network to infer camera poses and incorporates a modulo loss that accounts for objects' pseudo-symmetries, allowing reconstruction from as few as 4-6 images. Demonstrated on the NeRF-Synthetic dataset, MELON achieves accurate reconstructions and novel view synthesis, showing promise for applications in fields where precise camera pose information is unavailable.
MELON can easily be integrated into existing NeRF methods and requires as few as 4–6 images of an object.
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Skin Stem Cell Serum
AI Daily Chronicle of AI Innovations - March 16th, 2024
- 🔍 FTC is probing Reddit’s AI licensing deals: Reddit is under investigation by the FTC for its data licensing practices concerning user-generated content being used to train AI models.
The investigation focuses on Reddit's engagement in selling, licensing, or sharing data with third parties for AI training.
Reddit anticipates generating approximately USD 60 million in 2024 from a data licensing agreement with Google, aiming to leverage its platform data for training LLMs.
- 💻 New jailbreak uses ASCII art to elicit harmful responses from leading LLMs: Researchers identified a new vulnerability in leading AI language models, named ArtPrompt, which uses ASCII art to exploit the models' security mechanisms.
ArtPrompt masks security-sensitive words with ASCII art, fooling language models like GPT-3.5, GPT-4, Gemini, Claude, and Llama2 into performing actions they would otherwise block, such as giving instructions for making a bomb.
The study underscores the need for enhanced defensive measures for language models, as ArtPrompt, by leveraging a mix of text-based and image-based inputs, can effectively bypass current security protocols.
- ArXiv Papers as Audiobooks Official Implementation: converts ArXiv papers into engaging video formats or audio files, utilizing latex conversion, HTML parsing, OpenAI GPT for paraphrasing and simplification, Google's text-to-speech for audio, and video mapping, offering both detailed and summarized versions with the option to upload audio to Google Drive.
- OpenAI aims to make its own AI processors — chip venture in talks with Abu Dhabi investment firm
- Once “too scary” to release, GPT-2 gets squeezed into an Excel spreadsheet.
- AI Weekly Rundown March 09 to March 16th, 2024
🖼️ Huawei's PixArt-Σ paints prompts to perfection
🧠 Meta cracks the code to improve LLM reasoning
📈 Yi Models exceed benchmarks with refined data
🚀 Cohere introduces production-scale AI for enterprises
🎯 RFM-1 redefines robotics with human-like reasoning
🎧 Spotify introduces audiobook recommendations
👨💻 Devin: The first AI software engineer redefines coding
🗣️ Deepgram’s Aura empowers AI agents with authentic voices
🖥️ Meta introduces two 24K GPU clusters to train Llama 3
🎮 DeepMind's SIMA: The AI agent that's a Jack of all games
⚡ Claude 3 Haiku: Anthropic's lightning-fast AI solution for enterprises
🤖 ChatGPT gets a body with "Figure 01"
🛠️ Apple’s new recipe to build performant multimodal models
💥 Cerebras’ chip for enabling 10x larger models than GPT-4
💼 Apple buys startup DarwinAI ahead of a big push into GenAI in 2024
🖼️ Huawei's PixArt-Σ paints prompts to perfection
🧠 Meta cracks the code to improve LLM reasoning
📈 Yi Models exceed benchmarks with refined data
🚀 Cohere introduces production-scale AI for enterprises
🎯 RFM-1 redefines robotics with human-like reasoning
🎧 Spotify introduces audiobook recommendations
👨💻 Devin: The first AI software engineer redefines coding
🗣️ Deepgram’s Aura empowers AI agents with authentic voices
🖥️ Meta introduces two 24K GPU clusters to train Llama 3
🎮 DeepMind's SIMA: The AI agent that's a Jack of all games
⚡ Claude 3 Haiku: Anthropic's lightning-fast AI solution for enterprises
🤖 ChatGPT gets a body with "Figure 01"
🛠️ Apple’s new recipe to build performant multimodal models
💥 Cerebras’ chip for enabling 10x larger models than GPT-4
💼 Apple buys startup DarwinAI ahead of a big push into GenAI in 2024
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 15th, 2024
- 🥘 Apple’s MM1: The new recipe to master AI performance
- ⚡ Cerebras WSE-3: AI chip enabling 10x larger models than GPT-4
- 🤖 Apple acquires Canadian AI startup DarwinAI
- 🤖 Microsoft expands the availability of Copilot across life and work.
- 💻 Oracle adds groundbreaking Generative AI features to its software
- 💰 Databricks makes a strategic investment in Mistral AI
- 📱 Qualcomm emerges as a mobile AI juggernaut
- 👓 MIT researchers develop peripheral vision capabilities for AI models
- 🤔 Microsoft calls out Google dominance in generative AI
- 📝 Anthropic releases affordable, high-speed Claude 3 Haiku model
- 🚫 Midjourney bans prompts with Joe Biden and Donald Trump over election misinformation concerns
- 🤖 Mercedes tests humanoid robots for ‘low skill, repetitive’ tasks
- 💊 Health Equity Assessment of machine Learning performance (HEAL): Google introduce Health Equity Assessment of machine Learning performance (HEAL), a novel evaluation framework designed to quantitatively assess whether the performance of an ML-based health tool is equitable. Google propose a 4-step process for estimating the likelihood that an ML tool performs better for groups with, on average, worse health outcomes as compared to other groups, with the goal to inform improvements that make health ML technologies more equitable.
AI Daily Chronicle of AI Innovations - March 14th, 2024
- ⚡ Claude 3 Haiku: Anthropic's lightning-fast AI solution for enterprises
- 🎮 DeepMind's SIMA: The AI agent that's a Jack of all games
- 🤖 OpenAI-powered "Figure 01" can chat, perceive, and complete tasks
- 🎥 OpenAI’s Sora will be publicly available later this year
- 🛡️ Microsoft to expand AI-powered cybersecurity tool availability from April 1
- 📰 OpenAI partners with Le Monde, Prisa Media for news content in ChatGPT
- 🏠 Icon's AI architect and 3D printing breakthroughs reimagine homebuilding
- 🛍️ Amazon streamlines product listing process with new AI tool
- Claude 3 Haiku is now available for free on Perplexity Labs! Try it now.
- Humanoid robots could fight as early as 2030, US colonel predicts
- Defense in Depth: An Action Plan to Increase the Safety and Security of Advanced AI
- Building Meta’s GenAI Infrastructure: "Meta’s long-term vision is to build artificial general intelligence (AGI) that is open and built responsibly so that it can be widely available for everyone to benefit from"
AI Daily Chronicle of AI Innovations - March 13th, 2024
- 🧠 This Software Engineer AI Can Train Other AIs, Code Websites by Itself: Engineers at Cognition Labs have developed Devin, an AI that can autonomously code, complete engineering jobs on platforms like Upwork, and self-improve by tuning its own AI models. Devin is capable of completing entire projects independently by learning from the internet and can debug problems without human intervention, according to Cognition Labs CEO Scott Wu. The AI has demonstrated its capabilities by coding a basic Pong game and creating a website from scratch in under 20 minutes, potentially changing the nature of software engineering work.
- 🇪🇺 European lawmakers pass world’s first major regulation for AI: European Union lawmakers have voted to officially adopt the Artificial Intelligence Act (AI Act), aimed at regulating AI technology, including prohibiting certain uses and demanding transparency from providers. The AI Act's implementation includes a phased approach, with varying deadlines for banning prohibited AI systems and enforcing rules on "general-purpose AI systems" and "high-risk" AI systems. Critics express concerns over the AI Act's potential to stifle innovation and disadvantage European AI companies, while others see it as a step towards safer and more transparent AI development.
- 🎮 Google Deepmind’s new AI will play video games with you: Google DeepMind introduced SIMA, an AI agent designed to play video games more like a human partner, aiming to complement rather than replace human players in gaming experiences. SIMA, which is being trained to understand and perform tasks in video games without the need for winning, has been developed in collaboration with game developers like Hello Games and Coffee Stain, focusing on open-world and non-linear games. With about 600 basic skills and the potential for more complex tasks, SIMA represents an evolving AI tool that could change player interaction by bringing AI-controlled characters that learn and adapt alongside human players.
- 🗣️ Deepgram’s Aura empowers AI agents with authentic voices
- 🖥️ Meta introduces two 24K GPU clusters to train Llama 3
- 🎮 Google Play to display AI-powered FAQs and recent YouTube videos for games
- 🛡️ DoorDash’s new AI-powered tool automatically curbs verbal abuses
- 🔍 Perplexity has decided to bring Yelp data to its chatbot
- 👗 Pinterest’s ‘body types ranges’ tool delivers more inclusive search results
- 🚀 OpenAI’s GPT-4.5 Turbo is all set to be launched in June 2024
- 🚀 With OpenAI, Figure 01 can now have full conversations with people
- WSJ interview with OpenAI CTO Mira Murati on Sora, set for release "definitely this year, but could be a few months"
- Cerebras Systems Unveils World’s Fastest AI Chip with Whopping 4 Trillion Transistors - Cerebras
- New LLM Leaderboard measuring Uncensored General Intelligence
AI Daily Chronicle of AI Innovations - March 12th, 2024
- 🚫 Google restricts election-related queries for its Gemini chatbot
- 💰 AI startups reach record funding of nearly $50 billion in 2023
- 🚀Cohere’s introduces production-scale AI for enterprises
- 🙃 Midjourney bans all its competitor's employees
- 🤖 RFM-1 redefines robotics with human-like reasoning
- 🎧 Spotify introduces audiobook recommendations
- 💡 Elon Musk makes xAI's Grok chatbot open-source
- 🖼️ Midjourney launches character consistent feature
- 🤖 Apple tests AI for App Store ad optimization
- 🏥China tests AI chatbot to assist neurosurgeons
- 🧓South Korea deploys AI dolls to tackle elderly loneliness
- 🚀 A new training strategy, Gradient Low-Rank Projection (GaLore), has been proposed that allows full-parameter learning while being more memory-efficient than common low-rank adaptation methods such as LoRA.
- openrouter.ai is quickly becoming a go-to source for LLMs seeking the best models and prices for their prompts. With billions of tokens and top contributors like Mistral and Gemini, it's no surprise that the platform is gaining traction.
- Answer Ai is revolutionizing the world of language models with their latest project. They have developed an open-source system that can train a 70b large language model on a regular desktop computer with two or more standard migaming GPUs (RTX 3090 or 4090). This breakthrough system is the result of a collaboration between Answer.AI, Tim Dettmers (U Washington), and hashtag#HuggingFace’s Titus von Köller and Sourab Mangrulkar.
AI Daily Chronicle of AI Innovations - March 11th, 2024
- 🖼️ Huawei's PixArt-Σ paints prompts to perfection
- 🧠 Meta cracks the code to improve LLM reasoning
- 📈 Yi Models exceed benchmarks with refined data
- OpenAI's Evolution into Skynet: AI and Robotics Future, Figure Humanoid Robots
- 🏠 Redfin's AI can tell you about your dream neighborhood
- 🔊 Pika Labs Adds Sound to Silent AI Videos
- 🖥️ HP's new AI-powered PCs redefine work
Zulay Fruit Press Machine - Masticating Juicer Machine with High Yield, Quiet Motor, & Reverse Function - Celery Juicer & Carrot #Ad
I researched the Zulay Fruit Press Machine Masticating Juicer Machine with High Yield Quiet Motor Reverse Function Celery Juicer Carrot Juicer with Wide Chute Slow Juicer Cold Press for Fruits Vegetables and I thought you might find the following analysis helpful.
Users liked:
Produces high-quality juice with dry pulp (backed by 3 comments)
Easy to assemble and clean (backed by 3 comments)
Efficient extraction of juice (backed by 3 comments)
Order it Now, Go Green and Get healthier
AI Daily Chronicle of AI Innovations - March 02 - 09th, 2024
- Sam say's AI will do what 95% of marketer's do.
- Microsoft may debut its first 'AI PCs' later this month: A report suggests an OLED Surface Pro 10 and Surface Laptop 6 are imminent.
- 🤖 Multimodal, Adversarial, Auto Pruning, Auto Trainer: Open Source Release: EMMAT (Efficient Multi-Modal Language Model Auto Trainer) is a groundbreaking framework that revolutionizes the way we train large-scale language models. By leveraging cutting-edge techniques like Dynamic Knowledge Distillation, Pruning, and Multi-Modal Adversarial Transfer Learning, EMMAT enables language models to learn from multiple modalities (text, images, audio) and adapt to new tasks with unparalleled efficiency.
- Saudi Arabia's Male Humanoid Robot Accused of Sexual Harassment
- 👀 Google’s ScreenAI can ‘see’ graphics like humans do
- 🐛 How AI ‘worms’ pose security threats in connected systems
- 🧠 New benchmarking method challenges LLMs' reasoning abilities
- 🏆 Anthropic’s Claude 3 Beats OpenAI’s GPT-4
- 🖼️ TripsoSR: 3D object generation from a single image in <1s
- 🛡️ Cloudflare's Firewall for AI protects LLMs from abuses
- 🐋 Microsoft's Orca AI beats 10x bigger models in math
- 🔍 Google’s search update targets AI-generated spam
- 🎨 GPT-4V wins at turning designs into code
- 🎥 Haiper by DeepMind alums joins the AI video race
- 🗣️ Microsoft's NaturalSpeech makes AI sound human
- 🔍 Google’s search update targets AI-generated spam
- 🤖 Google's RT-Sketch teaches robots with doodles
- 🚀 Inflection-2.5: GPT4-like performance with 40% less compute
- 📱 Google’s new tool enables LLMs to run fully on-device
- 💡 GaLore: For memory-efficient pre-training & fine-tuning of LLMs
- 📜 The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits!: Researchers have introduced BitNet b1.58, a groundbreaking 1-bit LLM where every parameter is ternary {-1, 0, 1}. It achieves the same performance as traditional full-precision LLMs but is significantly more efficient in terms of latency, memory, throughput, and energy consumption.
- Beyond Language Models: Byte Models are Digital World Simulators - Microsoft Research Asia 2024 - bGPT - Exceptional capabilities in simulating CPU behaviour, with an accuracy exceeding 99.99% in executing various operations! Could help combat the problems with tokenisation!
Paper: [https://arxiv.org/abs/2402.191...](https://arxiv.org/abs/2402.191...) Paper Page with **code and weights**: [https://byte-gpt.github.io/](https://byte-gpt.github.io/) Abstract: >**Traditional deep learning often overlooks bytes, the basic units of the digital world, where all forms of information and operations are encoded and manipulated in binary format**. Inspired by the success of next token prediction in natural language processing, we introduce **bGPT**, a model with **next byte prediction** to simulate the digital world. bGPT matches specialized models in performance across various modalities, including text, audio, and images, and offers new possibilities for predicting, simulating, and diagnosing algorithm or hardware behaviour. It has almost **flawlessly replicated** the process of converting symbolic music data, achieving a **low error rate of 0.0011 bits per byte** in converting ABC notation to MIDI format. In addition, bGPT **demonstrates exceptional capabilities in simulating CPU behaviour, with an accuracy exceeding 99.99% in executing various operations.** Leveraging next byte prediction, models like bGPT can directly learn from vast binary data, effectively simulating the intricate patterns of the digital world. Posted Image
Invest in your future today by enrolling in this Azure Fundamentals - Pass the Azure Fundamentals Exam with Ease: Master the AZ-900 Certification with the Comprehensive Exam Preparation Guide!
AI Daily Chronicle of AI Innovations - March 08th, 2024
- OpenAI announces new members to board of directors: Dr. Sue Desmond-Hellmann, former CEO of the Bill and Melinda Gates Foundation, Nicole Seligman, former EVP and General Counsel at Sony Corporation and Fidji Simo, CEO and Chair of Instacart. Additionally, Sam Altman, CEO, will rejoin the OpenAI Board of Directors.
- Inflection 2.5: A new era of personal AI is here!
- Google announces LLMs on device with MediaPipe
- GaLore: A new method for memory-efficient LLM training
- Adobe makes creating social content on mobile easier
- OpenAI now allows users to add MFA to user accounts
- US Army is building generative AI chatbots in war games
- Cognizant launches AI lab in San Francisco to drive innovation
- 🛡️OpenAI now allows users to add MFA to user accounts
- 🧑🎨 Claude 3 builds the painting app in 2 minutes and 48 seconds
Firming Moisturizer, Advanced Hydrating Facial Replenishing Cream, with Hyaluronic Acid, Resveratrol & Natural Botanicals to Restore Skin's Strength, Radiance, and Resilience, 1.75 Oz
AI Daily Chronicle of AI Innovations - March 07th, 2024
- OpenAI finally releases what everyone has been waiting for, multi-factor authentication.
- 🗣️Microsoft's NaturalSpeech makes AI sound human
- 🔍Google’s search update targets AI-generated spam
- 🤖Google's RT-Sketch teaches robots with doodles
- Google's Gemini lets users edit within the chatbox
- 📈Adobe's AI boosts IBM's marketing efficiency
- 💡 Zapier's new tool lets you make AI bots without coding
- 🤝Accenture teams up with Cohere to bring AI to enterprises
- 🎥 Meta builds mega AI model for video recommendations
- OpenAI is researching photonic processors to run their AI on
Active Anti-Aging Eye Gel, Reduces Dark Circles, Puffy Eyes, Crow's Feet and Fine Lines & Wrinkles, Packed with Hyaluronic Acid & Age Defying Botanicals
AI Daily Chronicle of AI Innovations - March 07th, 2024
- 🏆 Microsoft's Orca AI beats 10x bigger models in math
- 🎨 GPT-4V wins at turning designs into code
- 🎥 DeepMind alums' Haiper joins the AI video race
- OpenAI vs Musk (openai responds to elon musk).
- Guy builds an AI-steered homing/killer drone in just a few hours
- Claude 3 vs. GPT-4
- Always Say Hello to Your GPTs... (Better Performing Custom GPTs)
- 📱 AI app diagnoses ear infections with a snap
Skin Stem Cell Serum
AI Daily Chronicle of AI Innovations - March 05th, 2024
- 🏆Anthropic’s Claude 3 Beats OpenAI’s GPT-4
- AIs ranked by IQ; AI passes 100 IQ for first time, with release of Claude-3
- 🖼️ TripsoSR: 3D object generation from a single image in <1s
- 🔒 Cloudflare's Firewall for AI protects LLMs from abuses
- 🤖 ChatGPT can now read your responses out loud
- 💻 Wix’s new AI chatbot can build websites in a flash
- 🛒 Amazon adds Claude 3 models to Bedrock
- 🫀AI tool detects kidney failure 6x faster compared to human experts
- 🚀 Groq launches GroqCloud, a developer playground to access Groq LPU
Can AI Really Predict Lottery Results? We Asked an Expert.
AI Daily Chronicle of AI Innovations - March 04th, 2024
- 👀 Google’s ScreenAI can ‘see’ graphics like humans do
- 🐛 How AI ‘worms’ pose security threats in connected systems
- 🧠 New benchmarking method challenges LLMs' reasoning abilities
- 💊 AI may enable personalized prostate cancer treatment
- 🎥 Vimeo debuts AI-powered video hub for business collaboration
- 📱 Motorola revving up for AI-powered Moto X50 Ultra launch
- 📂 Copilot will soon fetch and parse your OneDrive files
- ⚡ Huawei's new AI chip threatens Nvidia's dominance in China
- Anthropic launches Claude 3, claiming to outperform GPT-4 across the board
AI Daily Chronicle of AI Innovations - March 02nd, 2024
- This AI Paper from Meta AI Explores Advanced Refinement Strategies: Unveiling the Power of Stepwise Outcome-based and Process-based Reward Models.
- AI worm infects users via AI-enabled email clients — Morris II generative AI worm steals confidential data as it spreads.
- Korean new AI image generator is 8 times faster than OpenAI’s best tool — and can run on cheap computers.
- AI-generated porn, including celebrity fake nudes, persist on Etsy as deepfake laws ‘lag behind’.
AI Daily Chronicle of AI Innovations - March 01st, 2024
- 💥 Elon Musk sues OpenAI and Sam Altman over ‘betrayal’
- 🪄Sora showcases jaw-dropping geometric consistency
- 🧑✈️Microsoft introduces Copilot for finance in Microsoft 365
- 🤖OpenAI and Figure team up to develop AI for robots
- 🔍 SEC reportedly probing whether OpenAI CEO Sam Altman misled investors
- 💼 Microsoft introduces Copilot AI chatbot for Excel
- 🤝 Google Cloud adds Stack Overflow's knowledge base to Gemini AI
- At least 100 cases of malicious ML models were found on Hugging Face, some of which can execute code on users' machines.
- "BadGPT" and "FraudGPT" are two examples of LLMs sold on the dark web to write phishing emails, create fake websites, and create malware.
- A look at how AI is casting a long shadow on the adult entertainment industry, as AI "dream girls" threaten to replace human actresses.
- And OpenAI faces two new lawsuits: one from publications over copyright infringement and one from Elon Musk over abandoning its mission.
February 2024 AI Recap
AI Daily Chronicle of AI Innovations - February 29th, 2024
- 📸 Alibaba's EMO makes photos come alive (and lip-sync!)
- 💻 Microsoft introduces 1-bit LLM
- 🖼️ Ideogram launches text-to-image model version 1.0
- 🎵Adobe launches new GenAI music tool
- 🎥Morph makes filmmaking easier with Stability AI
- 💻 Hugging Face, Nvidia, and ServiceNow release StarCode 2 for code generation.
- 📅Meta set to launch Llama 3 in July
- 🤖 Apple subtly reveals its AI plans
- 🤖 OpenAI to put AI into humanoid robots
- 💥 GitHub besieged by millions of malicious repositories in ongoing attack
- 😳 Nvidia just released a new code generator that can run on most modern CPUs
- ⚖️ Three more publishers sue OpenAI
AI Daily Chronicle of AI Innovations - February 28th, 2024
- 🏆NVIDIA's Nemotron-4 beats 4x larger multilingual AI models
- 👩💻 GitHub launches Copilot Enterprise for customized AI coding
- ⏱️ Slack study shows AI frees up 41% of time spent on low-value work
- 🎞️ Pika launches new lip sync feature for AI videos
- 💰 Google pays publishers to test an unreleased GenAI tool
- 🤝 Intel and Microsoft team up to bring 100M AI PCs by 2025
- 📊 Writer’s Palmyra-Vision summarizes charts, scribbles into text
- 🚗 Apple cancels its decade-long electric car project
- NVIDIA's Nemotron-4 beats 4x larger multilingual AI models
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
AI Daily Chronicle of AI Innovations - February 27th, 2024
- 🌪️ Mistral Large: The new rival to GPT-4, 2nd best LLM of all time
- 🎮 DeepMind’s new gen-AI model creates video games in a flash
- 📱 Meta’s MobileLLM enables on-device AI deployment
- 🤖 Tesla's robot is getting quicker, better
- 🧠 Nvidia CEO: kids shouldn't learn to code — they should leave it up to AI
- 🇪🇺 Microsoft's deal with Mistral AI faces EU scrutiny
- 🥽 Apple Vision Pro’s components cost $1,542—but that’s not the full story
- 🎮 PlayStation to axe 900 jobs and close studio
Top 1000 Canada Quiz and trivia: CANADA CITIZENSHIP TEST- HISTORY - GEOGRAPHY - GOVERNMENT- CULTURE - PEOPLE - LANGUAGES - TRAVEL - WILDLIFE - HOCKEY - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION
AI Daily Chronicle of AI Innovations - February 26th, 2024
- 🛡️ Microsoft eases AI testing with new red teaming tool
- 🧠 Transformers learn to plan better with Searchformer
- 👀 YOLOv9 sets a new standard for real-time object recognition
- 🍎Apple tests internal ChatGPT-like tool for customer support
- 📱 ChatGPT gets an Android home screen widget
- 🤖 AWS adds open-source Mistral AI models to Amazon Bedrock
- 🚇 Montreal tests AI system to prevent subway suicides
- 🍔 Fast food giants embrace controversial AI worker tracking
Top 1000 Africa Quiz and trivia: HISTORY - GEOGRAPHY - WILDLIFE - CULTURE - PEOPLE - LANGUAGES - TRAVEL - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION
AI Daily Chronicle of AI Innovations - February 24th, 2024
- 🤯 Google’s chaotic AI strategy
- 🛑 Filmmaker puts $800 million studio expansion on hold because of OpenAI’s Sora
- 🤖 Google explains Gemini’s ‘embarrassing’ AI pictures
- 🍎 Apple tests internal ChatGPT-like AI tool for customer support
- 🤝 Figure AI's humanoid robots attract funding from Microsoft, Nvidia, OpenAI, and Jeff Bezos
AI Daily Chronicle of AI Innovations - February 23rd, 2024
- 📱 Stable Diffusion 3 creates jaw-dropping images from text
- ✨ LongRoPE: Extending LLM context window beyond 2 million token
- 🤖 Google Chrome introduces "Help me write" AI feature
- 💸Jasper acquires image platform Clipdrop from Stability AI
- 🎧Suno AI V3 Alpha is redefining music generation.
- 🤖GPT Store introduces linking profiles, ratings, and enhanced about pages.
- ✏️Microsoft introduces a generative erase feature for AI-editing photos in Windows 11.
- 📢Google cut a deal with Reddit for AI training data.
AI Daily Chronicle of AI Innovations - February 22nd, 2024
- 🫠 Google suspends Gemini from making AI images after backlash
- 📈 Nvidia posts revenue up 265% on booming AI business
- 💰 Microsoft and Intel strike a custom chip deal that could be worth billions
- 🛑 AI researchers' open letter demands action on deepfakes before they destroy democracy
- 🎨 Stability AI's Stable Diffusion 3 preview boasts superior image and text generation capabilities
- 💡 Google releases its first open-source LLM
- 🔥 AnyGPT: A major step towards artificial general intelligence
- ☠ DeepMind forms new unit to address AI dangers
- 💑 Match Group bets on AI to help its workers improve dating apps
- 📱 Google Play Store tests AI-powered app recommendations
AI Daily Chronicle of AI Innovations - February 21st, 2024
- 📃 Adobe's new AI assistant manages your docs
- 🎤 Meta released Aria recordings to fuel smart speech recognition
- 🔥 Penn's AI chip runs on light, not electricity
- 🤖 Google launches two new AI models
- 🥴 ChatGPT has meltdown and starts sending alarming messages to users
- 💍 An Apple smart ring may be imminent
- 👆 New hack clones fingerprints by listening to fingers swipe screens
- 💬 iMessage gets major update ahead of 'quantum apocalypse'
- 🖱 Brain chip: Neuralink patient moves mouse with thoughts
- 💻 Microsoft develops server network cards to replace NVIDIA
- 🤝 Wipro and IBM team up to accelerate enterprise AI
- 📱 Telekom's next big thing: an app-free AI Phone
- 🚨 Tinder fights back against AI dating scams
💎 Introducing Gemma by Google
AI Daily Chronicle of AI Innovations - March 04th, 2024
- 👀 Google’s ScreenAI can ‘see’ graphics like humans do
- 🐛 How AI ‘worms’ pose security threats in connected systems
- 🧠 New benchmarking method challenges LLMs' reasoning abilities
- 💊 AI may enable personalized prostate cancer treatment
- 🎥 Vimeo debuts AI-powered video hub for business collaboration
- 📱 Motorola revving up for AI-powered Moto X50 Ultra launch
- 📂 Copilot will soon fetch and parse your OneDrive files
- ⚡ Huawei's new AI chip threatens Nvidia's dominance in China
- Anthropic launches Claude 3, claiming to outperform GPT-4 across the board
AI Daily Chronicle of AI Innovations - March 02nd, 2024
- This AI Paper from Meta AI Explores Advanced Refinement Strategies: Unveiling the Power of Stepwise Outcome-based and Process-based Reward Models.
- AI worm infects users via AI-enabled email clients — Morris II generative AI worm steals confidential data as it spreads.
- Korean new AI image generator is 8 times faster than OpenAI’s best tool — and can run on cheap computers.
- AI-generated porn, including celebrity fake nudes, persist on Etsy as deepfake laws ‘lag behind’.
AI Daily Chronicle of AI Innovations - March 01st, 2024
- 💥 Elon Musk sues OpenAI and Sam Altman over ‘betrayal’
- 🪄Sora showcases jaw-dropping geometric consistency
- 🧑✈️Microsoft introduces Copilot for finance in Microsoft 365
- 🤖OpenAI and Figure team up to develop AI for robots
- 🔍 SEC reportedly probing whether OpenAI CEO Sam Altman misled investors
- 💼 Microsoft introduces Copilot AI chatbot for Excel
- 🤝 Google Cloud adds Stack Overflow's knowledge base to Gemini AI
- At least 100 cases of malicious ML models were found on Hugging Face, some of which can execute code on users' machines.
- "BadGPT" and "FraudGPT" are two examples of LLMs sold on the dark web to write phishing emails, create fake websites, and create malware.
- A look at how AI is casting a long shadow on the adult entertainment industry, as AI "dream girls" threaten to replace human actresses.
- And OpenAI faces two new lawsuits: one from publications over copyright infringement and one from Elon Musk over abandoning its mission.
February 2024 AI Recap
AI Daily Chronicle of AI Innovations - February 29th, 2024
- 📸 Alibaba's EMO makes photos come alive (and lip-sync!)
- 💻 Microsoft introduces 1-bit LLM
- 🖼️ Ideogram launches text-to-image model version 1.0
- 🎵Adobe launches new GenAI music tool
- 🎥Morph makes filmmaking easier with Stability AI
- 💻 Hugging Face, Nvidia, and ServiceNow release StarCode 2 for code generation.
- 📅Meta set to launch Llama 3 in July
- 🤖 Apple subtly reveals its AI plans
- 🤖 OpenAI to put AI into humanoid robots
- 💥 GitHub besieged by millions of malicious repositories in ongoing attack
- 😳 Nvidia just released a new code generator that can run on most modern CPUs
- ⚖️ Three more publishers sue OpenAI
AI Daily Chronicle of AI Innovations - February 28th, 2024
- 🏆NVIDIA's Nemotron-4 beats 4x larger multilingual AI models
- 👩💻 GitHub launches Copilot Enterprise for customized AI coding
- ⏱️ Slack study shows AI frees up 41% of time spent on low-value work
- 🎞️ Pika launches new lip sync feature for AI videos
- 💰 Google pays publishers to test an unreleased GenAI tool
- 🤝 Intel and Microsoft team up to bring 100M AI PCs by 2025
- 📊 Writer’s Palmyra-Vision summarizes charts, scribbles into text
- 🚗 Apple cancels its decade-long electric car project
- NVIDIA's Nemotron-4 beats 4x larger multilingual AI models
Get 20% off Google Workspace (Google Meet) Business Plan (AMERICAS): M9HNXHX3WC9H7YE (Email us for more)
Get 20% off Google Google Workspace (Google Meet) Standard Plan with the following codes: 96DRHDRA9J7GTN6(Email us for more)
AI Daily Chronicle of AI Innovations - February 27th, 2024
- 🌪️ Mistral Large: The new rival to GPT-4, 2nd best LLM of all time
- 🎮 DeepMind’s new gen-AI model creates video games in a flash
- 📱 Meta’s MobileLLM enables on-device AI deployment
- 🤖 Tesla's robot is getting quicker, better
- 🧠 Nvidia CEO: kids shouldn't learn to code — they should leave it up to AI
- 🇪🇺 Microsoft's deal with Mistral AI faces EU scrutiny
- 🥽 Apple Vision Pro’s components cost $1,542—but that’s not the full story
- 🎮 PlayStation to axe 900 jobs and close studio
Top 1000 Canada Quiz and trivia: CANADA CITIZENSHIP TEST- HISTORY - GEOGRAPHY - GOVERNMENT- CULTURE - PEOPLE - LANGUAGES - TRAVEL - WILDLIFE - HOCKEY - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION
AI Daily Chronicle of AI Innovations - February 26th, 2024
- 🛡️ Microsoft eases AI testing with new red teaming tool
- 🧠 Transformers learn to plan better with Searchformer
- 👀 YOLOv9 sets a new standard for real-time object recognition
- 🍎Apple tests internal ChatGPT-like tool for customer support
- 📱 ChatGPT gets an Android home screen widget
- 🤖 AWS adds open-source Mistral AI models to Amazon Bedrock
- 🚇 Montreal tests AI system to prevent subway suicides
- 🍔 Fast food giants embrace controversial AI worker tracking
Top 1000 Africa Quiz and trivia: HISTORY - GEOGRAPHY - WILDLIFE - CULTURE - PEOPLE - LANGUAGES - TRAVEL - TOURISM - SCENERIES - ARTS - DATA VISUALIZATION
AI Daily Chronicle of AI Innovations - February 24th, 2024
- 🤯 Google’s chaotic AI strategy
- 🛑 Filmmaker puts $800 million studio expansion on hold because of OpenAI’s Sora
- 🤖 Google explains Gemini’s ‘embarrassing’ AI pictures
- 🍎 Apple tests internal ChatGPT-like AI tool for customer support
- 🤝 Figure AI's humanoid robots attract funding from Microsoft, Nvidia, OpenAI, and Jeff Bezos
AI Daily Chronicle of AI Innovations - February 23rd, 2024
- 📱 Stable Diffusion 3 creates jaw-dropping images from text
- ✨ LongRoPE: Extending LLM context window beyond 2 million token
- 🤖 Google Chrome introduces "Help me write" AI feature
- 💸Jasper acquires image platform Clipdrop from Stability AI
- 🎧Suno AI V3 Alpha is redefining music generation.
- 🤖GPT Store introduces linking profiles, ratings, and enhanced about pages.
- ✏️Microsoft introduces a generative erase feature for AI-editing photos in Windows 11.
- 📢Google cut a deal with Reddit for AI training data.
AI Daily Chronicle of AI Innovations - February 22nd, 2024
- 🫠 Google suspends Gemini from making AI images after backlash
- 📈 Nvidia posts revenue up 265% on booming AI business
- 💰 Microsoft and Intel strike a custom chip deal that could be worth billions
- 🛑 AI researchers' open letter demands action on deepfakes before they destroy democracy
- 🎨 Stability AI's Stable Diffusion 3 preview boasts superior image and text generation capabilities
- 💡 Google releases its first open-source LLM
- 🔥 AnyGPT: A major step towards artificial general intelligence
- ☠ DeepMind forms new unit to address AI dangers
- 💑 Match Group bets on AI to help its workers improve dating apps
- 📱 Google Play Store tests AI-powered app recommendations
AI Daily Chronicle of AI Innovations - February 21st, 2024
- 📃 Adobe's new AI assistant manages your docs
- 🎤 Meta released Aria recordings to fuel smart speech recognition
- 🔥 Penn's AI chip runs on light, not electricity
- 🤖 Google launches two new AI models
- 🥴 ChatGPT has meltdown and starts sending alarming messages to users
- 💍 An Apple smart ring may be imminent
- 👆 New hack clones fingerprints by listening to fingers swipe screens
- 💬 iMessage gets major update ahead of 'quantum apocalypse'
- 🖱 Brain chip: Neuralink patient moves mouse with thoughts
- 💻 Microsoft develops server network cards to replace NVIDIA
- 🤝 Wipro and IBM team up to accelerate enterprise AI
- 📱 Telekom's next big thing: an app-free AI Phone
- 🚨 Tinder fights back against AI dating scams
💎 Introducing Gemma by Google
Gemma, a new family of lightweight, advanced open models from Google, offers free access and tools for the AI community, promoting responsible innovation and collaboration.
Learn more: Full Article.
🎤 Meta Enhances Speech Recognition with Aria Recordings
Meta's release of a multi-modal dataset from Aria smart glasses aims to improve smart speech recognition, offering rich audio, video, and motion data for AI training. This innovation 🚀 promises to make AI interactions more natural and intuitive.
Explore more: Full Article.
📄 Adobe's New AI Assistant Manages Your Docs
Adobe introduced an AI assistant in Acrobat to enhance document handling, offering content summarization, query responses, and formatted overviews. The initiative also includes the formation of a dedicated AI research team, CAVA, to advance generative tools in media creation.
Learn more: Full Article.
Penn's AI chip runs on light, not electricity
Penn engineers have innovated an AI chip powered by light, enhancing AI computations with speed and efficiency. This photonic chip, blending optical computing with photonics, marks a significant leap in AI technology.
Learn more: Discover Full Article here.
💡 Gemini 1.5: A Cost-Efficient Leap Forward
Gemini 1.5's competitive edge against GPT-4 and its cost-efficiency could significantly impact the AI API market, posing challenges for OpenAI. 🚀
For a detailed analysis, visit: Read More.
🖱 Brain chip: Neuralink patient moves mouse with thoughts
Elon Musk announced successful recovery of the first Neuralink patient, who can now control a mouse cursor with thoughts. (Link)
💻 Microsoft develops server network cards to replace NVIDIA
Microsoft is creating networking cards to facilitate server data movement, aiming to lessen dependence on NVIDIA's offerings. (Link)
🤝 Wipro and IBM team up to accelerate enterprise AI
Wipro and IBM expand their collaboration, launching the Wipro Enterprise AI-Ready Platform for integrated AI environments. (Link)
📱 Telekom's next big thing: an app-free AI Phone
Deutsche Telekom unveils an AI-driven, app-free phone concept, emphasizing voice and text commands for daily tasks. (Link)
🚨 Tinder fights back against AI dating scams
Tinder enhances ID verification to combat AI-driven scams, requiring a driver's license and video selfie. (Link)
AI Daily Chronicle of AI Innovations - February 20th, 2024
- 🚀 Groq’s New AI Chip Outperforms ChatGPT
- 📊 BABILong: The new benchmark to assess LLMs for long docs
- 👥 Stanford’s AI model identifies sex from brain scans with 90% accuracy
Groq's LPU: A New Era for AI Processing
Groq's innovative AI chip, dubbed the "GroqChip," leverages a Language Processing Unit (LPU) to surpass traditional GPUs in processing power, making it ideal for real-time AI tasks. This advancement represents a significant leap in AI hardware design.
Discover more about this breakthrough: Read the full article.
BABILong: Enhancing LLMs for Long Documents
The research introduces BABILong, a new benchmark aimed at overcoming the challenges LLMs face with lengthy documents, proposing recurrent memory enhancements for improved performance.
For more insights: Read the full article.
Stanford AI Achieves 90% Accuracy in Sex Identification from Brain Scans
Stanford's AI model, focusing on dynamic MRI scans, identifies sex with over 90% accuracy, analyzing key brain networks. This breakthrough could enhance personalized medicine for neuropsychiatric conditions.
Discover more: Full Article.
AI Daily Chronicle of AI Innovations - February 19th, 2024
- 🚀 NVIDIA's new dataset sharpens LLMs in math
- 🌟 Apple is working on AI updates to Spotlight and Xcode
- 🤖 Google open-sources Magika, its AI-powered file-type identifier
- 🤖 OpenAI in talks to acquire Nvidia competitor
- 💰 SoftBank to build a $100B AI chip venture
- 💸 Reddit has a new AI training deal to sell user content
- 🤷♀️ Air Canada chatbot promised a discount. Now the airline has to pay it.
NVIDIA's OpenMathInstruct-1 Boosts LLMs in Math
NVIDIA's OpenMathInstruct-1, a synthetic dataset with 1.8M problem-solution pairs, advances LLMs' mathematical capabilities. It outperforms existing models without using GPT-4, fostering open-source collaboration in AI research.
For more insights: Discover More
Apple Enhances Spotlight and Xcode with AI
Apple is integrating generative AI into Xcode and exploring AI-driven features for consumer apps like Apple Music and Spotlight, signaling a strategic move towards more AI-centric offerings.
Learn more: Full Article
Google Open-Sources Magika for File-Type Identification
Google's Magika, an AI-powered tool for identifying file types, offers enhanced accuracy and speed. It significantly improves upon traditional methods, particularly for textual files, boosting security across Google's services.
Read more about Magika's impact: Full Article
AI Daily Chronicle of AI Innovations - February 10 to February 17th, 2024
- 📊 DeepSeekMath: The key to mathematical LLMs
- 💻 localllm enables GenAI app development without GPUs
- 📱 IBM researchers show how GenAI can tamper calls
- 🔍 More Agents = More Performance: Tencent Research
- 🎥 Google DeepMind’s MC-ViT understands long-context video
- 🎙 ElevenLabs lets you turn your voice into passive income
- 💻 Nvidia launches offline AI chatbot trainable on local data
- 🧠 ChatGPT can now remember conversations
- 🌐 Cohere launches open-source LLM in 101 languages
- 🎥 Apple’s Keyframer: A text-to-anime AI using GPT-4
- 🖼️ Stability AI introduced Stable Cascade: A text-to-image model
- 🛡️ OpenAI disrupted the activities of 5 state-affiliated threat actors
- 🚀 OpenAI launches Sora, a text-to-video model
- 🌟 Google announces Gemini 1.5 with 1 million tokens!
- 🤖 Meta’s V-JEPA: A step toward advanced machine intelligence
DeepSeekMath: The Key to Mathematical LLMs
In its latest research paper, DeepSeek AI has unveiled DeepSeekMath 7B, a specialized AI model aimed at boosting mathematical reasoning in open-source Large Language Models (LLMs). This model, pre-trained on an extensive dataset of 120 billion tokens derived from math-centric web content, leverages reinforcement learning techniques specifically designed for tackling mathematical challenges.
When subjected to rigorous testing on key benchmarks in both English and Chinese, DeepSeekMath 7B demonstrated superior performance over existing open-source models focused on mathematical reasoning, closely rivaling even proprietary models like GPT-4 and Gemini Ultra.
For more details, visit the full article on DeepSeek AI's Research.
localllm: Enabling GenAI App Development without GPUs
Google has launched localllm, an open-source tool designed to run Large Language Models (LLMs) locally on CPUs, particularly within Cloud Workstations, eliminating the need for GPU resources. This tool leverages "quantized" LLMs from HuggingFace, optimized for efficient operation on less powerful devices.
IBM Researchers Uncover GenAI's Potential to Tamper Calls
localllm's ability to run LLMs on CPU and memory significantly boosts productivity and cost efficiency, enabling developers to incorporate advanced LLMs into their projects without the complexities of GPU management or dependence on external services.
IBM's latest experiment reveals a concerning possibility of using GenAI to manipulate live phone calls. Researchers developed a man-in-the-middle tool to intercept and audio-jack conversations, altering spoken content like bank account numbers with AI-generated fakes, seamlessly blending them into the call without detection.
Source: IBM's Audio-Jacking Experiment
“More Agents = More Performance” - The Tencent Research Team
The Tencent Research Team suggests enhancing language model performance by increasing the number of agents. Using a "sampling-and-voting" approach, multiple agents analyze the input, and the most common result is selected. This method shows that even smaller models can surpass larger ones by expanding the agent ensemble.
Source: Read More
ElevenLabs: Monetize Your Voice
ElevenLabs offers an innovative AI voice cloning model, enabling users to generate passive income through their "Voice Actor Payouts" program. By uploading a sample, users can create and share a professional voice clone, earning rewards when it's used.
Source: Learn More
NVIDIA's Chat with RTX: Offline AI Chatbot
NVIDIA introduces Chat with RTX, enabling the creation of personalized AI chatbots using local data on PCs with GeForce RTX GPUs. This tool allows users to integrate their digital content into responsive chatbot interactions.
Source: Discover More
ChatGPT Enhances with Memory Feature
OpenAI introduces a memory feature for ChatGPT, enabling it to recall past conversations for more personalized interactions. This capability allows ChatGPT to adapt and provide relevant suggestions over time, enhancing user experience without repeating information.
Source: Read More
Cohere Unveils Aya: A Multilingual Leap in AI
Cohere's Aya, a groundbreaking open-source LLM, supports 101 languages, doubling the reach of existing models. It leverages extensive datasets to enhance AI accessibility for diverse cultures, significantly outperforming other multilingual models and emphasizing the importance of linguistic diversity.
For more insights: Discover Aya's Impact
Meta AI introduces V-JEPA
Meta AI has unveiled V-JEPA (Video Joint Embedding Predictive Architecture), a novel approach for imparting machines with an understanding of the physical world through video observation. A suite of V-JEPA vision models, honed through self-supervised learning with a feature prediction objective, has been released. These models possess the capability to comprehend and anticipate video content, even when faced with sparse information. Details | GitHub
Open AI introduces Sora
Open AI has launched Sora, a cutting-edge text-to-video model capable of crafting videos up to 60 seconds in length. These videos can showcase intricate scenes, elaborate camera movements, and multiple characters exhibiting a range of emotions. Details + sample videos | Report
Google announces Gemini 1.5
Google has announced the advent of Gemini 1.5, its next-gen model employing a novel Mixture-of-Experts (MoE) architecture. The inaugural model, Gemini 1.5 Pro, features an unprecedented context window of up to 1 million tokens, setting a new benchmark for large-scale foundation models. Gemini 1.5 Pro is adept at executing complex understanding and reasoning tasks across various modalities, including video, matching the performance level of its predecessor, 1.0 Ultra. Details | Tech Report
Reka Introduces Reka Flash and Reka Edge
Reka introduced Reka Flash, a new 21B multimodal and multilingual model trained entirely from scratch. Reka also presents Reka Edge, a smaller and more efficient 7B model. Both are available in Reka Playground. Details.
Cohere For AI Releases Aya
Cohere For AI released Aya, a massively multilingual LLM & dataset to support under-represented languages. Aya outperforms existing models, covering 101 languages. Details.
BAAI Releases Bunny
BAAI released Bunny, a family of multimodal models. The Bunny-3B model, built upon SigLIP and Phi-2, outperforms similar-sized MLLMs and achieves performance on par with LLaVA-13B. Details.
Amazon Introduces BASE TTS
Amazon introduced BASE TTS, the largest TTS model trained on 100K hours of public domain speech data, exhibiting emergent qualities for natural speech. Details.
Stability AI Releases Stable Cascade
Stability AI released Stable Cascade, a new text to image model with a three-stage architecture, easy to train and finetune on consumer hardware. Details.
UC Berkeley Releases LWM
UC Berkeley released the Large World Model (LWM), a versatile model capable of understanding and generating language, images, and video. LWM can accurately retrieve facts across extensive contexts and even comprehend lengthy YouTube videos. Details.
GitHub Accelerator Program Applications Open
GitHub has opened applications for its next Accelerator program cohort, focusing on AI-based solutions under an open-source license. Details.
NVIDIA Introduces Chat with RTX
NVIDIA's Chat with RTX is a local AI assistant for Windows PCs equipped with specific NVIDIA GPUs, integrating with your file system for enhanced document and video interactions. Details.
Open AI Tests Memory with ChatGPT
Open AI is enhancing ChatGPT with a memory feature, allowing it to recall information from across all chats, improving user interaction continuity. This feature is currently in a limited rollout. Details.
BCG X Releases AgentKit
BCG X has launched AgentKit, a LangChain-based starter kit for building constrained agent applications, offering new possibilities for app developers. Details.