AI Geekly: AI In Action
Reducing time from idea to execution
Welcome back to the AI Geekly, by Brodie Woods, brought to you by usurper.ai. This week we bring you yet another week of fast-paced AI developments packaged neatly in a 5-minute(ish) read.
TL;DR Glass Half-Full; Text-to-3DPrint; TikTok’s AI Masterclass
This week we take a look at a thought piece from the CEO of Anthropic (makers of the Claude family of models), in which he makes a series of predictions about the benefits of AI to society across several key vectors. This is notable: as the CEO of the company that produces arguably the best-performing AI model available to the public, he is uniquely positioned to share an educated perspective on the subject. Next, we take a look at a new application of Large Language Models (LLMs) paired with 3D meshes (3D modelling) that converts natural language to 3D models, enabling a very exciting pipeline once 3D printers are considered: idea > text > [LLM] > 3D model > [3D printer] > 3D object. This is a development that we have been eagerly looking forward to! Finally, we end with a case study of “AI done right” in a business/enterprise setting: TikTok has rolled out a suite of AI tools to advertisers with the potential to materially accelerate their workflows while also making advanced advertising tech available to a much broader customer base (top-line upside).
Machines of Loving Grace
Anthropic CEO’s views on the benefits of advanced AI
What it is: Anthropic CEO Dario Amodei recently published a thought-provoking essay outlining his optimistic vision for a future shaped by advanced AI. While we typically focus on weekly developments in the AI Geekly, Amodei's piece, published early last month, offers a broader perspective we thought was worth sharing, as we suspect many of our readers may not have come across it yet. In it, he addresses the potential upsides of powerful AI across five key areas: biology and health, neuroscience and mind, economic development and poverty, peace and governance, and work and meaning.
What it means: Amodei acknowledges the significant risks of advanced AI, yet, like us, he argues that these risks should not overshadow its transformative potential. He suggests we consider the concept of "marginal returns to intelligence," asking how a "country of geniuses in a datacenter" might accelerate progress in various fields, even given practical constraints. He proposes a thought experiment: if we could compress the next 50-100 years of scientific and technological advancements into 5-10 years, what might that look like? Amodei explores this question across the five areas mentioned above, offering a vision of a future where disease is largely eradicated, poverty is significantly reduced, and human well-being is dramatically enhanced.
Why it matters: Amodei's essay provides a refreshing counterpoint to the often-alarmist narratives around AI, inviting us to consider not just the potential downsides, but the immense opportunities that AI presents. It raises important questions about how we might shape a future where AI serves humanity's best interests, from accelerating scientific discovery and improving global health to fostering economic development and promoting democratic values. Amodei's vision is both ambitious and nuanced, and we encourage our readers to explore his full essay for a deeper dive into these important ideas.
AI Powered Star Trek Replicator
New Nvidia 3D modeler + 3D printing tech unlocks possibilities
What it is: Researchers from Tsinghua University and Nvidia have introduced LLaMA-Mesh, a novel technique that unifies 3D mesh generation (i.e. 3D models) with large language models (LLMs). By representing 3D mesh data as simple text, LLaMA-Mesh allows LLMs like the open-source Llama family of models (produced by Meta) to generate and interpret complex 3D models directly from natural language prompts. This eliminates the need for specialized 3D modeling software or expertise, opening up the possibility of creating intricate physical objects from simple text descriptions, much like the "replicators" often depicted in science fiction.
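To make "3D mesh data as simple text" concrete, here is a minimal sketch of how a mesh might be written out as text that an LLM can read or produce. It assumes an OBJ-style layout (`v` lines for vertices, `f` lines for faces) with coordinates rounded to integers to keep the text short. The function name `mesh_to_text` and the 64-bin quantization are illustrative assumptions, not details taken from the LLaMA-Mesh code.

```python
from typing import List, Tuple

Vertex = Tuple[float, float, float]
Face = Tuple[int, int, int]  # 1-based vertex indices, as in the OBJ format


def mesh_to_text(vertices: List[Vertex], faces: List[Face], bins: int = 64) -> str:
    """Serialize a triangle mesh as OBJ-style text with integer coordinates.

    Hypothetical helper for illustration; `bins` is an assumed setting.
    """
    # Find the overall coordinate range so values scale into [0, bins - 1].
    flat = [c for v in vertices for c in v]
    lo, hi = min(flat), max(flat)
    span = (hi - lo) or 1.0  # guard against a zero range (degenerate mesh)

    lines = []
    for x, y, z in vertices:
        # Quantize each coordinate to an integer bin.
        q = [round((c - lo) / span * (bins - 1)) for c in (x, y, z)]
        lines.append(f"v {q[0]} {q[1]} {q[2]}")
    for a, b, c in faces:
        lines.append(f"f {a} {b} {c}")
    return "\n".join(lines)


# A simple tetrahedron: four vertices, four triangular faces.
tetra_vertices = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
tetra_faces = [(1, 2, 3), (1, 2, 4), (1, 3, 4), (2, 3, 4)]

print(mesh_to_text(tetra_vertices, tetra_faces))
```

The printed result is eight short lines of text, four vertices and four faces. That is the kind of output a language model could plausibly generate token by token, and that a slicer could then turn into printer instructions.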
What it means: The publication of this paper (and the eventual code release later in November) is an important step toward making consumer 3D printers more useful and user-friendly. Currently, without dedicating dozens of hours to learning Computer-Aided Design (CAD) or 3D modeling software such as Autodesk's tools or Blender, owners of home 3D printers are limited to printing models made available by others online. LLaMA-Mesh simplifies the traditionally complex process of 3D modeling, making it accessible to a wider audience. Combined with readily available and increasingly affordable 3D printing technology, like that offered by Bambu Lab and others, this technology opens up many possibilities and contributes to the democratization of manufacturing.
Why it matters: LLaMA-Mesh’s ability to generate 3D models through the familiar text-based interface of LLMs opens the door to innovation across various applications. The open-source nature of the underlying technology further accelerates this potential, fostering collaboration and empowering a broader community of creators to explore the possibilities of this futuristic technology. It quite literally allows anyone with the right equipment to take any idea they can express verbally and transform it into something tangible in the physical world: if you can think it, you can make it.
AI Done Right Case Study: The AI Easy Button
TikTok introduces AI-powered toolkit for advertisers
What it is: Readers of the Geekly will recall our words of caution regarding the use of AI models produced under the jurisdiction of the PRC. Despite these reservations, we wanted to highlight an impressive deployment of AI capabilities by TikTok in an enterprise setting via the roll-out this week of its Symphony Creative Studio. The studio is a comprehensive suite of AI-powered tools designed to streamline and enhance the ad creative production process. Symphony Creative Studio offers a range of features including automated video generation from product information or URLs, integration of AI-powered digital avatars, multilingual translation and dubbing capabilities, and AI-enhanced video editing. The platform leverages licensed assets from partners like Getty Images and Billo, ensuring all content is cleared for commercial use, a must-have for enterprise.
What it means: The company has wisely decided to leverage its AI chops to make it easier for its number one revenue source (advertisers) to use its tools. Reducing the friction for advertisers even further (no need to retain video, voice, or influencer talent, translators, or video editors, or to license images, music, or sound) makes it incredibly simple for anyone to advertise on the company’s platform. By automating and simplifying various aspects of video production, Symphony Creative Studio lowers the barrier to entry for advertisers, including those with limited resources.
Why it matters: This move by TikTok underscores the growing importance of AI in content creation and its potential to transform the advertising landscape. The company understands that one source of value from AI comes from making its platform accessible to a broader market by reducing barriers to entry. Beyond scaling businesses' creative output via AI, the tool also offers valuable data-driven insights and best practices, derived from TikTok's vast user base, to optimize ad performance. This integrated approach, combining AI-powered tools with platform-specific knowledge, makes for a compelling case study of “AI done right” in a business/enterprise setting.
About the Author: Brodie Woods
As CEO of usurper.ai and with over 18 years of capital markets experience as a publishing equities analyst, an investment banker, a CTO, and an AI Strategist at leading North American banks and boutiques, I bring a unique perspective to the AI Geekly. This viewpoint is informed by participation in two decades of capital market cycles from the front lines; publication of in-depth research for institutional audiences based on proprietary financial models; execution of hundreds of M&A and financing transactions; leadership roles in planning, implementing, and maintaining the tech stack for a broker dealer; and, most recently, heading the AI strategy for the Capital Markets division of the eighth-largest commercial bank in North America.