Sunday, December 8, 2024

AI: Hope is Not a Strategy

Circumspection about knowledge worker or business leader claims of AI boosts to productivity is a good idea. After all, subjective feelings and decision-maker self-interest are not the same as objective outcomes. To borrow an analogy, hope is not a strategy.


We might easily tend to believe AI makes us more productive. Whether that is the case, or whether we can actually measure such changes is the issue.


A KPMG study suggests technology, telecom and media firms are already seeing, or expecting, return on investment from artificial intelligence spending. In fact, survey respondents reported hefty revenue growth, with 38 percent of respondents suggesting AI already drove more than 10 percent of total entity revenue. 


source: KPMG


In my experience, most enterprise software buyers are quite skeptical of such claims, if intrigued. And few respondents who profess such outcomes are equally able to quantify the outcomes, when asked to do so. 


“We think this outcome can be attributed to a specific input” is one matter. Actual proof is something else altogether. And there is no shortage of reasons for respondents to make such claims. Leaders always are eventually required to justify investments, costs of those investments and attributable financial outcomes.


As the old adage goes, “nobody ever got fired for recommending we buy from IBM.” In other words, bad choices can be job-ending moves. 


As always, the assumptions are key. There is a difference between stakeholder expectations about how AI can contribute, and the actual outcomes. We expect positive outcomes or would not make the investments. But outcomes and productivity are notoriously difficult to quantify for any sort of knowledge or office work.


Neither is it easy to quantify the specific outcomes enabled by any single change a firm makes, when multiple inputs--all dynamic--might be involved. 


For example, the “Redefining TMT with AI” report talks about the benefits of AI-driven predictive network analysis, in tandem with robotic process automation to  enhance network operations and quality

of service. 


In a strict sense, there are two independent variables here: process automation and use of AI. Beyond that, KPMG consultants note that the use case involves automated scripts and algorithms; predictive models; fault prediction; alarm handling; trouble-ticket management;, configuration management; 

 customized network-level reports and workflow management. 


As many of you know, such process automation already is a feature of many operational support systems. AI should help, of course. 


But it might be hard to quantify the degree of impact. Still, the point is that ROI is created by a reduction in volume of alarms, faults and tickets; improved Mean Time to Repair and reduced downtime. I cannot think of a single OSS platform or system that fails to mention those outcomes as benefits. 


In the video content industry, the report suggests AI produces ROI by affecting the efficiency and accuracy of video dubbing (language translation) and synchronizing the dubbed dialogue with the onscreen actors’ vocal movements. The ROI then is produced by reduced production time and cost. 


Also, AI is used to “for understanding and translating complex scripts and  while supporting real-time lip-sync. Basically, in this use case AI aids the dubbing process. 


AI also is used to speed up software coding, so the ROI is based on faster development cycles, faster debugging, code quality and developer productivity. 


The issue is not AI and its ability to improve all those processes and use cases. That indeed is the attraction. Instead, the issue is that it is hard to isolate the AI contributions from the other value created by the processes AI enhances. 


So among tne issues we are going to confront when trying to assess the value of artificial intelligence as a driver of value--and hence revenue--is that AI already is embedded as a feature into many products and experiences. 


When AI enhances shopping, for example, it might not be possible to precisely quantify the costs and benefits. 


Likewise, it will be hard to quantify the incremental revenue gain from AI that enhances existing communication  or entertainment experiences primarily by intensifying personalization. In that sense, many consumers will encounter generative AI, for example, mostly passively in their shopping or content experiences. 


source: Bain 


Businesses already have AI-enhanced software and platforms supporting customer service, predictive maintenance, supply chains, security and fraud prevention; human resources; inventory management; risk assessment; marketing and process control. 


So it will not be easy to identify or quantify the impact of AI if we are forced to conduct such evaluations. For consumers, the process is mostly informal. We evaluate the merits of buying a new smartphone and might consider the features supposedly enabled by AI. As a practical matter, though, that is more likely to be viewed as “better pictures” or “longer-lasting battery” than AI as such. 


Consumer Applications

Business Applications

Virtual Assistants (e.g., Siri, Alexa)

Customer Service Chatbots

Personalized Content Recommendations

Predictive Maintenance

Smart Home Devices

Supply Chain Optimization

Photo and Video Editing Apps

Fraud Detection

Navigation and Maps

Automated Accounting

Language Translation Apps

Recruitment and Hiring

Fitness and Health Tracking

Market Analysis and Forecasting

Voice-to-Text Transcription

Cybersecurity Threat Detection

Personalized Shopping Recommendations

Process Automation (RPA)

Smart Email Categorization

Quality Control in Manufacturing

Face Recognition for Device Unlocking

Personalized Marketing Campaigns

AI-Enhanced Mobile Photography

Inventory Management

Smart Keyboards with Predictive Text

Risk Assessment in Finance

Voice-Controlled Smart Appliances

Energy Management in Buildings

Personalized News Feeds

Automated Customer Segmentation


For users of business software and platforms, AI is more likely to play a role in creating the desired “single pane of glass” interface, for example--improving existing functions--than it is to create some huge new capability, at least in the near term. 


That would seem to make sense, as it seemingly always does. It’s arguably easy to grasp how a new technology makes a current solution better than to envision an entirely-new application.  


Saturday, December 7, 2024

Gencast Shows AI Ability to Forecast Highly-Chaotic Systems (Weather)

Gencast, Google Deepmind’s weather forecasting tool, is said to produce 15-day weather forecasts better than the existing standard, the ENS ensemble forecast of the European Centre for Medium-Range Weather Forecasts. ECMRW forecasts also can forecast up to 15 days.


But GenCast outperforms; in several ways. GenCast is said to have outperformed outperformed ECMWF's system in 97.2 percent of forecasting scenarios tested, with a 99.8 percent accuracy rate for forecasts beyond 36 hours. Most models can forecast up to about a week in advance.

GenCast also demonstrated more precise predictions of tropical cyclone tracks, offering an average of 12 additional hours of advance notice, according to Alphabet.


GenCast also can produce a complete 15-day forecast (using more than 80 surface and atmospheric variables) in eight minutes. in eight minutes, using a single Google Cloud Tensor processing unit v5, while traditional models like ENS typically require several hours.


Weather forecasts are difficult due to the highly chaotic nature of atmospheric systems.The slightest discrepancy in initial measurements can cause weather forecasts to diverge from actual weather over time. 


Some will note that chaotic systems are inherently nonlinear. Temperature, pressure, and humidity interact in ways that are not directly proportional or linear. So the system’s behavior cannot be accurately predicted by simply adding or scaling inputs. The variables interact in complex, non-additive ways.


This chaos means that beyond a certain time horizon, weather becomes inherently unpredictable, regardless of the quality of data or models used. Up to this point, the “gold standard” has been predictions generally correct for as much as a week. 


Artificial intelligence, especially deep learning, excels at detecting complex patterns in large datasets, which is why it helps with weather forecasts.


If Generative AI is "Winner Take All," That Dictates Investment Bets

Contestants in the generative artificial intelligence model business are following a “winner takes all” approach to the market, investing at a pace that gets criticized by financial analysts (and rightly so, in some respects) for exceeding what seems immediate and tangible financial payback.


But the “winner takes all” strategy has worked often in the internet era, across many different segments of the market. So we already can predict some outcomes. 


“Wild” levels of spending are going to pay off for at least one and perhaps two providers of GenAI models, in terms of leadership of the market (ecosystems built on them). Some of the contestants will eventually “break even” for their investors, neither gaining or losing equity value. 


But most will fail, losing their investors most to all of the invested capital. That is simply what has happened in winner-take-all markets. 


As we saw early on, even pre-internet, for operating systems, and later saws for search engines, mobile operating systems, e-commerce, social media, ridesharing, peer-to-peer lodging and other app segments, market leadership is highly concentrated with one or sometimes two providers dominating. 


So the companies developing GenAI frontier models, such as OpenAI, Google DeepMind, and Anthropic, are assuming the market  will eventually shake out with a “winner takes all” structure, with market leadership highly concentrated and creating a new ecosystem of value around the leading models, with the hundreds of other would-be contestants relegated to history.


Market/Product Category

Dominant Players

Search Engines

Google

Social Media

Facebook, Instagram, TikTok

E-commerce

Amazon

Mobile Operating Systems

Apple iOS, Google Android

Desktop Operating Systems

Microsoft Windows, macOS

Web Browsers

Google Chrome, Safari

Streaming Video

Netflix, YouTube

Cloud Computing

Amazon AWS, Microsoft Azure, Google Cloud

Online Advertising

Google, Meta (Facebook)

Smartphones

Apple (iPhone)

Productivity Software

Microsoft Office, Google Workspace

Mobile Payment Systems

Apple Pay, Google Pay, PayPal

Digital Maps

Google Maps

Food Delivery Apps

DoorDash, Uber Eats

Ride-Hailing Apps

Uber, Lyft (US), Didi (China)

Video Conferencing

Zoom, Microsoft Teams


By way of contrast, end users of GenAI--especially enterprises--will have to answer for their choices, as they would for any other capital investments made to support their businesses. And that goes for suppliers fo hardware and software incorporating AI functionality. 


GenAI model leadership will undoubtedly shake out into a “winner takes all” pattern. But that also means hardware or software suppliers might “bet on the wrong horse,” as might their customers. That generally means high danger for model suppliers and those firms that are direct parts of those ecosystems. 


Hardware and software participants might face moderate to high impact if they are in the “wrong” ecosystem or platform. End users--whether consumers or businesses--might face low repercussions if their chosen GenAI supplier does not emerge as the eventual market leader (either provider one or two). 


Basic or firm-specific functionality might not be the big problem, as niche solutions might be perfectly viable, especially if a model emerges as providing superior industry-specific functionality. 


Value Chain Segment

Example Participants

Degree of Danger if Supplier/Partner is Not a Winner

Reason for Risk

Foundation Model Providers

OpenAI, Google DeepMind, Anthropic, Meta

High

If the chosen provider loses, end-users face high costs in switching models due to integration dependencies.

Model Fine-Tuners & Adaptation

Hugging Face, Scale AI, Stability AI

High

Fine-tuning investments and adaptations may not be transferrable to new, winning models.

Hardware Providers

Nvidia, AMD, Intel, custom chip startups

Moderate to High

Dependence on specialized chips may limit flexibility; non-dominant players may face R&D cutbacks.

AI Infrastructure Platforms

AWS, Microsoft Azure, Google Cloud

High

Non-winner platforms may lack interoperability, leading to high switching costs and stranded data assets.

Middleware Providers

Databricks, Snowflake, API integration services

Moderate to High

Middleware built for a specific provider may need reconfiguration, reducing compatibility with other models.

Application & Service Developers

Jasper, ChatGPT plugins, custom AI tools

Moderate to High

Apps tightly integrated with a non-winning model may struggle with compatibility and require costly rewrites.

Consulting & Implementation

Deloitte, Accenture, AI-specific consultancies

Moderate

Models selected may lose support, limiting longevity of solutions delivered to clients.

AI Tooling & DevOps

MLOps tools, AI pipeline solutions (e.g., Weights & Biases)

Moderate

DevOps tools tied to a specific model may lose relevance if clients pivot to dominant providers.

Enterprise & Custom Solutions

SAP AI, Oracle AI, Salesforce Einstein

Moderate

Custom solutions based on non-winning models may have compatibility and feature limitations.

Data Providers

Web-scraping firms, domain-specific datasets

Low to Moderate

Dependence on any one model is less critical, though dataset compatibilities may shift.

Consumer-facing Applications

Grammarly, Canva, AI-powered image and video tools

Low to Moderate

User-facing tools may need updates for compatibility but can adapt faster than core infrastructure layers.

End Users & Businesses

Large enterprises, SMEs, individual users

Low to Moderate

Users may face inconvenience, but model-agnostic interfaces could minimize direct switching costs.


Still, longer term, the specific GenAI ecosystem should matter, as network effects should eventually emerge.


Linear TV Enters Early Stages of "Harvesting"

Since all observers agree the linear TV (“live TV”) subscription business is dwindling, we should expect consolidation of service providers,...