TDWI | Training & Research | Business Intelligence, Analytics, Big Data, Data Warehousing

Think
- Research & Resources
  - TDWI Playbook | Next Generation Data Science: The AI-Driven Data Science Life Cycle
  - TDWI Data Points | The Data Foundation for AI
  - TDWI Best Practices Report | Data Strategies and Foundations for Modern Data Management
  - TDWI Insight Accelerator | Adopting a Platform Approach for Gaining Insights from Unstructured Data
- Webinars
  - Data Integration for AI: Overcoming Modern Pipeline Challenges July 23, 2025
  - From Silos to Insights: Centralizing Data to Drive AI July 24, 2025
  - Expert Panel: Leveraging AI-Powered Solutions for Data Management July 28, 2025
  - A Generative AI Framework for Credit and Financial Markets July 29, 2025
- Virtual Summits
  - Virtual Events Keys to Making Your Data AI Ready September 10, 2025
  - Virtual Events Data Quality for BI, Analytics and AI October 22, 2025
  - Virtual Events Modern Data Strategy November 12, 2025
  - Virtual Events What’s Ahead in 2026 for Data & Analytics December 10, 2025
- By Topic
  - By Topic
    
    Explore the Latest AI, Analytics, and Data Research and Training by Topic
  - BI, Analytics, and Data Literacy
  - AI, Data Science, and Machine Learning
  - Data Management and Governance
  - Platforms and Architecture
  - Strategy and Methods
- Speaking of Data Podcast
  
  Current Research Surveys
Train
- In-Person Events
  - Conference TDWI Transform 2025 San Diego August 18, 2025
  - Executive Summit TDWI Modern Data Leader's Summit San Diego: AI in the Enterprise August 18, 2025
  - Executive Summit AI Accelerate 2025, Brought to You by AI Boadroom & TDWI August 18, 2025
  - Conference TDWI Transform 2025 Orlando November 16, 2025
- Virtual Live Seminars
  - TDWI Data Governance Principles and Practices: Managing Data as an Asset June 25, 2025
  - Building Your Company’s Data Governance Roadmap June 25, 2025
  - Data Governance: Driving Engagement and Organizational Change June 26, 2025
  - A Framework for Modern Data Governance June 25, 2025
- Online Learning
- By Topic
  - By Topic
    
    Explore the Latest AI, Analytics, and Data Research and Training by Topic
  - BI, Analytics, and Data Literacy
  - AI, Data Science, and Machine Learning
  - Data Management and Governance
  - Platforms and Architecture
  - Strategy and Methods
- Train Your TeamCustom solutions for training your team
  
  Get CertifiedEarn a professional credential in BI and Analytics, Data Governance, or AI
  
  TDWI MembershipExclusive access to the research, tools, training, and connections
Engage
- Connect
  - Connect and Contribute to Our Vibrant Community of Data Leaders
    
    Subscribe to TDWI Stay up to date on the latest news and events. Sign Up
    
    Become a TDWI Member Gain exclusive access to the research, tools, training, and connections to move your careers, teams, and projects forward. Learn More
    
    Become a Part of the TDWI Research Panel Make a difference in the data and analytics industry and earn incentives by sharing your insights with TDWI. Explore Now
    
    Speak at TDWI Events Share your expertise and build your personal brand as a speaker at a TDWI In-Person or Virtual Event. Submit a Proposal
    
    Become a TDWI Research Fellow Apply to be a member of TDWI’s industry leading research team. Apply Today
    
    Become a Member of the Data & AI Leaders Forum Engage in collaborative discussions, stay ahead of the curve, and stay in the know. Apply Now
    
    Showcase Your Data & AI Solutions Reach and engage with TDWI community through multi-channel marketing programs. Learn More

TDWI Articles

The Future of Machine Learning: Models as APIs

Creating a successful machine learning model is extremely difficult and expensive. Providing the model as a service to other businesses may recoup part of the cost.

By Aleksandras Šulženko
July 19, 2021

Making the World Better, One Small Model at a Time

After some time in the field of machine learning, nearly everyone realizes that accurate models are terribly hard to make. As we shall see, creating APIs out of these models may be the natural conclusion of business involvement in machine learning because it creates additional sources of revenue.

For Further Reading:

3 Signs of a Good AI Model

Don’t Forget the Back End of the Machine Learning Process

Tackling Bias and Explainability in Automated Machine Learning

Even relatively simple classification models can take a substantial amount of time to produce if there are no publicly available labeled data sets. Acquiring the data, labeling the entities, and feeding it to the machine model is just a small part of the process. Additionally, most models are not highly accurate for a significant amount of time due to concept drift.

Building more complex models that classify and predict set values by using regression is out of the question for most businesses. Such models might only be within the scope of those most dedicated to machine learning. For the majority of businesses, creating models that can understand human languages will be completely out of the question due to the high resource costs.

Natural Partitioning of Models

The natural partitioning process is a response to the high costs arising from machine learning model production. Two of the primary drawbacks of all current machine learning models are that they are data-inefficient (requiring considerable amounts of data) and fragile (suitable for a very narrow range of tasks). Thus, IT-based businesses are often forced to restrict their machine learning ambitions by tackling smaller, more pressing tasks.

I’ve worked on a project that fell into the trap of ambition. We were attempting to improve public data acquisition through machine learning but had to reduce our target to parsing due to the insurmountable amount of data required for other parts of the process. Then we had to reduce our goal to a specific type of page. After some time, we finally restricted the type of website as well. All this led to a working model, but we quickly realized that we either needed to expand our team significantly or change our goals. The latter seemed to make more business sense.

I think most businesses will go through much of the same. At the current state of the industry, businesses not focused on purely ML applications will probably develop only small-scale solutions that solve key challenges in a particular field. However, these smaller solutions can still have widespread use, they just need to be shared.

Models into APIs

The idea is already out there. Open AI finished the development of GPT-3 (a complex machine learning model that can understand and work with any language task, including creating perfectly readable long-form content) and created an API for access to the model.

Open AI is one of the largest machine learning and artificial intelligence research companies, so it’s not surprising that they would be able to create something universally useful. Turning such a machine learning model into an API is an obvious conclusion, but how about smaller models? Is there good reason to turn them into APIs?

Most businesses indirectly apply the Pareto principle to machine learning model development. Due to the inherent difficulty, finding small, common, high-maintenance problems to solve with machine learning becomes the natural route as resource constraints simply cannot allow for more leeway.

This means that not only is the development of machine learning models naturally partitioned among businesses due to complexity, but the models target recurring industry problems that may be of use to other businesses.

APIs into Solutions

Luckily, to integrate machine learning models into regular day-to-day business processes we don’t need to have some Skynet-level AI take care of every other process. Machine learning model integration can happen incrementally. Of course, over time the incremental approach will lead to many moving parts made from machine learning models. These models might not seem very useful because they will be solving niche problems for specific industries.

However, I think we would be making a grievous mistake assuming that any reasonably accurate model has no scientific or business application. No matter how small the model is, training it is still resource intensive.

For example, to get to a decent level of accuracy in supervised learning, thousands (or tens of thousands) of labeled data points are needed. These labels will usually have to be assigned manually by in-house teams or by contracting outside help. Going through thousands or tens of thousands of identical data points is bound to become boring and painful. Either way, there is a significant cost assigned to the acquisition of data.

Machine learning woes don’t end with training the model. Even assuming that no errors were made in training and everything works perfectly, eventually we will experience concept drift. To continue making accurate predictions as data changes over time, the model should change as well. Concept drift means that all models must be constantly maintained in a continual process instead of being a solution with a clear endpoint.

Therefore, we should always be looking at the possibility of an API. Most machine learning applications will solve small but common and painful industry-related issues, but these solutions come at a significant price. Recouping a part of the price is possible by providing the model as a service to other businesses.

Conclusion

We need to accept the fact that machine learning models will, in most cases, not be “done and dusted.” At some point, updating and maintaining the models will become a necessity. Thus, the end goal should be to develop machine learning models and turn them into APIs as a long-term service. Whether they should be priced or free is up to the businesses producing them, but keeping them private is simply restricting progress in nearly every field of business and science.

About the Author

Aleksandras Šulženko has been at the forefront of today’s fast changing web-data-gathering trends. He started as an account manager, overseeing the daily operations and challenges of the world’s biggest data-driven brands, and this experience inspired him to shift his career path towards product development, with a focus on creating the most innovative, ethical, and secure web data collection services. You can reach the author via LInkedIn.

TDWI Membership

Accelerate Your Projects,
and Your Career

TDWI Members have access to exclusive research reports, publications, communities and training.

Individual, Student, and Team memberships available.

↑

TDWI | Training & Research | Business Intelligence, Analytics, Big Data, Data Warehousing

Research & Resources

Webinars

Virtual Summits

By Topic

In-Person Events

Virtual Live Seminars

Online Learning

By Topic

Connect and Contribute to Our Vibrant Community of Data Leaders

TDWI Articles

The Future of Machine Learning: Models as APIs

Related Articles

Trending Articles

Breaking Barriers in Conversational BI/AI with a Semantic Layer

AI in 2025: Key Considerations for Technology Leaders

The Tech Blanket: Building a Seamless Tech Ecosystem

What’s Ahead in Generative AI in 2025? (Part Two)

TDWI Membership

Accelerate Your Projects,
and Your Career

TDWI

Engage

Research

Research & Resources

Webinars

Virtual Summits

By Topic

In-Person Events

Virtual Live Seminars

Online Learning

By Topic

Connect and Contribute to Our Vibrant Community of Data Leaders

TDWI Articles

The Future of Machine Learning: Models as APIs

Related Articles

Trending Articles

Breaking Barriers in Conversational BI/AI with a Semantic Layer

AI in 2025: Key Considerations for Technology Leaders

The Tech Blanket: Building a Seamless Tech Ecosystem

What’s Ahead in Generative AI in 2025? (Part Two)

TDWI Membership

Accelerate Your Projects, and Your Career

TDWI

Engage

Research

Accelerate Your Projects,
and Your Career