TDWI | Training & Research | Business Intelligence, Analytics, Big Data, Data Warehousing

TDWI Articles

Event Data: The Root of All Analytics

What is event data, and what makes it unique and valuable for analytics?

By Michelle Wetzler
October 30, 2017

Analytics is reshaping computing, businesses, and many of our day-to-day activities. The fuel that drives the analytics engine is all around us, in everything we do. It's data, of course, but more precisely, it's event data.

Events are happening everywhere, all the time -- in apps, cars, appliances, servers, and even in our brains. With more devices connecting to the Internet, it's becoming easier to collect data from just about anywhere.

Let's take a closer look at event data and what makes it unique and valuable for your analytics. The easiest way to understand event data is by comparing it to another type of data. I've chosen entity data because it's familiar from use in databases and spreadsheets.

Entity Data

Entity data is stored in tables and can be associated with such elements as users, products, and accounts. Typically, a separate table is assigned for each type of entity, with columns that contain related properties. This allows a user to quickly look up information about any entity.

Also, in entity databases, data is normalized and rarely duplicated. For example, a table for accounts might contain attributes such as account name, type, and category. Because multiple users can be associated with the same account, user information wouldn't typically be stored in the accounts table. Instead, a key in each user record would link to its account.

A major drawback to this data model is that in order to analyze entities (for example, to sort employees by department name), you must pull in data from multiple tables. At large scale, these operations take time.
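
As a minimal sketch of the multi-table lookup described above, the following uses Python's built-in sqlite3 module with an in-memory database. The table names, column names, and sample rows are hypothetical and chosen only for illustration; they do not come from any particular system.

```python
import sqlite3


def build_example_db():
    """Create a tiny normalized database (hypothetical schema and rows)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE departments (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE employees (
            id INTEGER PRIMARY KEY,
            name TEXT,
            department_id INTEGER REFERENCES departments(id)
        );
        INSERT INTO departments VALUES (1, 'Sales'), (2, 'Engineering');
        INSERT INTO employees VALUES (1, 'Ana', 1), (2, 'Raj', 2), (3, 'Mei', 2);
        """
    )
    return conn


def employees_by_department(conn):
    """Return (employee, department) pairs sorted by department name.

    The department name lives in a separate table, so the query must join.
    """
    return conn.execute(
        """
        SELECT e.name, d.name
        FROM employees AS e
        JOIN departments AS d ON d.id = e.department_id
        ORDER BY d.name, e.name
        """
    ).fetchall()


rows = employees_by_department(build_example_db())
```

With three rows the join is instant; the cost the article refers to appears only when the joined tables grow large.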

Event Data

Event data doesn't just describe entities; it describes actions performed by entities (for example, "Publish a blog post"). Event data, sometimes called behavior data, contains three key pieces of information:

Action
Timestamp
State

The action is the thing that's happening (e.g., "publish"). The timestamp is self-explanatory. The state refers to all of the other relevant information we know about this event, including information about entities related to the event, such as the author and content management system associated with the blog post.
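
One way to picture these three pieces is as a single record. The sketch below is an assumption about how such a record could look in Python; the field names ("action", "timestamp", "state") mirror the terms above, while the make_event helper and the author and CMS values are hypothetical.

```python
from datetime import datetime, timezone


def make_event(action, state, timestamp=None):
    """Bundle an action, a timestamp, and the surrounding state into one record."""
    if timestamp is None:
        timestamp = datetime.now(timezone.utc)
    return {
        "action": action,
        "timestamp": timestamp.isoformat(),
        "state": state,
    }


# Hypothetical "publish a blog post" event; the state describes related entities.
event = make_event(
    "publish",
    {"author": {"name": "example-author"}, "cms": {"name": "example-cms"}},
    timestamp=datetime(2017, 10, 30, 12, 0, tzinfo=timezone.utc),
)
```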

Let's consider a more complex event: recording every player's "death" in an online video game. Typically, there are many ways the player can experience "death," such as falling from great heights, starvation, drowning, stumbling into lava, or being killed by a zombie.

To analyze the most common type of death, the age of the player at the time of "death," length of time played at the time of "death," the most lethal enemies, or any number of "death"-related questions, we can use a simple event data model with a few specific qualities:

The data is rich
The data is denormalized
The data is nested
The data is schemaless
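
To make the first of those questions concrete, here is a small sketch that finds the most common type of death from a list of events. The "cause" property and the sample events are hypothetical; a real event store would typically expose a query interface rather than a Python list.

```python
from collections import Counter


def most_common_cause(events):
    """Return (cause, count) for the most frequent death cause, or None."""
    causes = Counter(
        e["cause"] for e in events if e.get("action") == "death"
    )
    top = causes.most_common(1)
    return top[0] if top else None


# Hypothetical sample events; non-death events are ignored.
sample_events = [
    {"action": "death", "cause": "lava"},
    {"action": "death", "cause": "zombie"},
    {"action": "death", "cause": "zombie"},
    {"action": "login"},
]

top_cause = most_common_cause(sample_events)
```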

Event Data is Rich

Events can have hundreds of properties; they seek to describe not just one entity but all of the entities involved in an action. In the above example, we can add even more data, such as location of the death, game settings, and software version -- just to name a few.

Event Data is Denormalized

Unlike in a relational database, the same data is repeated again and again in an event database. User attributes, app version, or difficulty settings might be repeated on every single event even if they rarely change. This redundancy is necessary to capture a representation of the application state at the time of the event. In entity databases, when properties (e.g., player settings) are updated, the previous values are lost forever, but event databases can capture entity data at a point in time. To be clear, event databases are a great companion, not a replacement, for entity databases.
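
The difference between overwriting an entity and snapshotting it on each event can be sketched in a few lines. The player record, the record_event helper, and the property names below are hypothetical and stand in for whatever an application actually tracks.

```python
import copy


def record_event(events, action, player):
    """Append an event carrying a copy of the player's state at this moment."""
    # Deep-copy so later updates to the entity do not alter recorded events.
    events.append({"action": action, "player": copy.deepcopy(player)})


player = {"id": 7, "difficulty": "easy"}  # entity record: current state only
events = []

record_event(events, "death", player)
player["difficulty"] = "hard"  # in-place update; the entity forgets "easy"
record_event(events, "death", player)

# Each event still shows the difficulty that applied when it happened.
difficulty_history = [e["player"]["difficulty"] for e in events]
```

The entity record now holds only "hard", while the events retain both values, which is the point-in-time capture described above.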

Event Data is Nested

Because an event describes several entities, each with its own properties, most databases optimized for event data can store it using nested JSON. This is particularly helpful when data sets have many properties and entities to describe.
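
As an illustration of nesting, the sketch below parses a hypothetical death event from JSON and reads a nested property by a dotted path. The property names and the get_path helper are assumptions made for this example, not part of any specific event database.

```python
import json


def get_path(event, dotted_path, default=None):
    """Look up a nested property such as 'player.settings.difficulty'."""
    node = event
    for key in dotted_path.split("."):
        if not isinstance(node, dict) or key not in node:
            return default
        node = node[key]
    return node


# Hypothetical event: each related entity gets its own nested object.
raw = """
{
  "action": "death",
  "player": {"id": 7, "settings": {"difficulty": "hard"}},
  "enemy": {"type": "zombie"}
}
"""
event = json.loads(raw)

difficulty = get_path(event, "player.settings.difficulty")
enemy_level = get_path(event, "enemy.level")  # missing property
```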

Event Data is Schemaless

As mentioned earlier, event data can capture state at the time of an event, and different events carry different state. Starvation, drowning, or lava deaths, for example, don't involve an "enemy" but might have unique properties of their own, such as "lava temperature." In other words, the death events don't follow a strict schema. Event databases are designed to handle a multitude of arbitrary properties.
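
A short sketch shows what working without a strict schema can look like: events of the same type carry different properties, and analysis code only uses a property where it is present. The property names and numeric values are hypothetical.

```python
def property_names(events):
    """Collect every property name seen across a set of events."""
    names = set()
    for event in events:
        names.update(event)
    return names


def average(events, prop):
    """Average a numeric property over the events that actually have it."""
    values = [e[prop] for e in events if prop in e]
    return sum(values) / len(values) if values else None


# Hypothetical death events with differing properties.
deaths = [
    {"action": "death", "cause": "zombie", "enemy": "zombie"},
    {"action": "death", "cause": "lava", "lava_temperature": 900},
    {"action": "death", "cause": "lava", "lava_temperature": 1100},
]

seen = property_names(deaths)
avg_lava_temperature = average(deaths, "lava_temperature")
```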

Event Data at Scale

An online game can have millions of users, and for every user there are many actions. Because event data captures both state information and the history of actions that happen over time, its scale is massive compared to that of entity data. Fortunately, data storage is now affordable enough to support event databases.

Although entity data will always be a valuable asset for data science, without event data we wouldn't be able to perform analytics as we know it today.

About the Author

Michelle Wetzler is chief data scientist at Keen IO, which offers products that enable businesses to add analytics and data science features directly into their applications. She previously developed advanced IT architectures for Fortune 500 enterprises as a consultant with Accenture and has also taught imaging technology at the University of Illinois. You can contact the author on Twitter at @michellewetzler.
