event-sourcing-demo

Purpose

This event-sourcing-demo exists to help me understand the concepts of event sourcing. I don’t intend it for use in real applications.

System Behavior

Here: system-behavior

System Structure

Here: system-structure

Demo Scope

First, build a simple in-memory event store that supports the basic use cases outlined in the links above. This provides a foundation for everything that follows.
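As a rough starting point, the in-memory store could look something like the sketch below. It is written in Python purely for illustration. The names `Event`, `InMemoryEventStore`, `append`, and `query`, as well as the fields on `Event` and the bank-account sample data, are assumptions made for this sketch and not part of this repository.

```python
from dataclasses import dataclass, field
from typing import Any, Callable


@dataclass(frozen=True)
class Event:
    """Hypothetical event shape; the field names are assumptions."""
    type: str
    entity_id: str
    data: dict[str, Any] = field(default_factory=dict)


class InMemoryEventStore:
    """Append-only list of events held in process memory."""

    def __init__(self) -> None:
        self._events: list[Event] = []

    def append(self, event: Event) -> int:
        """Record an event and return its zero-based position in the stream."""
        self._events.append(event)
        return len(self._events) - 1

    def query(self, predicate: Callable[[Event], bool] = lambda e: True) -> list[Event]:
        """Return the matching events in the order they were recorded."""
        return [e for e in self._events if predicate(e)]


# Usage: record three events, then read back the history of one entity.
store = InMemoryEventStore()
store.append(Event("AccountOpened", "acct-1", {"owner": "alice"}))
store.append(Event("MoneyDeposited", "acct-1", {"amount": 50}))
store.append(Event("AccountOpened", "acct-2", {"owner": "bob"}))

acct1_events = store.query(lambda e: e.entity_id == "acct-1")
print([e.type for e in acct1_events])
```

Taking a predicate in `query` keeps the store ignorant of any particular filter, so audit-style lookups (by user, IP address, or business entity) could reuse the same method.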

Next, investigate each of the following concerns.

Audit: how to list the events related to a user, from a specific IP address, or regarding a specific business entity.
Access control:
1. how to define access control lists on events
2. how to restrict a list to only events the current user can see
3. how to verify the current user is allowed to issue a particular command
  - functional access: whether the user is allowed to issue this command in the first place
  - data access: whether the user can issue this command for this particular business entity
Reporting: how to project the data for consumption by a traditional report engine
Data migration: how to migrate data from an existing CRUD application
Scale: explore tools for larger systems
1. Many event types, many commands, many relationships between events
2. Hundreds, thousands, millions, billions of events
Queries:
1. How to populate lists
2. How to show reports
Event handling:
1. configurable set of commands to raise in response to events
2. ability to issue a command for events on only certain objects, and not for other objects
Event routing
Extensibility
1. Feature flags: enable or disable particular event handlers
2. Customer-specific code
  - custom event handler
  - custom UI
Verifying event data against a JSON schema
Related events
Building current state of an entity from the history of events for that entity
Populating secondary repositories from the event stream
- Database
- Search
Checking business rules
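Of the concerns listed above, building the current state of an entity from its history is the easiest to illustrate in a few lines: it is a left fold over that entity's events. The sketch below assumes a hypothetical bank-account domain. The event types, `AccountState`, `apply_event`, and `current_state` are all invented for the example.

```python
from dataclasses import dataclass
from functools import reduce


@dataclass(frozen=True)
class Event:
    """Hypothetical event; the fields are assumptions for this sketch."""
    type: str
    entity_id: str
    amount: int = 0


@dataclass(frozen=True)
class AccountState:
    """State derived from events; never stored directly."""
    is_open: bool = False
    balance: int = 0


def apply_event(state: AccountState, event: Event) -> AccountState:
    """Pure function: previous state plus one event gives the next state."""
    if event.type == "AccountOpened":
        return AccountState(is_open=True, balance=0)
    if event.type == "MoneyDeposited":
        return AccountState(state.is_open, state.balance + event.amount)
    if event.type == "MoneyWithdrawn":
        return AccountState(state.is_open, state.balance - event.amount)
    return state  # unknown event types leave the state unchanged


def current_state(history: list[Event], entity_id: str) -> AccountState:
    """Fold the entity's events, in recorded order, into its current state."""
    relevant = (e for e in history if e.entity_id == entity_id)
    return reduce(apply_event, relevant, AccountState())


history = [
    Event("AccountOpened", "acct-1"),
    Event("MoneyDeposited", "acct-1", 100),
    Event("AccountOpened", "acct-2"),
    Event("MoneyWithdrawn", "acct-1", 30),
]
state = current_state(history, "acct-1")
print(state)
```

A secondary repository (database or search index) could be populated the same way, with the fold writing rows or documents instead of returning a value.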

Documentation Requirements

Keep all documents as text files within this repository.

Maintain a requirements document and architecture diagrams.

Constraints

Support different event stores; no tight coupling to a particular implementation
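One way to keep the coupling loose is to have the rest of the code depend only on a small store contract. The sketch below uses a Python `Protocol` for that contract. `EventStore`, `append`, `read_all`, and both implementations are hypothetical names chosen for this example; the second implementation keeps JSON strings only to stand in for a store with a different internal representation.

```python
import json
from typing import Protocol


class EventStore(Protocol):
    """Hypothetical minimal contract; the method names are assumptions."""

    def append(self, event: dict) -> None: ...

    def read_all(self) -> list[dict]: ...


class ListEventStore:
    """Keeps events as Python dicts in a list."""

    def __init__(self) -> None:
        self._events: list[dict] = []

    def append(self, event: dict) -> None:
        self._events.append(event)

    def read_all(self) -> list[dict]:
        return list(self._events)


class JsonLinesEventStore:
    """Keeps events as serialized JSON lines, as a file-backed store might."""

    def __init__(self) -> None:
        self._lines: list[str] = []

    def append(self, event: dict) -> None:
        self._lines.append(json.dumps(event))

    def read_all(self) -> list[dict]:
        return [json.loads(line) for line in self._lines]


def count_events_of_type(store: EventStore, event_type: str) -> int:
    """Application code sees only the contract, never a concrete store."""
    return sum(1 for e in store.read_all() if e["type"] == event_type)


# The same application code runs unchanged against both implementations.
results = []
for store in (ListEventStore(), JsonLinesEventStore()):
    store.append({"type": "UserRegistered", "user": "alice"})
    store.append({"type": "UserRenamed", "user": "alice"})
    store.append({"type": "UserRegistered", "user": "bob"})
    results.append(count_events_of_type(store, "UserRegistered"))
print(results)
```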

Some tools that may be useful

Java Faker: generate a stream of test events
Report engines:
1. Apache Superset
2. Eclipse BIRT
Event stores:

Note, I am allergic to the Confluent stack.

General Approach to Event Sourcing Applications

What are the events? the commands? the queries?
- Every command is mapped to one or more events.
- Every query retrieves events.
- The UI, or more generally portals, issue commands
- The back end records events
Each command might be a class, and events are classes
- Vertical Slice Architecture
- UI (or any portal) sends a command, and the command records events
- Commands may return an entity id, or nothing
- A portal might issue multiple commands back to back, establishing a data flow; earlier commands may raise events, later commands may query events and raise new events. Each command interacts only with the event store. Commands don’t know about each other.
Command processing
- check constraints by querying the event stream.
- Events from the query results can be reduced to a data structure.
- This data structure is per-command, something like a DTO; avoid a single application-wide domain model (in other words: avoid the “Single Model Fallacy”).
- functions to evaluate rules or make decisions are pure functions, issuing a query and iterating over events. No connection to commands or to other objects. These functions do not appear in the public documentation.
- if you need events from different filters, query the event stream with each filter and join the results into a single context… being careful about the performance impacts.
- if constraints are OK, execute the command (transform / side effects …) and generate events to record the changes made
- if not, raise an error
- command execution pipeline: query -> replay -> build model -> check model -> generate events -> record events
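The pipeline above can be sketched for a single command as follows. This is a minimal illustration under stated assumptions: the bank-account domain, the `WithdrawMoney` command, the `WithdrawContext` model, and `CommandRejected` are all hypothetical, and a plain list stands in for the event store.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """Hypothetical event; the fields are assumptions for this sketch."""
    type: str
    account_id: str
    amount: int = 0


class CommandRejected(Exception):
    """Raised when a constraint check fails."""


@dataclass(frozen=True)
class WithdrawMoney:
    """Hypothetical command."""
    account_id: str
    amount: int


@dataclass(frozen=True)
class WithdrawContext:
    """Per-command model: only what this command needs in order to decide."""
    is_open: bool = False
    balance: int = 0


def build_context(events: list[Event]) -> WithdrawContext:
    """Pure function: replay the queried events into the per-command model."""
    is_open, balance = False, 0
    for e in events:
        if e.type == "AccountOpened":
            is_open = True
        elif e.type == "MoneyDeposited":
            balance += e.amount
        elif e.type == "MoneyWithdrawn":
            balance -= e.amount
    return WithdrawContext(is_open, balance)


def handle_withdraw(stream: list[Event], cmd: WithdrawMoney) -> list[Event]:
    # query: select the events relevant to this command
    relevant = [e for e in stream if e.account_id == cmd.account_id]
    # replay + build model
    ctx = build_context(relevant)
    # check model
    if not ctx.is_open:
        raise CommandRejected("account is not open")
    if cmd.amount > ctx.balance:
        raise CommandRejected("insufficient funds")
    # generate events
    new_events = [Event("MoneyWithdrawn", cmd.account_id, cmd.amount)]
    # record events
    stream.extend(new_events)
    return new_events


stream = [Event("AccountOpened", "acct-1"), Event("MoneyDeposited", "acct-1", 100)]
handle_withdraw(stream, WithdrawMoney("acct-1", 60))  # leaves a balance of 40
try:
    handle_withdraw(stream, WithdrawMoney("acct-1", 60))
    outcome = "accepted"
except CommandRejected as err:
    outcome = f"rejected: {err}"
print(outcome)
```

Note that nothing between the query and the final `extend` protects against other writers, which is exactly the gap the next question is about.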
What to do if new, relevant events were raised between “replay” and “generate events”?
- How to tell: execute the query again … but this introduces yet another race condition: the time between this new query and when we record the new events
- or, just see if more relevant events were added at all. If more events were added, relevant to our use case, then we stop processing. But again this introduces another race.
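These notes leave the question open. One commonly used way to close the gap, offered here only as a possibility and not as this project's chosen answer, is to move the final check into the store: the command passes along how many relevant events it saw, and the store appends only if that count is unchanged, performing the check and the append under one lock. The sketch below assumes an append-only stream (so an unchanged count means no new relevant events). `ConditionalStore`, `append_if_unchanged`, `ConflictError`, and the seat-reservation data are hypothetical.

```python
import threading
from typing import Callable


class ConflictError(Exception):
    """Raised when relevant events were added after the command's query."""


class ConditionalStore:
    """In-memory store whose append is conditional on what the caller saw."""

    def __init__(self) -> None:
        self._events: list[dict] = []
        self._lock = threading.Lock()

    def query(self, matches: Callable[[dict], bool]) -> list[dict]:
        with self._lock:
            return [e for e in self._events if matches(e)]

    def append_if_unchanged(
        self,
        new_events: list[dict],
        matches: Callable[[dict], bool],
        expected_count: int,
    ) -> None:
        # The re-check and the append happen under the same lock, so no
        # other writer can slip in between them.
        with self._lock:
            current = sum(1 for e in self._events if matches(e))
            if current != expected_count:
                raise ConflictError("relevant events were added since the query")
            self._events.extend(new_events)


def is_seat_12a(event: dict) -> bool:
    return event["seat"] == "12A"


store = ConditionalStore()

# Two commands query the same (empty) context before either one records.
seen_by_first = store.query(is_seat_12a)
seen_by_second = store.query(is_seat_12a)

store.append_if_unchanged(
    [{"type": "SeatReserved", "seat": "12A", "by": "alice"}],
    is_seat_12a,
    len(seen_by_first),
)
try:
    store.append_if_unchanged(
        [{"type": "SeatReserved", "seat": "12A", "by": "bob"}],
        is_seat_12a,
        len(seen_by_second),
    )
    second_outcome = "recorded"
except ConflictError:
    second_outcome = "conflict"
print(second_outcome)
```

On a conflict the command can stop, as described above, or rerun its whole pipeline against the newer events.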
Commands do not return a payload, but they can return an envelope with status information (success/failure, reason for failure)
Queries return a data structure
- sourced, one way or another, from the event stream
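The split between the two return shapes might look like the sketch below: the command returns only a status envelope (optionally carrying an entity id), while the query returns data derived from the event stream. `CommandResult`, `register_user`, `list_registered_emails`, and the user-registration events are hypothetical, and a plain list stands in for the event store.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CommandResult:
    """Status envelope only; no domain payload."""
    success: bool
    failure_reason: Optional[str] = None
    entity_id: Optional[str] = None


def register_user(stream: list[dict], user_id: str, email: str) -> CommandResult:
    """Command: checks a constraint, records an event, returns an envelope."""
    already_used = any(
        e["type"] == "UserRegistered" and e["email"] == email for e in stream
    )
    if already_used:
        return CommandResult(False, failure_reason="email already registered")
    stream.append({"type": "UserRegistered", "user_id": user_id, "email": email})
    return CommandResult(True, entity_id=user_id)


def list_registered_emails(stream: list[dict]) -> list[str]:
    """Query: returns a data structure sourced from the event stream."""
    return [e["email"] for e in stream if e["type"] == "UserRegistered"]


stream: list[dict] = []
first = register_user(stream, "u-1", "alice@example.com")
second = register_user(stream, "u-2", "alice@example.com")
emails = list_registered_emails(stream)
print(first, second, emails)
```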
Design documentation
- List of commands; for each, the events recorded
Implement a vertical slice
- Which events make up the context?
- What is the context model - what do I need to know about the state of the system right now?
- What constraints do I need to check?
- What decisions do I need to make?
- Which events do I need to generate?
