Temporal Data in a Relational Database

Lately, I’ve been spending a great deal of time in the DB world. My most recent challenge has been in dealing with time-based, or temporal, data. In several real-world scenarios, we find that timing matters, particularly when dealing with transaction-based systems, such as banking, payroll, or HR.

Most relational databases with referential integrity work really well with current state data. But a system may need to keep up with the current, past, and future states as well. For example, if I’m going to create a paycheck for an employee, I may need to know something about his future pay-rate as in “On such-and-such date, Joe Blow will receive a paycheck and be paid x dollars.” His current state may be some amount less or more than that particular pay-rate on that date. To further complicate the matter, I will also want to know what we knew when we paid him on a particular date in the past. We may even back-date a pay-raise, creating a future state with an effective date in the past.

Whoa… Where’s the Doctor with the TARDIS when you need him?

These problems aren’t new. Many DB gurus far more able than I am have been digging around these issues for the past 30+ years and nobody has come up with a perfect solution. Some, such as Richard Snodgrass have written entire (quite good) books on the problem (Morgan-Kaufmann, ISBN: 1558604367 (1999)). My favorite discussion, though (primarily because of its simplicity), comes from the always ingenious Martin Fowler: http://martinfowler.com/eaaDev/timeNarrative.html. I highly recommend reading this article for a solid understanding of temporal data patterns.

For our project, we had a DB in place that handled temporal data by never performing an actual UPDATE against a record. This way, a row of data in a table represents a single point in time which can be given a Created Date and an Effective Date. When you need to UPDATE, you simply INSERT a new record with some identifying data, such as an Employee Number for example, to tie everything together. Pretty smart.

But one problem in dealing with relational data in a temporal model this way is in maintaining referential integrity as the data changes. When the updated record is inserted, how can you effectively maintain the relationship between, say, an Employee record, and time-sensitive Compensation data? As you can see, the scenario gets pretty messy quickly, so we need a more elegant solution than INSERTS with effective dating.

One solution that was tried was to create a single “Base” table, with immutable records that contained an ID that was used to tie everything together.

The revised “Base Table” model looked similar to this:

Although this model may work for a while, referential integrity is eventually lost because all entities, regardless of their relationship to each other, were tied back to only the single record in the base table. This had the effect of flattening the data model and forcing the maintenance of referential integrity into the application layer. Over time new applications would come on board and developers that didn’t understand the entity relationships (or didn’t care) ignore integrity rules.

The model we eventually decided on was inspired by what Martin Fowler referred to as a “Temporal Object” pattern. Basically, we constructed the entire model to maintain referential integrity between all entities through foreign key relationships. Temporal data, such as pay rate, was extracted into separate entities with a foreign key relationship created between them and the table from which they were extracted.

The model we ended up with was similar to this:

Naturally a major concern was that adding these extra tables would complicate JOINs in our queries which, of course, it did. However, most queries added only a single extra JOIN, but there was definitely the potential for many more, depending on the specific query needs. Given the need for Referential Integrity + Temporality, the trade-off was accepted.

With the exception of auditing, most of our needs required only data from a single point in time. We managed this by using an Effective Date field in our temporal tables and passing in a date as a parameter for all our queries. We could retrieve a single employee record for a given point in time like this:
SELECT e.EmployeeId, d.EmployeeDataId, d.EffectiveDate, d.EmployeeNumber, d.EmploymentType, d.HireDate, etc FROM Employee e JOIN (SELECT * FROM EmployeeData WHERE EffectiveDate = (SELECT MAX(EffectiveDate) FROM EmployeeData WHERE ((EmployeeId = @EmployeeId) AND (EffectiveDate <= @EffectiveDate)))) d ON e.EmployeeId = d.EmployeeId WHERE e.EmployeeId = @EmployeeId
Note that rather than joining directly on the temporal data table, we join on a query that retrieves the MAX effective date starting at the date we pass in as a parameter. This would allow us to retrieve the most recent data as of EffectiveDate we passed in.

[For another approach to the above SQL see Temporal Data in a Relational Database Pt. 2]

There are many other approaches to this problem, of course, but this approach provides the benefit of supporting both temporal record maintenance as well as referential integrity while being only slightly more complex.

11 thoughts on “Temporal Data in a Relational Database”

Kessir December 29, 20134:32 pm Reply

I have been thinking of this problem for a while now. I am really glad I found your blog post. Thank you for sharing.
1. Ed Jones January 8, 201410:49 am Reply
  
  You’re welcome! I’m glad you found it useful!
Temporal Data in a Relational Database Pt. 2 | Extremely Talented Monkeys March 10, 20147:09 am Reply

[…] basics of the pattern remain the same, so I simply refer you to the article, Temporal Data in a Relational Database, for a […]
Database: To delete or not to delete records | ASK AND ANSWER December 25, 20151:45 am Reply

[…] For temporal data, see: https://talentedmonkeys.wordpress.com/2010/05/15/temporal-data-in-a-relational-database/ […]
Le-roy Staines March 16, 20169:05 pm Reply

Microsoft SQL 2016 and SQL Azure now supports Temporal tables in a very transparent and powerful way: https://channel9.msdn.com/shows/data-exposed/temporal-in-sql-server-2016
software developer May 13, 20165:25 am Reply

Thanks Ed Jones, for the nice explanations.
I have some queries regarding the tables structure you have mentioned in your article. You have mentioned EmployeeNbr, EmploymentType and HireDate in Employee table but in query you are referring those columns from the EmployeeeData table, can you please clarify, it will help me to understand in detail.

I am considering the same for my project, but I am thinking only use EmployeeId and audit fields in Employee table while all others Employee related details in EmployeeData table. Is this correct or some employee fields should be in Employee table?
Aramist February 21, 201811:20 am Reply

Really great concept! I like the idea of breaking the pricing out into a child table which can easily be sorted by “effective date”. Did you come up with any mockups of what the user interface would look like when editing the temporal data? Did you show all of the historical data in the forms? Was the temporal nature of the form invisible to the user?

I could see arguments for both approaches. On one hand, if you make the temporal nature invisible it makes for a very simple user experience. However, if the user wanted to make changes to something in the future, they would need access to both the present data, and the ability to add future data.

I would be interested to see screenshots/mockups of how you solved the interface problem.
1. Ed Jones March 5, 20182:26 pm Reply
  
  Thanks for the feedback! I don’t have any screenshots as this work was done long ago, but the current data was largely displayed on a page about a single item with a grid at the bottom of the page displaying the entire history.
Database: To delete or not to delete records - PhotoLens February 17, 202212:53 am Reply

[…] For temporal data, see: https://talentedmonkeys.wordpress.com/2010/05/15/temporal-data-in-a-relational-database/ […]
유효 날짜가있는 가격을 저장하는 방법은 무엇입니까? 사람으로부터 특정 제품을 주문할 수 있습니다. 효과적인 가격을 - IT HOW March 27, 20226:38 pm Reply

[…] https://talentedmonkeys.wordpress.com/2010/05/15/temporal-data-in-a-relational-database/ […]
Best practice for erased data August 13, 20223:09 am Reply

[…] my experience I prefer to keep this data, but for some cases I use temporary tables , which may help you. In some cases it really does not make sense, such as when storing […]