I was taken aback when I was recently asked this question by a colleague of mine who is a data professional. Because this seems to be such a straightforward and easy question,  I thought ‘surely, they would know the answer to this?!’ but on reflection I realized this is yet another example of how we data professionals are actually very bad at defining things and as data governance professionals,

This is even worse, because we spend our time helping others and asking others to write definitions for their data and yet we so often don’t define the terms we use well enough for others to understand. 

Upon some reflection, I decided that this was actually an excellent question and was definitely one that I should answer.

The first thing I did was look at one of my most commonly used reference books, the DAMA Dictionary of Data Management. This is an excellent reference book for anyone in data governance. However, on this occasion, it did take me aback

I opened the page at Data Domain and was quite surprised at the definition it gave. It states that ‘a data domain is a set of allowable values for a data attribute’. However, that is not how I use the term and I think that is the perfect example of what happens as data professionals. We start using a term and people we work with start using it, it proliferates, but we’re not necessarily using it for its original intention.

While a data domain is perhaps terminology more commonly used in data modelling and in databases, we use it a lot in the data governance world, but in my view, with a slightly different meaning. We clearly don’t mean ‘a set of allowable values for a data attribute’ -that’s very techy and data geeky and not at all the type of language we would want to use when we’re trying to talk to business users. So, what do we mean?

Well, when I use the term, I mean a logical grouping of data – something where we can tell where it starts, and it ends. From my point of view, I’m normally trying to identify data domains so that I can assign data owners.

For example, you might call customer data a data domain. Or finance data, HR data, product data, supplier data. These are all ideas of logical groupings of data that all relate together.

It’s then the work of data governance to work out the details of what is actually included in those domains… but that is what I, and many professionals that I work with, mean when we say, ‘data domain’.

Sometimes I don’t use the word data domain. I talk in terms of ‘data set’, which is any logical grouping of data and I think you can use the words ‘data set’ and ‘data domain’ interchangeably. Just make sure that you actually understand what you mean when you use the term and explain to the business users that you’re talking to what it means to avoid all confusion.

So, there you have it. That’s my definition of a data domain.