Almost no one knows what a reference data is, but everyone has seen and used it many times. Those are all those codes, lists of values, lookup, or dropdowns that we have all filled in on numerous forms. Those are all those vague status, type or flag columns. What does status = ‘D’ mean? Is it ‘Disabled’ or ‘Deleted’? What values are even allowed? Those and more questions can be answered with a convenient documentation.
Learn how to get your lookups under control.
Watch our webinar: https://dataedo.com/videos/documenting-reference-data-lookups-codes-and-how-it-helped-me-fix-data-quality
2. Agenda
1. What is reference data?
2. What’s the challenge?
3. Reference data in Dataedo
4. How I fixed my Conversion Report
5. Benefits
6. What you need
7. Q&A
4. Reference data - definition
Reference data is data used to classify or categorize other data.
5. Reference data - AKA
1. Codes
2. Lookups
3. Lists of values
4. Enums
5. Data (field) Domains
6. Examples
1. Countries
2. Accounts
3. Currencies
4. Units of measure
5. Departments
6. Types (e.g. document)
7. Statuses (e.g. order)
8. Calendar (e.g. quarters)
7. Example: booking a travel
1. Destination country, city, airport
2. One way/Round trip
3. Airline (American Airlines, Lufthansa, KLM, …)
4. Car rental (Herz, Budget, …)
5. Title (Mr, Mrs, Ms, Miss)
6. Number of passengers/guests (1, 2, 3, …)
7. Accomodationtype (Hotel, Appartment, Motel, B&B, …)
8. Hotel – stars (1-5)
9. Facilities (Parking, Restaurant, WiFi, Swimming pool…)
10. Car type (Mini, Standard, Sport, Pickup, Van)
8. Transcational vs Master vs Reference data
Transactional Data Master Data Reference Data
Transactions/events Business Entities Labels, categories
Orders, Quotes, Trials, … Employees, Customers,
Products
Countries, curencies, segments,
types
Millions… Hundreds, thousands, … Tens, hundreds
New records daily, hourly Slowly changing Very slowly changing
Metadata
28. Better understanding of data
• Developers – less bugs
• Data/BI anaysts – faster and more reliable analytics
• Business users – better understanding of reports
29. Better data quality
• Understand data quality
• Build Data Quality rules
• Build Data Cleansing rules