A dataset of schema transitions from the evolution history of the database of an AI assistant over 10 years. It contains 100 transitions.
Wikipedia: A dataset of schema transitions from the evolution history of the database behind the Wikipedia website over 15 years. It contains 250 transitions.
Hospital: A dataset of schema transitions from the evolution history of the database of a hospital information system over 7 years. It contains 50 transitions.
For each dataset, we run Phasic Extractor for different values of k and we compute the divergence from the mean for each extracted phase. Lower values indicate better quality phases.
We can observe that for k close to the “ground truth” number of phases:
- Assistant: 3 phases