Random Corp
This repo is an end-to-end reference implementation of data products using Markdown Data Definition Language (MD-DDL). It implements data products for sourcing and for the Financial Crime domain.
The project leverages:
- https://git.semprini.me/paul/semprini-core for I&AM and code repos.
- https://git.semprini.me/paul/semprini-data for data management, data product metadata, real-time data sourcing, and data transform platforms.
- https://git.semprini.me/paul/semprini-data-domain for foundational and experience data products.
Project Goals
- Synthetic data generators per source system, replicating the realistic cadence of data change and the system semantics of each source system. This will be achieved by:
  - A single source system Postgres DB server for all sources
  - Per source system database instances
  - A Python (faker) synthetic data generator per source system
  - The ability to set the change cadence
  - Using the MD-DDL source definitions to define schemas and synthetic generators
- Source aligned data products per source system. These will be defined once the synthetic data generators are running.
- Domain aligned data products. Detail to be defined later, but they will be:
  - Party Core
  - Financial Transaction
  - Product Core
- Consumer aligned data products. To be defined later.
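
The synthetic generator goal could be sketched roughly as below. This is an illustration under stated assumptions, not the project's implementation: `SourceSystem`, `ChangeEvent`, `generate_changes`, the 70% update ratio, and the `balance` payload field are all hypothetical names and values. It uses only the standard library as a stand-in for Faker, and a simulated clock instead of sleeping, so the change cadence can be tuned and tested deterministically. Schemas derived from MD-DDL source definitions are out of scope here.

```python
import random
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Iterator


@dataclass(frozen=True)
class SourceSystem:
    """Hypothetical per-source-system settings (illustrative names only)."""
    name: str                            # e.g. "core_banking"
    database: str                        # database on the shared Postgres server
    mean_seconds_between_changes: float  # the configurable change cadence


@dataclass(frozen=True)
class ChangeEvent:
    """One simulated change in a source system."""
    source: str
    operation: str       # "insert" or "update"
    record_id: int
    occurred_at: datetime
    payload: dict


def generate_changes(
    system: SourceSystem, start: datetime, count: int, seed: int = 0
) -> Iterator[ChangeEvent]:
    """Yield `count` change events on a simulated clock.

    Gaps between events are drawn from an exponential distribution whose
    mean is the configured cadence (an assumed model of change arrival).
    """
    rng = random.Random(seed)  # seeded so runs are reproducible
    now = start
    next_id = 1
    existing_ids: list[int] = []
    for _ in range(count):
        now += timedelta(
            seconds=rng.expovariate(1.0 / system.mean_seconds_between_changes)
        )
        # Assumed mix: about 70% updates once at least one record exists.
        if existing_ids and rng.random() < 0.7:
            operation, record_id = "update", rng.choice(existing_ids)
        else:
            operation, record_id = "insert", next_id
            existing_ids.append(record_id)
            next_id += 1
        # Placeholder payload; a Faker provider would supply realistic values.
        payload = {"balance": round(rng.uniform(0, 10_000), 2)}
        yield ChangeEvent(system.name, operation, record_id, now, payload)


if __name__ == "__main__":
    demo = SourceSystem("core_banking", "core_banking_db", 30.0)
    for event in generate_changes(demo, datetime(2024, 1, 1), count=5):
        print(event.occurred_at.isoformat(), event.operation, event.record_id)
```

Each event could then be written to the matching per-source database on the shared Postgres server. Lowering `mean_seconds_between_changes` makes a source system change more often without touching the generator logic.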