Overall Architecture

alt text

File Processor

Takes a sample data file as input and convert it automatically in a dataset representation. The current supported format are: AVRO, CSV, EXCEL, JSON, JSON Schema, PARQUET, XML, TEXT, SQL, XML, XSD

Dataset Controller

Web service to create dataset manually or programmatically.

Fields Detection Service

Service that infer data type based on values, field name and schema definition (optional).

alt text

Job Triggers

Jobs can be started manually or automatically (cron, random, burst).

Data Generator

From an existing dataset, a data generation job will create realistic data that matches all the provided requirements.

Sinks

The data is sent to a supported sink (data output).


Copyright © 2022 Datamaker Inc.