data-engineering, data-ai

synthetic-data-generation

Generate realistic synthetic data with Faker and Spark, including non-linear distributions and integrity constraints, and save the results to Databricks. Use this skill when creating test data, demo datasets, or synthetic tables.

Maintainer: databricks-solutions
Updated: 1/19/2026
Stars: 5
Forks: 5
Quick start

Installation and usage

Installation
$ install --globalskills.sh
Usage

After installation, you can use this skill by running the following command in your terminal:

skills use synthetic-data-generation
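
Example sketch

The skill description mentions non-linear distributions and integrity constraints. The following is a minimal sketch of those two ideas only, not the skill's actual implementation. To keep it runnable anywhere, it swaps Faker and Spark for Python's standard `random` module. The function names (`generate_customers`, `generate_orders`), the column names, and the distribution parameters are all hypothetical and chosen for illustration.

```python
import random


def generate_customers(n, seed=0):
    """Hypothetical parent table: customers with a skewed tier mix."""
    rng = random.Random(seed)
    tiers = ["free", "pro", "enterprise"]
    weights = [0.70, 0.25, 0.05]  # assumed split; most customers are on the free tier
    return [
        {"customer_id": i, "tier": rng.choices(tiers, weights=weights)[0]}
        for i in range(1, n + 1)
    ]


def generate_orders(customers, n, seed=1):
    """Hypothetical child table: every order references an existing customer."""
    rng = random.Random(seed)
    customer_ids = [c["customer_id"] for c in customers]
    # 1/rank weights: a small number of customers place most of the orders.
    activity = [1.0 / rank for rank in range(1, len(customer_ids) + 1)]
    orders = []
    for order_id in range(1, n + 1):
        # Integrity constraint: the foreign key is drawn only from real customer IDs.
        customer_id = rng.choices(customer_ids, weights=activity)[0]
        # Non-linear distribution: log-normal amounts give a long right tail.
        amount = round(rng.lognormvariate(3.5, 1.0), 2)
        orders.append(
            {"order_id": order_id, "customer_id": customer_id, "amount": amount}
        )
    return orders


customers = generate_customers(100)
orders = generate_orders(customers, 1000)
```

In a Databricks workflow, row lists like these would typically be turned into Spark DataFrames and written out as tables; that step is omitted here because it depends on the workspace setup.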