Expert
Data Engineering Architect
Designs ETL/ELT data pipelines, data lakes, and big data architectures.
📝 Prompt Content
You are a data engineering expert. I want to design a [PIPELINE TYPE] pipeline for [DATA SOURCE].
Complete data engineering architecture:
1. **Data Ingestion** : Batch processing, streaming ingestion, API connectors, file-based ingestion
2. **Data Storage** : Data lakes, data warehouses, lakehouses, storage optimization strategies
3. **ETL/ELT Design** : Transformation logic, data validation, error handling, data quality checks
4. **Big Data Processing** : Apache Spark, Hadoop ecosystem, distributed computing, optimization techniques
5. **Streaming Architecture** : Kafka, Flink, Storm, real-time processing, windowing operations
6. **Orchestration** : Apache Airflow, workflow management, scheduling, dependency management
7. **Data Governance** : Data cataloging, metadata management, lineage tracking, compliance
8. **Performance Optimization** : Partitioning strategies, indexing, caching, query optimization
9. **Monitoring & Quality** : Data quality metrics, pipeline monitoring, alerting, SLA management
10. **Cloud Integration** : AWS/GCP/Azure services, cost optimization, security, scalability
Provide the complete architecture, the ETL/ELT scripts, the configurations, and the monitoring strategies.
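
To show the kind of ETL script this prompt asks for, here is a minimal sketch of item 3 (transformation logic, data validation, error handling), using only the Python standard library. The record fields (`id`, `amount`) and the names `BatchResult`, `validate_record`, `transform_record`, and `run_batch` are hypothetical and chosen only for this example. A real pipeline would take its schema and quality rules from the actual [DATA SOURCE].

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Any


@dataclass
class BatchResult:
    """Outcome of one batch: clean rows plus rejected rows with a reason."""
    loaded: list[dict[str, Any]] = field(default_factory=list)
    rejected: list[tuple[dict[str, Any], str]] = field(default_factory=list)


def validate_record(record: dict[str, Any]) -> None:
    """Raise ValueError when a record fails a data quality check (assumed rules)."""
    if not record.get("id"):
        raise ValueError("missing id")
    amount = record.get("amount")
    if amount is None:
        raise ValueError("missing amount")
    try:
        float(amount)
    except (TypeError, ValueError):
        raise ValueError(f"amount is not numeric: {amount!r}") from None


def transform_record(record: dict[str, Any]) -> dict[str, Any]:
    """Normalize a validated record into the target shape."""
    return {
        "id": str(record["id"]).strip(),
        "amount": float(record["amount"]),
    }


def run_batch(records: list[dict[str, Any]]) -> BatchResult:
    """Validate and transform each record; collect failures instead of aborting."""
    result = BatchResult()
    for record in records:
        try:
            validate_record(record)
            result.loaded.append(transform_record(record))
        except ValueError as exc:
            result.rejected.append((record, str(exc)))
    return result
```

Collecting rejected records with a reason, rather than failing the whole batch, lets them be routed to a quarantine location and counted as a data quality metric (item 9).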