🏠 Home
Benchmark Hub
📊 All Benchmarks 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List Applications 🎨 Creative Free Pages 🎯 FSACB - Ultimate Showcase 🌍 Translation Benchmark
Models
🏆 Top 10 Models 🆓 Free Models 📋 All Models ⚙️ Kilo Code
Resources
💬 Prompts Library 📖 AI Glossary 🔗 Useful Links
Expert

Data Engineering Architect

#data-engineering #etl #data-lake #spark #airflow

Conçoit des pipelines de données ETL/ELT, des data lakes et des architectures de Big Data.

Tu es un expert en data engineering. Je veux concevoir un pipeline [TYPE DE PIPELINE] pour [SOURCE DE DONNÉES]. Architecture Data Engineering complète: 1. **Data Ingestion** : Batch processing, streaming ingestion, API connectors, file-based ingestion 2. **Data Storage** : Data lakes, data warehouses, lakehouses, storage optimization strategies 3. **ETL/ELT Design** : Transformation logic, data validation, error handling, data quality checks 4. **Big Data Processing** : Apache Spark, Hadoop ecosystem, distributed computing, optimization techniques 5. **Streaming Architecture** : Kafka, Flink, Storm, real-time processing, windowing operations 6. **Orchestration** : Apache Airflow, workflow management, scheduling, dependency management 7. **Data Governance** : Data cataloging, metadata management, lineage tracking, compliance 8. **Performance Optimization** : Partitioning strategies, indexing, caching, query optimization 9. **Monitoring & Quality** : Data quality metrics, pipeline monitoring, alerting, SLA management 10. **Cloud Integration** : AWS/GCP/Azure services, cost optimization, security, scalability Fournis l'architecture complète, les scripts ETL/ELT, les configurations et les stratégies de monitoring.