data-analytics

安装量: 1.6K
排名: #2973

安装

npx skills add https://github.com/markdown-viewer/skills --skill data-analytics

Data Analytics Diagram Generator Quick Start: Define data sources → Declare ingestion/ETL icons → Connect to storage/warehouse → Add BI/visualization → Wrap in plantuml fence. ⚠️ IMPORTANT: Always useplantuml or puml code fence. NEVER usetext — it will NOT render as a diagram. Critical Rules Every diagram starts with @startuml and ends with @enduml Use left to right direction for data pipelines (Source → Ingest → Transform → Store → Visualize) Use mxgraph.aws4.* stencil syntax for analytics, database, and storage icons Default colors are applied automatically — you do NOT need to specify fillColor or strokeColor Use rectangle "Zone" { ... } or package "Layer" { ... } for grouping pipeline stages Directed flows use --> , async/streaming flows use ..> (dashed) Full stencil reference: See stencils/README.md for 9500+ available icons. Mxgraph Stencil Syntax mxgraph.aws4. "Label" as Analytics & ETL Stencils Category Stencils Purpose Query Engine athena , athena_data_source_connectors Serverless SQL on S3 data ETL glue , glue_crawlers , glue_data_catalog , aws_glue_data_quality , aws_glue_for_ray Data integration & cataloging Streaming kinesis , kinesis_data_streams , kinesis_data_firehose , kinesis_data_analytics , kinesis_video_streams Real-time data streaming MapReduce emr , emr_engine , emr_engine_mapr_m3 , emr_engine_mapr_m5 Big data processing (Spark, Hive) Data Warehouse redshift , redshift_ra3 , redshift_streaming_ingestion , redshift_ml Columnar analytics warehouse Search opensearch_service_data_node , opensearch_ingestion , cloudsearch Full-text search & log analytics BI quicksight Dashboards & visualizations Data Lake lake_formation , s3 , glacier , glacier_deep_archive Governed data lake storage Catalog datazone_custom_asset_type , data_exchange Data governance & sharing Streaming Kafka msk , msk_connect Managed Kafka streaming Database Stencils Category Stencils Purpose Relational aurora , aurora_instance , rds , rds_instance , rds_mysql_instance , rds_postgresql_instance Transactional databases NoSQL dynamodb , dynamodb_table , dynamodb_global_secondary_index , dynamodb_stream Key-value & document store Graph neptune Graph database In-Memory elasticache , elasticache_for_redis , elasticache_for_memcached Cache & session store Document documentdb , documentdb_with_mongodb_compatibility Document database Ledger quantum_ledger_database Immutable transaction log Wide-Column keyspaces Cassandra-compatible Connection Types Syntax Meaning Use Case A --> B Solid arrow Batch data flow / API call A ..> B Dashed arrow Streaming / async / CDC A -- B Solid line Bidirectional sync A --> B : "label" Labeled connection Describe data format or volume Quick Example @startuml left to right direction mxgraph.aws4.s3 "Data Lake\n(S3)" as s3 mxgraph.aws4.glue "Glue\nETL" as glue mxgraph.aws4.redshift "Redshift" as rs mxgraph.aws4.quicksight "QuickSight" as qs s3 --> glue glue --> rs rs --> qs @enduml Data Analytics Architecture Types Type Purpose Key Stencils Example Data Lake Centralized raw data store s3 , lake_formation , glue , athena data-lake.md Real-time Streaming Event stream processing kinesis , msk , lambda_function , opensearch_service real-time-streaming.md Data Warehouse Star-schema analytics redshift , glue , quicksight data-warehouse.md ETL Pipeline Extract-transform-load glue , glue_crawlers , glue_data_catalog , s3 etl-pipeline.md Log Analytics Centralized logging kinesis_data_firehose , opensearch_service , lambda_function log-analytics.md ML Feature Store Feature engineering pipeline glue , s3 , athena , emr ml-feature-pipeline.md CDC Pipeline Database change capture dynamodb_streams , kinesis , lambda_function , redshift cdc-pipeline.md Multi-source BI Cross-database reporting aurora , dynamodb , redshift , quicksight multi-source-bi.md

返回排行榜