Course Outline
Greenplum Architecture
- Parallel processing and symmetric multi-processing
- Segment roles and cluster configuration
- Scalability and data movement
- Greenplum Data Warehouse architecture
Greenplum Table Structures
- Distributed vs. randomly assigned tables
- Heap vs. append-only tables
- Row vs. columnar storage formats
- Partitioned and clustered tables
Data Distribution and Hashing
- Hashing logic and distribution keys
- Skew handling and performance impact
- Hash maps and row placement strategies
Indexes and Performance Optimization
- Clustered and non-clustered indexes
- B-tree and bitmap index use cases
- Index scan and storage behavior
Physical Database Design
- Normalization and logical model design
- User access strategies and distribution analysis
- Data demographics and indexing decisions
Denormalization Techniques
- Derived data, summary tables, and pre-joins
- Columnar tables as vertical partitioning
- Data marts and materialized views
Advanced SQL and Query Execution
- Join strategies and redistribution
- OLAP and window functions
- Temporary tables, subqueries, and derived tables
EXPLAIN Plans and Query Tuning
- Reading and interpreting EXPLAIN output
- Cost analysis and plan optimization
- Join movement and segment-local operations
Greenplum Utilities and Best Practices
- ANALYZE and VACUUM
- Data loading and movement with Nexus
- Security, permissions, and performance tips
Summary and Next Steps
Requirements
- An understanding of relational databases and SQL
- Experience with data warehousing or analytical systems
- Familiarity with Linux command line operations
Audience
- Data architects and engineers
- Database administrators and technical leads
- BI developers and analytics specialists working with Greenplum
Testimonials (5)
los ejercicios y la resolucion de problemas
Mario Humberto Serrano Gutierrez - Hipodromo de Agua Caliente
Course - Greenplum Architecture and Data Modeling
La teoria fue lo mas interesante
Luis Antonio Jimenez Gil - Hipodromo de Agua Caliente
Course - Greenplum Architecture and Data Modeling
los ejercicios practicos y la disposicion para contestar las preguntas
Edith Vichua Solis - Hipodromo de Agua Caliente
Course - Greenplum Architecture and Data Modeling
la explicacion, creo que uso terminos quee para todos fue demasiado entendible
Moises Jafet Hernandez Fuentes - Hipodromo de Agua Caliente
Course - Greenplum Architecture and Data Modeling
the practices