Site LogoSite Logo
  • 🗃️ read_me
  • backend
    • api
    • canal
    • data-systems
      • compute_storage_separation_architecture
      • data_compute_platform
      • data_federation
      • data_lakehouse
      • data_lakes
      • data_models
      • data_processing_architectures
      • data_skew
      • data_warehouse_naming_conventions
      • data_warehouses
      • databricks_cluster_config
      • ddia_chapter_1
      • delta_lake
      • extract_transform_load
      • gemini_state_backend
      • hot_cold_storage
      • htap
      • materialization
      • materialized_view
      • olap
      • olap_operations
      • oltp
      • oltp_vs_olap
      • pangu
      • snowflake_schema
      • star_schema
      • storage_computing_architectures
      • storage_disaggregation_architecture
      • time_series_database
      • view
    • databases
    • frameworks
    • hologres
    • integration
    • languages
    • media
    • os
    • sql
    • streaming
  • concepts
  • dump
  • finance
  • frontend
  • infrastructure
  • skating
  • backend
  • data-systems
  • data_lakes

🗓️ 06032025 1753

DATA LAKES

pros​

  • flexibile data storage
  • streaming support
  • cost efficient in the cloud
  • support for AI / machine learniing

cons​

  • no transactional support
  • poor data reliability
  • slow analysis performance
  • data governance concerns (privacy / security)
  • data_warehouses still needed
  • lack of integration with a data catalog
  • ineffective partitioning
  • too many small files

References​

  • https://www.youtube.com/watch?v=myLiFw9AUKY&t=331s
Previous
data_lakehouse
Next
data_models
  • pros
  • cons
  • References