
🗓️ 06032025 1753
📎

data_lakes

pros

  • flexible data storage
  • streaming support
  • cost efficient in the cloud
  • support for AI / machine learning

cons

  • no transactional support
  • poor data reliability
  • slow analysis performance
  • data governance concerns (privacy / security)
  • data_warehouses still needed
  • lack of integration with a data catalog
  • ineffective partitioning
  • too many small files
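
The "too many small files" problem is commonly handled by compaction: rewriting many small files into fewer, larger ones. Below is a minimal sketch of the planning step only, under stated assumptions. The function name `plan_compaction`, the 32 MB "small" threshold, and the 128 MB target size are all hypothetical choices for illustration, not settings from any particular engine, and the sketch works on a plain mapping of file name to size rather than a real object store.

```python
from typing import Dict, List

MB = 1024 * 1024


def plan_compaction(
    file_sizes: Dict[str, int],
    small_threshold: int = 32 * MB,   # assumed cutoff for a "small" file
    target_size: int = 128 * MB,      # assumed upper bound for a merged file
) -> List[List[str]]:
    """Group small files into batches whose combined size is at most target_size."""
    # Only files below the threshold are candidates; sort for a stable plan.
    small = sorted(name for name, size in file_sizes.items() if size < small_threshold)

    batches: List[List[str]] = []
    current: List[str] = []
    current_size = 0
    for name in small:
        size = file_sizes[name]
        # Close the current batch if adding this file would exceed the target.
        if current and current_size + size > target_size:
            batches.append(current)
            current, current_size = [], 0
        current.append(name)
        current_size += size
    if current:
        batches.append(current)

    # A batch holding a single file gains nothing from being rewritten.
    return [batch for batch in batches if len(batch) > 1]


if __name__ == "__main__":
    sizes = {"part-0": 10 * MB, "part-1": 20 * MB, "part-2": 200 * MB, "part-3": 5 * MB}
    print(plan_compaction(sizes))
```

Each returned batch lists files that could be merged into one output file. Planning within a single partition at a time keeps the merged files consistent with the partition layout.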
