News from last month (2024/06 edition)
Data
- Benchmarking Report: Theseus Engine - Voltron Data
- Building an Apache Spark Performance Lab: Tools and Techniques for Spark Optimization
- How to learn data engineering
- Scaling data analytics with software engineering best practices
- The Future of Spark Technology: Igniting Tomorrow!
- My First Billion (of Rows) in DuckDB
- Déployer dbt avec Github Actions (Deploying dbt with GitHub Actions)
- dbt unit-test framework. A dedicated framework for unit-testing… - Teads
- LakeChime: A Data Trigger Service for Modern Data Lakes
- Lessons in adopting Airflow on Google Cloud - Booking
- Redshift vs Snowflake vs BigQuery vs Databricks vs …
- An Empirical Evaluation of Columnar Storage Formats
- Data Engineering at Netflix using Apache Spark and Flink with Joan Goyeau - YouTube
- Practical Performance Analysis by Simone Bordet - YouTube
- SQLFluff documentation
- S3 Is Showing Its Age
- How I Try To Keep Up With The Data Tech World (A List of Data Blogs)
- Tweeq Data Platform: Journey and Lessons Learned: Clickhouse, dbt, Dagster, and Superset
- Modernizing Uber’s Batch Data Infrastructure with Google Cloud Platform - Uber
- Data Platform Explained Part II - Spotify
Scala/Java
- Domain-Driven Design with FP in Scala
- What’s New in JMC 9? - Sip of Java – Inside.java
- Programmer’s Guide to JDK Flight Recorder - YouTube
Craft
- We invested 10% to pay back tech debt; Here’s what happened
- Fast git handover with mob - Tool for smooth git handover.
- Conventional Comments
- Learn Git Branching