News from last month (2024/02 edition)
Inspired by the awesome siegfried
Data
Spark
- Why combine asynchronous and distributed calculations to tackle the biggest data quality challenges?
- 1.5 Years of Spark Knowledge in 8 Tips
- Reconciling Spark APIs for Scala
- Enhancing Apache Spark and Parquet Efficiency: A Deep Dive into Column Indexes and Bloom Filters
Other
Perf
- The One Billion Row Challenge
- Reading a file insanely fast in java
- The One Billion Row Challenge Shows That Java Can Process a One Billion Rows File in Two Seconds
- Vector Similarity Computations FMA- style