PinnedPublished inThe StartupDemystifying Spark’s Stream-Stream OUTER JoinReal-time analysis & recommendation is always a fancy idea in data science. Nothing is more exciting than serving the most up-to-date…Dec 11, 2020Dec 11, 2020
Mazda 2019 CX-5 spark plug mythI own a 60k miles CX-5 and I am looking for a good replacement of the spark plugs that the dealers are trying to charge me for $200/4…Mar 2Mar 2
Speeding Up PySpark Tests in Docker: A Battle Against Spark’s DefaultsIf you’ve ever tried running PySpark inside a Docker image, you might have faced this: ✅ It works. ❌ It’s slow — painfully slow.Mar 1Mar 1
medical insurance lessonDo not pay without negotiation. Don’t be afraid of delaying.Oct 20, 2023Oct 20, 2023
How does Spark determine where to start to read from Kafka stream?if no checkpoint specified - “earliest” will read starting from the earlist retained data (retention) - “latest” will read starting from…Jan 11, 2021Jan 11, 2021
How the music streaming industry worksSince I work in this industry, my interest in how this industry works on the business level grow stronger recently. So I decided to spend…Nov 29, 2020Nov 29, 2020