StackToHeap is the podcast where I, Manoj Mahalingam, talk to developers in the Indian region to understand the work they are doing, their learning and the major problems they encounter in the industry.
All content for StackToHeap is the property of Manoj Mahalingam and is served directly from their servers
with no modification, redirects, or rehosting. The podcast is not affiliated with or endorsed by Podjoint in any way.
StackToHeap is the podcast where I, Manoj Mahalingam, talk to developers in the Indian region to understand the work they are doing, their learning and the major problems they encounter in the industry.
In the 10th Episode of the StackToHeap podcast, we speak with Lordson from Arjira Tech about the patterns, challenges and solutions that one comes across while setting up a data ingestion and processing platform using Spark.
We discuss about the data ingestion issues, handling PII data, DSL for onboarding new data sources and using notebooks for orchestrating the Spark jobs.
StackToHeap
StackToHeap is the podcast where I, Manoj Mahalingam, talk to developers in the Indian region to understand the work they are doing, their learning and the major problems they encounter in the industry.