Job Description:The ZPA Advanced Analytics team at Zscaler is looking for a Python/Spark Engineer with big data experience who will help us in the analysis of vast amounts of data, write production grade pipelines, and infrastructure optimization. This role will be responsible for product features related to data transformations, enrichment, and analytics. You will collaborate with internal stakeholders such as product managers, UI, and other backend teams to understand requirements and translate the same into functional specifications. You will seize every opportunity to refactor code in the interests of maintainability and reusability. We are someone who will lead and provide technical direction and mentorship to junior engineers and bring out the best in them. The ideal candidate has a proven track record of building large scale enterprise products, and is a creative thinker, problem solver, learner, and a fantastic manager of people, and is motivated with engineering excellence.Responsibilities:Data mining using state-of-the-art methodsEnhancing data collection procedures to include information that is relevant for building analytic systemsProcessing, cleansing, and verifying the integrity of data used for analysisAd-hoc analysis and presenting your results in a clear mannerCreating automated anomaly detection systems and constant tracking of its performanceMinimum Qualifications:Bachelor's/Master's in Computer Science or a related technical discipline.8+ years SWE experience5+ years of experience in developing and deploying Big data solutions.Experience with Python, PySpark (PySparkSQL), Spark (SparkSQL), SQL.Experience with Data visualization tools / Data visualization algorithms.Understanding and sense of data-warehousing and data-modeling techniques.Strong understanding of distributed systems, real-time analytics.Experience with stream-processing technologies like Apache Kafka and Apache Flink.Strong history of self/unit testing and feature verification.Experience with data engineering tasks or working with data engineers.Preferred Qualifications:Pipeline orchestration tool Kubeflow.#LI-KF3