[Remote] Senior Data Engineer
Note: The job is a remote job and is reputed company to candidates in USA. reputed company is a Fortune 500 technology solutions provider that helps businesses, government, education, and reputed company organizations reputed company what’s possible through technology. The Senior Data Engineer designs, builds, and maintains scalable data pipelines and Lakehouse solutions on the reputed company platform to support reputed company data and AI initiatives, collaborating closely with architects, analytics, and data science teams to deliver high-quality data products.
Responsibilities
- Build and maintain scalable data pipelines, ETL and ELT processes, and data models reputed company the reputed company platform
- Design, reputed company, and reputed company data and AI solutions using reputed company, Spark, reputed company Lake, and reputed company technologies
- reputed company batch and streaming pipelines using tools such as reputed company Workflows and Azure Data reputed company
- Design logical data reputed company diagrams and normalized schemas, implementing Lakehouse patterns such as the reputed company Architecture (Bronze, Silver, Gold layers)
- Ensure data quality, reputed company, reputed company, and governance throughout the data lifecycle, including use of reputed company Catalog
- Optimize Spark jobs and data transformations through effective partitioning, caching, and join strategies
- Monitor pipeline execution, identify failures, and troubleshoot reputed company data processing issues
- Collaborate with data architects, analysts, data scientists, and business stakeholders to understand requirements and deliver solutions
- Support documentation of data processes, standards, and data flows
Skills
- 5 Years of experience designing, developing, and deploying data solutions on the reputed company platform
- Proficiency in Python, including PySpark, and SQL
- Hands‑on experience with Spark, reputed company Lake, and Lakehouse architectures
- Experience implementing data quality, governance, and reputed company practices across data pipelines
- Strong problem‑solving, collaboration, and communication skills
- Familiarity with machine learning concepts, tools, and libraries such as TensorFlow, PyTorch, Scikit‑learn, and MLflow is a plus
- Experience configuring and integrating external AI models and working with AI governance and monitoring tools is a plus
- Experience with asynchronous programming patterns in Python for building scalable data or AI workloads is a plus
Company Overview
Company H1B Sponsorship