[Remote] Data Engineer
Note: The job is a remote job and is open to candidates in USA. Nscale is a GPU cloud company engineered for AI, providing infrastructure for AI start-ups and enterprises. They are seeking a Data Engineer to design and build data foundations that support their platform and operations, ensuring reliable and scalable data products.
Responsibilities
- Design and build scalable, reliable data pipelines that ingest data from infrastructure, platform services, and business systems
- Define data models and schemas that support operational workflows and use cases across the business, monitoring, and analytics
- Clean, transform and structure the data to create a digital twin of Nscale
- Implement permissioning and manage access and security of the Foundry implementation
- Create trusted datasets and metrics that power workflows and processes, internal tools, and customer-facing insights
- Enable self-serve analytics by establishing clear data contracts, documentation, and semantic layers
- Build use cases including but not limited to capacity planning, cost optimisation, reliability analysis, and customer reporting to drive our business forward
- Collaborate with Product and Commercial teams to translate real-world questions into robust data solutions
- Implement data quality checks, monitoring, and alerting to ensure data correctness and availability
- Codify data lineage, freshness, and consistency across systems
- Establish best practices around data versioning, access control, and governance appropriate for a fast-scaling company
- Continuously improve system resilience and observability
- Take end-to-end ownership of projects, from design through to production and iteration
- Help define standards, tooling, and ways of working for data at Nscale
- Contribute to technical decision-making as the company scales its platform and customer base
- Act as a thought partner to engineers and operators, not just a service function
Skills
- Deep, hands-on experience building in Palantir Foundry, including ontology modelling, pipeline development, API integration, and large-scale data platform design
- Strong proficiency in Python, with experience applying data engineering libraries and frameworks (e.g. Spark, PySpark, Dask, pandas) to work with large, complex datasets
- Familiarity with API-driven data integration, including REST, GraphQL, and Foundry Action APIs
- Practical experience working in Git-based development workflows, including code reviews, version control, and CI/CD pipelines
- Comfort working in ambiguous, early-stage environments where requirements evolve quickly
- Strong communication skills — able to explain data concepts clearly to technical and non-technical stakeholders
- A bias toward ownership, pragmatism, and shipping useful solutions
- Experience with cloud platforms (AWS, GCP, Azure) and infrastructure telemetry
- Familiarity with distributed systems, monitoring data, or usage-based billing data
- Experience supporting customer-facing data products or platforms
Benefits
- Highly competitive package (base + equity) with reviews every 12 months.
- Dynamic progression plan tailored to your ambitions.
- Human-First Flexibility: We treat you as humans first. Our flexible workplace trusts Nscalers to deliver, giving you the autonomy to shape your day around life's moments.
- Join our thriving remote-first team. Geography is no barrier to impact or connection. We build seamless virtual collaboration, empowering you, wherever you work.
Company Overview