[Remote] Research Scientist, Data

Work from home Full-time role Hiring

Note: This is a remote role open to candidates in the USA. The company is building creative infrastructure around real-time, multimodal AI and intelligent agentic platforms. They are looking for a Research Engineer, Data, at staff level or above, to architect and scale data engineering systems supporting model training for advanced multimodal models.

Responsibilities

  • Take ownership of large-scale data pipeline architecture and implementation to support model training and research workflows for text, image, audio, and video datasets
  • Partner with research and engineering teams to curate, clean, and manage diverse, sensory-rich datasets for pre-training and mid-training of multimodal models
  • Develop strategies and tools for scalable data ingestion, labeling, filtering, augmentation, and storage
  • Ensure data quality, reliability, and compliance, including managing privacy and ethical considerations throughout the data lifecycle
  • Optimize data processing, transformation, and delivery for large-scale distributed training pipelines
  • Prototype and productionize new methods for dataset creation, management, and improvement in response to researcher needs
  • Contribute to the integration of research-driven data advancements into production-grade systems
  • Stay informed on emerging data engineering and ML data management developments, bringing best practices to our systems

Skills

  • 5+ years of experience building and scaling data pipelines for machine learning applications at the staff engineer level or above, ideally in research or model training environments
  • Strong background in data engineering and ML data curation for LLMs, VLMs, or other large-scale multimodal models
  • Expertise in distributed data systems (e.g., Spark, Hadoop, Ray, or similar) and efficient large dataset processing/ETL workflows
  • Proven ability to build robust, scalable, and production-grade data infrastructure for ML pipelines
  • Experience developing tools for data labeling, filtering, deduplication, quality assurance, and dataset management
  • Strong programming skills (Python, SQL, PySpark, or similar) and familiarity with cloud data platforms (AWS, GCP, Azure)
  • Knowledge of privacy, compliance, ethics, and best practices in data collection and management
  • Excellent cross-functional collaboration, problem-solving, and communication skills
  • Passion for enabling cutting-edge AI and creative technology through data

Benefits

  • Competitive salary and substantial equity in a high-growth startup
  • Full health benefits, 401k matching, and more
  • Collaborative, mission-driven team environment with major growth opportunities
  • Flexible on-site/remote hybrid (HQ in Palo Alto, CA)

Company Overview

  • The company is an AI platform that lets users create videos from text prompts, with text-to-video, image-to-video, and editing tools. It was founded in 2023 and is headquartered in Palo Alto, California, USA, with a workforce of 2-10 employees. Its website is https://reputed company.art.

Company H1B Sponsorship

  • The company has a track record of offering H1B sponsorships, with 9 in 2025. Please note that this does not guarantee sponsorship for this specific role.