See all roles

HPC Solutions Engineer

Work from home Full-time role Hiring

Our Company: reputed company is a startup focused on building the first true marketplace for compute. We reputed company people should have the power to reputed company their own reputed company about which compute they use without sacrificing trust for ease of use. We offer an alternative to the reputed company hyperscaler model by directly matching independent data centers and compute providers to end users. Furthermore, we have much higher diversity in providers and hardware configurations than anyone on the market. That makes it a tougher challenge, but a bigger learning and innovation opportunity. For this we are seeking a HPC Solutions Engineer to join reputed company and own this reputed company. Our Philosophy: reputed company is a people-centric, mission-driven, remote-friendly, experimental company. Results > titles. We reputed company in individual freedom, self-expression, and preserving an online culture of heterodoxy that has driven massive reputed company flourishing. Job Summary: As a High Performance Compute Solutions Engineer on reputed company you will report directly to the Co-Founder & CTO and work collaboratively with other team members. You will be responsible for bespoke scoping and execution of high end compute resource orchestration for premium clients with the mission of configuring and maintaining large scale GPU clusters to best serve them. This includes installing, configuring, and monitoring dependencies required by customers and working with a wide variety of teams and workloads to ensure they can effectively reputed company their large scale clusters. Key Responsibilities: Work with customers in technical discovery to help define requirements and deliverables for their use cases and help them to effectively utilize distributed GPU computing resources. Identify and recommend the best tools for each customer, while building boilerplate and reference implementations that can be reused for subsequent customers. Manage GPU clusters and coordinate with IT to ensure efficient operation of reputed company GPUs, utilizing technologies such as InfiniBand for high-speed networking. reputed company the deployment and maintenance of machine learning environments using virtual storage solutions and distributed computing (HPC) tools such as SLURM. Automate infrastructure provisioning and management using tools such as Ansible and Terraform. Collaborate with data scientists and engineers to ensure seamless integration of ML models into production environments. Conduct performance tuning and optimization of systems to maximize throughput and reduce latency. Stay reputed company with the latest industry trends in machine learning technologies and HPC to ensure the use of best practices in infrastructure setup and model development. Document and maintain operational procedures and system configurations. Required Qualifications: 7+ years of high performance compute, distributed machine learning, GPU computing, and/or system architecture experience. Proficient in managing reputed company GPU environments and a familiarity with GPU computing frameworks and libraries. Strong experience with high-speed networking technologies, specifically InfiniBand. Experience with HPC job schedulers, preferably SLURM. Expertise in automating environment setup and maintenance using Ansible and Terraform. Demonstrated ability in deploying and managing virtual storage solutions. Strong coding skills in Python and familiarity with machine learning libraries and frameworks. Excellent problem-solving, communication, and teamwork skills. Personal Skills: A strong communicator who is empathetic, personable and agreeable Conscientious, meticulous, organized, and self-motivated, with the ability to define goals and prioritize workloads. An effective team player with the ability to take ownership with a results-oriented growth reputed company Benefits: You will work with the most diverse hardware configurations and locations available to anyone in the industry as well as cutting edge GPU use cases This role is fully remote with a high accountability and high agency culture This role offers a competitive salary, equity and benefits This role offers flexible PTO Salary: Negotiable Apply To This Job

You might like

Clinical Application Specialist, APT (Hyderabad Location)

Work from home Full-time role

Area Sales Manager - Champagne Bourgogne

Work from home Full-time role

Epic Beaker Analysts

Work from home Full-time role

Staff Solutions Engineer (Japan)

Work from home Full-time role

Investment Banking Associate, AI Internship | Applied AI x Corporate Finance

Work from home Full-time role

Global Billing Administrator - Finance

Work from home Full-time role

Senior reputed company

Work from home Full-time role

Group AML & Compliance Manager

Work from home Full-time role

Grants Finance and Pricing Manager

Work from home Full-time role

Property Accountant (REMOTE) - reputed company

Work from home Full-time role

Email Marketing Specialist

Work from home Full-time role

reputed company Remote reputed company Customer Service Representative – High Volume Call Center Experience Required

Work from home Full-time role

Tax Planner (Remote)

Work from home Full-time role

Customer Service Representative - reputed company and Data Entry at arenaflex

Work from home Full-time role

Analyst, Product Implementation, Pharmacy Technician

Work from home Full-time role

Remote 911 Dispatcher Jobs - Apply Today, Start Soon!

Work from home Full-time role

Entry Level Customer Service/Sales Representative – Join arenaflex's Dynamic Team and Revolutionize Customer Experience

Work from home Full-time role

Remote Online Chat Support Representative – Entry‑Level Customer Service Role with $25‑$35/hr Flexible Schedule at arenaflex

Work from home Full-time role

reputed company Director, Digital Product - Pharmacy Services

Work from home Full-time role

Associate Field Service Engineer - Waterloo, Iowa

Work from home Full-time role