Ai Engineer
Company:

Diverse Lynx


Details of the offer

AI+HPC infra requirement

looking for someone with Architectural and design experience also along with experience in handling 1000+ nodes.

Technical/Functional Skills -
Proficiency in RoCEv2, K8s, KVM, Ubuntu, Python, Shell, Go, Rust, GPU drivers, and Cluster interconnect with 200G/400G networking.
Managing GPU clusters optimizing GPU-based services/tools/software

Roles & Responsibilities -

Develop, implement, and maintain GPU-based clusters of 10 to 1000 nodes, ensuring optimal performance and availability.
Administer Client/AI platforms - Distributed Client services, LLMs, Vector-DB and AI inferencing, by managing deployments, resource allocation, monitoring, and security.
Collaborate with cross-functional teams to address AI infrastructure requirements, support AI-related projects, and provide technical expertise.
Monitor and evaluate the performance of AI systems and clusters, ensuring that they adhere to industry best practices and meet company standards.
Compile reports, document procedures, and publish recommendations for improving AI infrastructure and solutions.
Use AI/Client to continuously improve internal processes and tools that are used in end-to-end delivery of your services in this team

Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.
#J-18808-Ljbffr


Source: Grabsjobs_Co

Job Function:

Requirements

Ai Engineer
Company:

Diverse Lynx


Senior iOS Software Engineer - Growth

The Growth team’s mission is to connect users with Reddit’s core value of community while providing relevant content and experiences to them. As a core membe...


From Reddit - California

Published a month ago

Principal/Senior Engineer - Defi -Web3 Application(Marketplace & Discover)

Who We Are At OKX, we believe our future is reshaped with technology. Founded in 2017, OKX is one of the world’s leading cryptocurrency spot and derivativ...


From OKX - California

Published a month ago

Engineering Manager

We are a globally distributed team with folks from the United States, Canada, Chile, Mongolia, Turkey, India, Madagascar, Ukraine, Indonesia, Brazil, Colombi...


From Clipboard Health - California

Published a month ago

Lead Online Engineer

Who We AreFounded in 2005, 2K Games is a global video game company, publishing titles developed by some of the most influential game development studios in t...


From 2K - California

Published a month ago

Built at: 2024-06-07T20:42:11.570Z