Senior Technical Support Engineer

Company: NVIDIA
Company: NVIDIA
Location: China, Beijing
Commitment: Full time
Posted on: 2023-09-08 05:57
We are seeking a highly skilled Technical Account Manager (TAM) with extensive knowledge and experience in High-Performance Computing (HPC), InfiniBand, NCCL, and MPI technologies. As a TAM, you will be responsible for nurturing customer relationships and providing comprehensive support throughout the entire customer journey. Your expertise in Network Architecture, HPC, NCCL, and MPI will enable you to effectively address the unique challenges faced by customers in these specialized domains.What you’ll be doing:Act as the primary point of contact and advocate for key customer accounts in China, focusing on those utilizing HPC and InfiniBand technologies. Provide proactive support, conduct technical reviews, and offer guidance to prevent issues, optimize performance, and ensure customer satisfaction. Collaborate with cross-functional teams, including engineering, product management, and support, to address customer needs and resolve complex technical issues related to HPC and InfiniBand. Join group discussions and contribute technical expertise during onsite visits to customer locations, actively participating in workshops, planning sessions, and troubleshooting activities. Assist customers in optimizing their HPC applications, addressing performance bottlenecks, and ensuring efficient utilization of InfiniBand networking technology. Stay updated with industry trends, emerging technologies, and best practices in HPC and InfiniBand, providing insights and recommendations to customers for future planning and improvements. Take ownership of critical issues, coordinate resources, drive technical direction, and ensure prompt resolution, while maintaining open communication and managing customer expectations. A deep understanding of X86 Server Architecture, and Linux Systems, and experience in GPU Server Support would be a plus.What we need to see:Strong communication and presentation skills to effectively convey technical information to customers and stakeholders.Strong organizational skills with the ability to prioritize and multitask efficiently with limited supervision.Excellent interpersonal skills with the ability to maintain and manage the overall resolution for any escalated customer case under all circumstances.Extensive practical experience in Linux System Administration.Demonstrated ability to troubleshoot networking protocols using tools such as tcpdump and Wireshark or similar packet generation and analysis tools.Candidates should have a minimum of a four-year degree from an accredited university or college in Computer Science, Electrical Engineering, or Computer Engineering.Industry-recognized Linux/Cisco certifications are highly desired.At least 3+ years of relevant work experience.Ways to stand out from the crowd:Distributed parallel filesystems (Lustre, GPFS, parallel NFS).Batch scheduling systems (slurm, torque, SGE, AWS batch, AWS parallel cluster).Good understanding of server virtualization technologies with a specialty in OpenStack. Experience in High-Performance Computing (HPC), InfiniBand, NCCL, and MPI technologies.Familiarity with cloud computing platforms (e.g., AWS, Azure, Google Cloud) and their integration with HPC workloads is a plus. If you possess the required expertise in HPC and InfiniBand, along with hands-on experience supporting and optimizing production systems, we invite you to submit your resume along with any relevant certifications and a cover letter detailing your experience and qualifications.Our company fosters a culture of innovation, collaboration, and personal growth, providing ample opportunities for career development within a dynamic and exciting industry.
View Original Job Posting