Job Description
Key Skills: Linux, C Programming, Device driver, TCP/IP, Networking, TCPIP, C++, EthernetRoles and Responsibilities:Design, develop, and maintain system-level software enabling GPU-to-GPU communication.Collaborate with cross-functional hardware and software teams to deliver scalable networking solutions.Contribute to architecture and optimization of GPU fabric networking for large compute clusters.Align software implementation with deployment environments and customer requirements.Develop and maintain device drivers and kernel modules supporting networking hardware.Debug and profile multi-threaded, high-performance software interacting with hardware.Implement robust and efficient networking protocols on Linux systems.Participate in code reviews, unit testing, and integration testing to ensure software quality.Document software designs, APIs, and procedures for internal and external use.Work effectively with a distributed team, communicating design decisions and progress clearly.Skills Required:Must-Have:Linux kernel and system-level programmingC programming (including memory management and concurrency)Device driver development and debuggingTCP/IP stack and networking protocolsUnderstanding of high-performance interconnects (InfiniBand, Ethernet)Networking fundamentals and distributed systems knowledgeNice-to-Have:C++ programming for system softwareEthernet hardware and protocol expertiseMulti-threaded and parallel programmingGPU fabric architecture experienceKernel module development and debugging tools (GDB, perf, strace)Education: A degree or equivalent experience in Computer Science, Electrical Engineering, or a related field (B.S., M.S., or Ph.D.)