Fujitsu’s R&D vision is to create cutting-edge technologies that support society and prioritize the flow of data. Our five key technologies are essential elements for collecting data from all parts of society, transporting it over ultra-high-speed and secure networks, analyzing it with trusted AI, converting it into value, and returning it to society. Central to realizing our strategy and vision are the people behind our R&D initiatives. We strive to foster top-notch talent in the innovation process by nurturing skilled individuals equipped to advance this cause with originality and dedication.
Role Purpose
We are seeking a result-oriented Solution Architect with expertise in deep learning and machine learning, to design & develop solutions powered by Fujitsu’s new processor, which help solve real-world challenges facing society and businesses across different industries. The work involves designing a software development kit which will accelerate the development of AI, deep learning (DL), and large language model (LLM)-based applications on Fujitsu’s next gen processors (ARM based CPU). The ideal candidate will have knowledge, experience, and passion for both AI models/applications and hardware/software system & should be skilled in back-end development, high-performance computing, AI framework engineering on modern CPU-based architectures.
Responsibilities
-
Design and implement modules for the SDK including, Preprocessing and postprocessing pipelines, Model orchestration and inference engines and Integration with underlying CPU-specific acceleration libraries
-
Develop APIs and utilities that enable fast prototyping and production deployment of DL/LLM workloads
-
Benchmark and optimize SDK performance across target Hardwares
-
Enable and optimize OSS/ISV applications for Fujitsu’s new processor, starting with LLM-related applications.
-
Solid understanding of hardware & software systems
-
Maintain clean documentation and provide best practices for downstream users
-
Troubleshooting, debugging, and fixing bugs and upgrading software/applications.
-
Writing technical documentation
-
Learning continually, sharing knowledge, and fostering exchange of skills
-
Working using agile methods (planning meetings, review meetings, standup meetings, development, etc.)
-
Working on multiple projects at once while keeping focused on project timeline
Key Performance Indicators:
-
Number of software applications enabled and optimized for Fujitsu’s new processor.
-
Number of software applications with improved performance
-
Development of new software technologies
Experience:
You will be able to demonstrate that you have:
-
A degree in Computer Science, Engineering, or a related field (Master’s/MTech or PhD preferred)
-
Minimum 5 years’ experience in SW development, Deep Learning & AI based application development, deployment and optimization.
-
Proficiency with fundamental back-end server-side languages such as Python & C/C++.
-
Knowledge in neural compression & quantization techniques such as pruning & knowledge distillation
-
Knowledge of LLM Fine tuning & training techniques
-
Knowledge and experience related to AI Agents, RAG, NLP, Computer Vision.
-
Familiarity with Inference runtimes ONNX Runtime, OpenVINO.
-
Experience with Deep learning frameworks such as PyTorch, TensorFlow, JAX, Llama.cp & vLLM.
-
Experience in software development in agile approach
-
Strong debugging skills using profiling tools (PyTorch Profiler, TensorFlow Profiler, perf).
-
Excellent writing, verbal communication, and teamwork skills
Preferred Experience:
You will be able to demonstrate that you have:
-
Prior experience working on multiple ML/DL SDKs
-
Exposure to model optimization techniques for CPU
-
Experience in building profiling tools
-
Knowledge of hardware-aware optimizations (oneDNN, XLA, ROCm, MLIR, SparseML).
-
Contributions to open-source community
-
Publications in International forums / conferences / journals
-
Knowledge and experience in Cloud service (Azure/AWS) features such as Functions, VM, Container, DevOps (CI/CD)
-
Great skills in evaluating performance and security of software applications and delivering solutions which are efficient and performant.