ServiceNow is seeking a Senior Machine Learning Engineer, AI Inferencing, to join its PLATO team. The ideal candidate will have experience using AI to improve user experience and workflow efficiency, and will be responsible for building and optimizing a high-performance inferencing platform.
Requirements
- Experience using AI, or thinking critically about how to integrate it, in work processes, decision-making, or problem-solving.
- Low Latency Optimization: Experience optimizing models for low-latency inference, which is important for real-time applications.
- High Throughput Optimization: Knowledge of maximizing inference throughput.
- Real-time Systems: Understanding of the constraints that real-time systems place on model inference.
- Model Quantization and Compression: Practical experience in reducing model size and computational cost.
- Proficiency in prompt engineering and in developing LLM-based features.
- Experience using AI productivity tools such as Cursor and Windsurf.
- Minimum of 5 years of experience working in a software development role.
- Proficiency in Python and Golang, with a strong grasp of software engineering principles.
- Hands-on experience with prompt engineering: ability to craft, test, and optimize prompts for task accuracy and efficiency.
- Demonstrated ability to thrive in fast-paced, dynamic environments.
- Knowledge of unit testing, profiling, and code tuning.
Benefits
- Base pay of $158,500 - $269,500, plus equity (when applicable), variable/incentive compensation and benefits.
- Health plans, including flexible spending accounts, a 401(k) Plan with company match, ESPP, matching donations, a flexible time away plan and family leave programs.