Principal Machine Learning Operations Developer - Inference Optimization
Company: Cerence Inc.
Location: Montreal, QC H1A 0A1
Description:
A Moving Experience.
Do you have a passion for pushing the boundaries of innovation? Are you excited about AI's potential to improve the human experience? Then come join the ride!
Who is Cerence AI?
Cerence AI is the global leader in AI for transportation, specializing in building AI and voice-powered companions for cars, two-wheelers, and more that enable people to focus on what matters most. With over 500 million cars shipped with Cerence AI technology, we partner with leading automakers (such as Volkswagen, Mercedes, Audi, Toyota and many more), mobility providers, and technology companies to power intuitive, integrated experiences that create safer, more connected, and more enjoyable journeys for drivers and passengers alike.
Our Driving Force
Our team is dedicated to pushing the boundaries of AI innovation, working around the globe with headquarters in Burlington, Massachusetts, USA and 16 other offices across Europe, Asia, and North America. We bring together diverse backgrounds and varied skill sets with the shared goal of advancing the next generation of transportation user experiences. Our culture is customer-centric, collaborative, fast-paced, and fun, with continuous opportunities for learning and development to support your career growth.
Interested in having a significant impact in a dynamic industry with a high-performing global team? We're looking for an exceptional Senior Machine Learning Operations Developer who is ready to drive the future of mobility with us!
Your Impact:
- Design, develop, and implement strategies to optimize AI/ML inference pipelines for performance, scalability, and cost efficiency.
- Collaborate closely with other Principal and Senior Engineers on the team, fostering a culture of knowledge-sharing and joint problem-solving.
- Work with cross-functional teams, including MLOps, data science, and software engineering, to integrate optimized inference solutions into production environments.
- Drive innovation in hardware acceleration, quantization, model compression, and distributed inference techniques.
- Stay up to date with LLM hosting frameworks and their configuration at both the machine and cluster levels (e.g., vLLM, TensorRT, Kubeflow).
- Optimize systems using techniques such as batching, caching, and speculative decoding.
- Conduct performance tuning, benchmarking, and profiling for inference systems, with expertise in memory management, threading, concurrency, and GPU optimization.
- Manage model repositories, artifact delivery, and related infrastructure.
- Develop and maintain logging mechanisms for diagnostics and research purposes.
Required Qualifications:
- 10+ years of experience in software engineering, with a focus on AI/ML.
- Deep expertise in AI model optimization techniques, including quantization, pruning, knowledge distillation, and hardware-aware model design.
- Proficiency in programming languages such as Python, C++, or Rust.
- Experience with AI/ML frameworks such as TensorFlow, PyTorch, and ONNX.
- Hands-on experience with GPU/TPU acceleration and deployment in cloud and edge environments.
- Strong DevOps mindset with experience in Kubernetes, containers, deployments, dashboards, high availability, autoscaling, metrics, and logs.
- Strong problem-solving skills and the ability to make data-driven decisions.
- Excellent communication skills and the ability to articulate complex technical concepts to a diverse audience.
Preferred Qualifications:
- Experience with Kubernetes, Docker, and CI/CD pipelines for AI/ML workloads.
- Familiarity with MLOps practices and tools, including model versioning and monitoring.
- Familiarity with performance tuning of inference engines like vLLM and techniques such as LoRA adapters.
- Understanding of LLM architecture and optimization.
- Contributions to open-source AI/ML projects.
- Familiarity with automotive or transportation industry applications.
- Master's or Ph.D. in Computer Science, Machine Learning, or a related field.
What we offer:
- The opportunity to join a brand-new team focused on cutting-edge AI/ML advances.
- A collaborative and inclusive work environment with a strong emphasis on innovation.
- A competitive salary and a comprehensive benefits package.
- Opportunities for professional development and career advancement.
- The chance to work with cutting-edge technologies and make a real impact.
Location:
This position is based in Montreal, with opportunities for hybrid work arrangements. Remote candidates based in the United States or Canada with relevant profiles are encouraged to apply.
Join us:
If you are passionate about AI/ML and eager to collaborate on transformative projects in inference optimization, we want to hear from you. Apply now and become part of Cerence AI's journey to redefine connected mobility!
What we offer
We offer a generous compensation and benefits package (in addition to the base salary), including:
- Annual bonus opportunity
- Insurance coverage (medical, dental, vision, life, and disability)
- Paid time off
- Paid holidays
- Company contribution to the RRSP (Registered Retirement Savings Plan)
- Equity awards for certain positions and levels
- Remote and/or hybrid work available depending on the position
All compensation and benefits are subject to the terms and conditions of the underlying plans or programs, as applicable, and may be amended, terminated, or replaced from time to time.
Cerence Inc. (Nasdaq: CRNC and www.cerence.com) is the global industry leader in creating unique, moving experiences for the automotive world. Spun out from Nuance in October 2019, Cerence is a new, independent company that has quickly gained traction as a leader in the automotive voice assistant space, working with all of the world's leading automakers - from Ford and Fiat Chrysler to Daimler, Audi and BMW to Geely and SAIC - to transform how a car feels, responds and learns. Its track record is built on more than 20 years of industry experience and leadership and more than 500 million cars on the road today across more than 70 languages.
As Cerence looks to the future and continues an ambitious growth agenda, we need someone to join the team and help build the future of voice and AI in cars. This is an exciting opportunity to join Cerence's passionate, dedicated, global team and be a part of meaningful innovation in a rapidly growing industry.
EQUAL OPPORTUNITY EMPLOYER
Cerence is firmly committed to Equal Employment Opportunity (EEO) and to compliance with all federal, state and local laws that prohibit employment discrimination on the basis of age, race, color, gender, gender identity, gender expression, sex, sex stereotyping, pregnancy, national origin, ancestry, religion, physical or mental disability, medical condition, marital status, citizenship status, sexual orientation, protected military or veteran status, genetic information and other protected classifications.
All prospective and current employees need to remain vigilant when it comes to executing security policies in the workplace. This includes:
- Following workplace security protocols and completing training programs to become familiar with ways to maintain a safe workplace.
- Following security procedures to report any suspicious activity.
- Respecting corporate security procedures so that those procedures can be effective.
- Adhering to the company's compliance requirements and regulations.
- Supporting a zero-tolerance policy toward workplace violence.
- Maintaining basic knowledge of information security and data privacy requirements (e.g., how to protect data and how to handle it).
- Demonstrating knowledge of information security through internal training programs.