Applied Scientist - LLM, Alexa
Cambridge, UK
Description
Alexa is looking for an Applied Scientist with a strong background in Natural Language Processing (NLP) and Large Language Models (LLMs) to help build state-of-the-art conversational systems.
In this role, you will collaborate with a large team of scientists training the Large Language Models that power the Alexa stack, as well as software engineers serving them in production systems. You will own solutions end-to-end: from ideation and research through to production deployment, enabling conversational assistants to support external tools, leverage diverse sources of information, and deliver novel reasoning capabilities to millions of Alexa customers.
Key job responsibilities
As an Applied Scientist, you will develop innovative solutions to complex problems to extend the functionality of conversational assistants.
You will use your technical expertise to research and implement novel algorithms and modelling solutions in collaboration with other scientists and engineers.
You will analyze customer behaviors and define metrics that surface actionable insights and measure improvements in customer experience.
You will communicate results and insights to both technical and non-technical audiences through written reports, presentations and external publications.
You will be bi-modal across science and engineering: someone who combines strong scientific foundations with the execution skills to ship high-quality solutions.
A day in the life
As an Applied Scientist on the Alexa Science team, you'll drive innovation in evaluating new product experiences while discovering novel approaches to enhance model capabilities and enrich customer interactions. You'll collaborate with cross-functional teams of engineers and scientists to identify root causes of model and system integration issues, continuously improving the end-to-end customer experience.
You'll partner closely with scientists developing and fine-tuning large language models, engineers building low-latency inference infrastructure, and product teams defining customer experience metrics.
About the team
We are a team of applied scientists and engineers building the intelligence layer that powers Alexa+. Our work sits at the intersection of large language models, decision-making under uncertainty, and production ML systems. What we build directly shapes the customer experience: determining which models serve their requests, optimizing response latency, and creating natural, seamless interactions.
We're a collaborative team that values rigorous experimentation, clear communication, and delivering solutions that perform at scale in real-world environments.


