Unlock the Future of Intelligent Productivity with AI-Powered Innovation Job Description: We are seeking a talented Data Scientist to join our team and help shape the future of intelligent productivity tools. As a key member of our team, you will work closely with software engineers, product managers, and researchers to design, implement, and evaluate models and algorithms that continuously adapt and improve based on user interactions and feedback. You will have the opportunity to leverage advanced machine learning and information retrieval techniques to predict and optimize memory content selection, representation, and lifecycle management. Your work will directly impact Copilot, enabling personalized, context-aware experiences that empower millions of users across Microsoft 365. About the Team: The Substrate Core Substrate team powers the infrastructure that underpins Microsoft 365's most critical services and Copilot. We are a high-impact, forward-looking team focused on building intelligent, scalable, and cost-efficient platforms that enable Microsoft to deliver world-class productivity experiences to billions of users. Our mission is to drive operational excellence and innovation across the M365 fleet by leveraging AI, automation, and deep platform integration. You will be part of a culture of continuous learning, experimentation, and innovation, where you will have the opportunity to grow your skills and career rapidly. Key Responsibilities: - Develop and evaluate ML models using prepared datasets, customer feedback, and novel training/fine-tuning algorithms for language models. - Write production-quality code, apply debugging best practices, and stay current with industry trends. - Drive customer-centric solutions by aligning with business goals and managing stakeholder expectations. - Collaborate cross-functionally to define success metrics and improve AI quality at scale. - Lead research projects that yield new algorithms, tools, or insights solving open problems. - Analyze evaluation outputs to identify gaps in coverage, quality, and usability. Requirements: To succeed in this role, you will need: - Bachelor's degree in Statistics, Econometrics, Computer Science, Electrical/Computer Engineering, or related field. - 4+ years of experience in predictive analytics, statistics, or research. - Experience with synthetic data generation and data management for evaluation/training. - At least one year of experience publishing patents or peer-reviewed papers. - Deep motivation for user-centric AI and interest in human cognition, memory, and AI. Preferred Qualifications: We are also looking for candidates with: - Master's degree in Statistics, Econometrics, Computer Science, Electrical/Computer Engineering, or related field. - Experience with large-scale embedding models and transformer architectures. - Familiarity with reinforcement learning and distributed computing platforms (e.g., Heron, AML, Euclid). - Proficient analytical skills and experience with telemetry and performance metrics. - DevOps experience and cloud services knowledge (Azure preferred). Interpersonal Skills: We value collaboration, creativity, and strong communication skills. If you are a self-motivated and curious individual who thrives in a fast-paced environment, we encourage you to apply. We are an equal opportunity employer and welcome applications from diverse backgrounds. Please note that only qualified candidates will be contacted for an interview.