Research Intern - Training-Time Provenance (Data Dignity)
Microsoft
Hybrid 🏡🏢 21 November
Data Science & AI/ML
California, United States 🇺🇸 Washington, United States 🇺🇸
Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.
Training-time provenance is a research effort on estimating the influence of specific training data on outputs of large language models (LLMs). Current neural network architectures are opaque in terms of providing sources for their generations, and there are at least two good reasons to change this:
- “X-ray” into intent, so that we can detect bad human actors or dangerous AI activity by identifying the most influential source documents related to a given model output. For instance, sneaky prompts might invoke articles about bomb making that could evade guardrails otherwise. This will be a deeper method of countering this type of danger than others currently in use.
- “Data dignity”, meaning incentives, recognition, and potentially pay for people who contribute certain valuable data to unforeseen kinds of models we will want in the future, assuming the future will surprise us fundamentally. The goal is to foster new classes of creative professionals where possible, instead of relying solely on ideas like Universal Basic Income in the event of a future with very high-functioning large models.
We are attempting to demonstrate that LLMs can be trained in such a way that influence of specific training data on generated outputs can be efficiently and usefully estimated. You can read more about “Data dignity” in the article: There is no A.I. (The New Yorker).
Qualifications
Required Qualifications
- Currently enrolled in a PhD program in Computer Science or a related STEM field. Exceptional candidates enrolled in a master’s program might also be considered.
- At have at least 2 years of research experience, including peer-reviewed publications, researching a topic closely related to the above description, such as natural language processing, deep learning, generative models, approximation methods, etc.
Other Requirements
- Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
- In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter.
Preferred Qualifications
- Demonstrated ability to develop original research agendas.
- Ability to collaborate effectively with other researchers and product development teams.
- Experience in training large AI models.
- Experience in approximation methods for deep learning systems.
- Proficient interpersonal skills, cross-group, and cross-culture collaboration.
- Ability to think unconventionally to derive creative and innovative solutions.
Applied Sciences IC2 : The base pay range for this internship is USD $5,460 -$10,680 per month. There is a different range applicable to specific work locations, with the San Francisco Bay area and New York City Metropolitan area, and the base pay range for this role in those locations is USD $7,040 -$11,640 per month.Applied Sciences IC3 : The base pay range for this internship is USD $6,550 -$12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13, 920 per month.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-intern-pay
Microsoft accepts applications and processes offers for these roles on an ongoing basis.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Responsibilities
Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.
For this Research Internship (summer 2025), we are seeking PhD students with a passion for fundamental Deep Learning research, particularly those with experience in training LLMs and other large AI models. The Research Intern's responsibilities will include (1) training small language models with novel schemes preserving provenance of data, (2) experimenting with these models to test their performance and reliability.
Similar jobs
Research Intern - Machine Learning
Hybrid 🏡🏢 13 November
Data Science & AI/ML
Washington, United States 🇺🇸
Image and Data Processing Libraries Intern
Remote 🌎🌍🌏 17 October
Data Science & AI/ML
Warsaw, Poland 🇵🇱 Remote, Poland 🇵🇱
Arch Solution Intern, HPC and AI Project
On-site 🏢 03 September
Data Science & AI/ML
Beijing, China 🇨🇳 Shanghai, China 🇨🇳
Research Intern, Deep Learning and Artificial Intelligence - 2025
On-site 🏢 14 October
Data Science & AI/ML
Taipei, Taiwan 🇹🇼 Hsinchu, Taiwan 🇹🇼
Research Intern - AI Frontiers - Foundation Model Evaluation and Understanding
Hybrid 🏡🏢 07 November
Data Science & AI/ML
Washington, United States 🇺🇸
2025 Software Development Engineer - Machine Learning (m/w/d)
On-site 🏢 01 November
Data Science & AI/ML
Berlin, Germany 🇩🇪
Research Intern - Data Systems
Hybrid 🏡🏢 28 October
Data Science & AI/ML
Washington, United States 🇺🇸
Research Intern - Transforming Research with AI
Hybrid 🏡🏢 04 November
Data Science & AI/ML
Washington, United States 🇺🇸
Business Analytics Intern - 2025 Internship, Shenzhen, China
On-site 🏢 21 October
Data Science & AI/ML
Shenzhen, China 🇨🇳
2025 Applied Scientist Internship, Amazon University Talent Acquisition
On-site 🏢 01 November
Data Science & AI/ML
Amsterdam, Netherlands 🇳🇱
Research Intern - Conversational AI
Hybrid 🏡🏢 05 November
Data Science & AI/ML
Washington, United States 🇺🇸 New York, United States 🇺🇸 Maryland, United States 🇺🇸 California, United States 🇺🇸
NVIDIA 2025 Internships: Deep Learning Computer Architecture
Remote 🌎🌍🌏 28 August
Data Science & AI/ML
California, United States 🇺🇸 Remote, United States 🇺🇸
PhD Research Intern, AI for Climate and Weather Simulation - Summer 2025
On-site 🏢 25 November
Data Science & AI/ML
California, United States 🇺🇸 Washington, United States 🇺🇸
Research Intern - Computational Social Science
Hybrid 🏡🏢 08 November
Data Science & AI/ML
New York, United States 🇺🇸
Machine Learning Software Engineer L4/L5, Algorithms
Remote 🌎🌍🌏 25 October
Data Science & AI/ML
USA - Remote
DFT Intern
On-site 🏢 07 October
Data Science & AI/ML
Shanghai, China 🇨🇳
Solutions Architect Intern, Canadian AI Partners – Summer 2025
Remote 🌎🌍🌏 25 November
Data Science & AI/ML
Remote, Canada 🇨🇦 Toronto, Canada 🇨🇦
Research Scientist, Human Behavioral Modeling and Machine Learning/AI - New College Grad 2025
Remote 🌎🌍🌏 26 November
Data Science & AI/ML
California, United States 🇺🇸 Texas, United States 🇺🇸
Research Intern - Efficient AI
Hybrid 🏡🏢 01 November
Data Science & AI/ML
Washington, United States 🇺🇸
Research Scientist, Computer Vision Perception and Rendering - New College Grad 2025
On-site 🏢 29 October
Data Science & AI/ML
California, United States 🇺🇸
2025 Business Intelligence Engineer Internship
On-site 🏢 09 October
Data Science & AI/ML
Istanbul, Türkiye 🇹🇷
2025 Applied Scientist Internship, Amazon University Talent Acquisition
On-site 🏢 18 October
Data Science & AI/ML
Cape Town, South Africa 🇿🇦
Data Science Intern - Global Industries
On-site 🏢 05 November
Data Science & AI/ML
United States 🇺🇸
Software Dev Engineer Intern - Machine Learning Chip Architect
On-site 🏢 03 October
Data Science & AI/ML
Cupertino, United States 🇺🇸
Deep Learning Engineering Intern - 2025
On-site 🏢 19 November
Data Science & AI/ML
Shenzhen, China 🇨🇳 Shanghai, China 🇨🇳
Data Analyst Intern - Portugal
On-site 🏢 13 November
Data Science & AI/ML
Oeiras, Portugal 🇵🇹
Research Intern - Machine Learning and Optimization - Redmond
Hybrid 🏡🏢 18 October
Data Science & AI/ML
Washington, United States 🇺🇸
2025 Working Student Internship - Business Intelligence Engineer
On-site 🏢 27 November
Data Science & AI/ML
Berlin, Germany 🇩🇪 Munich, Germany 🇩🇪
Product Machine Learning Research Leader
Remote 🌎🌍🌏 15 August
Data Science & AI/ML
USA - Remote
2025 Applied Science Internship - Reinforcement Learning & Optimization (Machine Learning) - United States, PhD Student Science Recruiting
On-site 🏢 02 October
Data Science & AI/ML
Seattle, United States 🇺🇸
Data Engineer: Intern Opportunities for University Students, Redmond
Hybrid 🏡🏢 21 November
Data Science & AI/ML
Washington, United States 🇺🇸
Data Science: PhD Internship Opportunities - Mountain View
Hybrid 🏡🏢 28 October
Data Science & AI/ML
California, United States 🇺🇸
Research Intern - AI-Driven System Design and Optimization
Hybrid 🏡🏢 31 October
Data Science & AI/ML
Washington, United States 🇺🇸
Data Science: PhD Internship Opportunities - Redmond
Hybrid 🏡🏢 23 August
Data Science & AI/ML
Washington, United States 🇺🇸
Research Intern - AI-Driven System Design and Optimization
Hybrid 🏡🏢 07 October
Data Science & AI/ML
British Columbia, Canada 🇨🇦
Research Intern - Algorithms Group: Differentially Private Synthetic Data
Hybrid 🏡🏢 28 October
Data Science & AI/ML
Washington, United States 🇺🇸
Cloud Support Associate - Artificial Intelligence and Machine Learning
On-site 🏢 12 November
Data Science & AI/ML
Taipei, Taiwan 🇹🇼
Data Engineer Summer Internship – 2025 (US)
On-site 🏢 27 August
Data Science & AI/ML
Seattle, United States 🇺🇸
Research Intern - Spatial AI
Hybrid 🏡🏢 25 October
Data Science & AI/ML
Washington, United States 🇺🇸
Data Scientist Co-Op – Construction & Engineering Global Industry Unit
On-site 🏢 24 October
Data Science & AI/ML
United States 🇺🇸
AI Algorithms Software Engineer (RDSS Intern)
On-site 🏢 23 October
Data Science & AI/ML
Hsinchu, Taiwan 🇹🇼 Taipei, Taiwan 🇹🇼
PhD Research Intern, Generative AI - 2025
Remote 🌎🌍🌏 21 October
Data Science & AI/ML
California, United States 🇺🇸 Remote, United States 🇺🇸
Research Scientist, Computer Vision and AI - New College Grad 2025
On-site 🏢 01 November
Data Science & AI/ML
California, United States 🇺🇸
Artificial Intelligence Research Intern - Deep Learning
On-site 🏢 11 June
Data Science & AI/ML
Taipei, Taiwan 🇹🇼 Hsinchu, Taiwan 🇹🇼
2025 Applied Science Intern (Machine Learning, Recommender Systems), Amazon International Machine Learning
On-site 🏢 03 October
Data Science & AI/ML
Adelaide, Australia 🇦🇺 Brisbane, Australia 🇦🇺 Sydney, Australia 🇦🇺 Melbourne, Australia 🇦🇺
Data Science Internship Opportunities
Hybrid 🏡🏢 03 November
Data Science & AI/ML
Ireland 🇮🇪
Cambridge Machine Intelligence Intern
Hybrid 🏡🏢 15 November
Data Science & AI/ML
Cambridgeshire, United Kingdom 🇬🇧
Research Intern - Bioinformatics
Hybrid 🏡🏢 19 November
Data Science & AI/ML
Washington, United States 🇺🇸 Massachusetts, United States 🇺🇸
Software Development Engineer Intern 2025, AI/ML
On-site 🏢 25 September
Data Science & AI/ML
Austin, United States 🇺🇸 Sunnyvale, United States 🇺🇸 Seattle, United States 🇺🇸 Redmond, United States 🇺🇸 Bellevue, United States 🇺🇸 Irvine, United States 🇺🇸 Cambridge, United States 🇺🇸
Data Analyst Intern/Co-Op (Undergrad | Winter, 2025 | Onsite/Hybrid)
Remote 🌎🌍🌏 06 November
Data Science & AI/ML
Ontario, Canada 🇨🇦
Business Intelligence Engineer Intern , Amazon University Talent Acquisition
On-site 🏢 15 November
Data Science & AI/ML
Sao Paulo, Brazil 🇧🇷
Dense Reconstruction Intern - 2025
On-site 🏢 28 October
Data Science & AI/ML
Shanghai, China 🇨🇳
Deep Learning Intern - 2025
On-site 🏢 23 October
Data Science & AI/ML
Shanghai, China 🇨🇳
Research Intern - Applied AI for Science
Hybrid 🏡🏢 13 November
Data Science & AI/ML
Washington, United States 🇺🇸
AI Computing Software Engineering Intern, TensorRT
On-site 🏢 29 October
Data Science & AI/ML
Taipei, Taiwan 🇹🇼 Hsinchu, Taiwan 🇹🇼
2025 Applied Science Internship - Information & Knowledge Management (Machine Learning) - United States, PhD Student Science Recruiting
On-site 🏢 02 October
Data Science & AI/ML
Seattle, United States 🇺🇸
NVIDIA 2025 Internships: Artificial Intelligence and Deep Learning
Remote 🌎🌍🌏 28 August
Data Science & AI/ML
California, United States 🇺🇸 Remote, United States 🇺🇸
Deep Learning Compiler Intern - 2025
On-site 🏢 07 November
Data Science & AI/ML
Zurich, Switzerland 🇨🇭
Data Science: PhD Internship Opportunities -
Remote 🌎🌍🌏 23 August
Data Science & AI/ML
United States 🇺🇸
Cambridge Internship Opportunities – Multimodal Deep Learning for Healthcare – Microsoft Research
Hybrid 🏡🏢 06 November
Data Science & AI/ML
Cambridgeshire, United Kingdom 🇬🇧
Stagiaire en recherche - apprentissage automatique | Research Intern - Machine Learning
Hybrid 🏡🏢 29 November
Data Science & AI/ML
Québec, Canada 🇨🇦
2025 Applied Science Internship - Recommender Systems/ Information Retrieval (Machine Learning) - United States, PhD Student Science Recruiting
On-site 🏢 02 October
Data Science & AI/ML
Seattle, United States 🇺🇸
2025 Business Intelligence Engineer Internship
On-site 🏢 09 September
Data Science & AI/ML
Barcelona, Spain 🇪🇸 Madrid, Spain 🇪🇸
Neuron Compiler Software Engineer Intern, Annapurna ML
On-site 🏢 10 September
Data Science & AI/ML
Toronto, Canada 🇨🇦
Estágio em Suporte e Coleta de Dados
On-site 🏢 21 October
Data Science & AI/ML
Campinas, Brazil 🇧🇷
2025 Business Intelligence Engineer Internship
On-site 🏢 09 September
Data Science & AI/ML
Clichy, France 🇫🇷
Machine Learning / Artificial Intelligence Intern/Co-Op (Undergrad | Summer 2025 | Onsite/Hybrid)
On-site 🏢 12 September
Data Science & AI/ML
Ontario, Canada 🇨🇦
Data Analyst Intern/Co-Op (Undergrad | Summer, 2025 | Onsite/Hybrid)
Remote 🌎🌍🌏 24 September
Data Science & AI/ML
Ontario, Canada 🇨🇦
Machine Learning Intern - Spring or Summer 2025
On-site 🏢 29 October
Data Science & AI/ML
California, United States 🇺🇸
Research Intern - Agent Systems for AI Infrastructure
Hybrid 🏡🏢 07 October
Data Science & AI/ML
British Columbia, Canada 🇨🇦
2025 Business Intelligence Engineer Internship
On-site 🏢 09 September
Data Science & AI/ML
Milan, Italy 🇮🇹
Research Intern - AI for Domains
Hybrid 🏡🏢 25 October
Data Science & AI/ML
Washington, United States 🇺🇸
2025 Data Science Internship - United States, PhD or Masters Student
On-site 🏢 02 October
Data Science & AI/ML
Seattle, United States 🇺🇸
2025 Applied Science Internship - Natural Language Processing and Speech Technologies - United States, PhD Student Science Recruiting
On-site 🏢 02 October
Data Science & AI/ML
Seattle, United States 🇺🇸
Research Intern - AI Frontiers - Agentic AI Models & Synthetic Data Generation
Hybrid 🏡🏢 14 November
Data Science & AI/ML
Washington, United States 🇺🇸 New York, United States 🇺🇸
2025 Applied Science Intern (Computer Vision), Amazon International Machine Learning
On-site 🏢 14 November
Data Science & AI/ML
Canberra, Australia 🇦🇺 Adelaide, Australia 🇦🇺 Brisbane, Australia 🇦🇺 Perth, Australia 🇦🇺 Sydney, Australia 🇦🇺 Melbourne, Australia 🇦🇺
Research Intern - Machine Learning for Biology and Healthcare
Hybrid 🏡🏢 23 October
Data Science & AI/ML
Massachusetts, United States 🇺🇸
Research Intern - AI Mediated Sensemaking
Hybrid 🏡🏢 19 November
Data Science & AI/ML
Washington, United States 🇺🇸
Research Scientist, Fundamental Generative AI - New College Grad 2025
On-site 🏢 31 October
Data Science & AI/ML
California, United States 🇺🇸
Research Intern - Agent Systems for AI Infrastructure
Hybrid 🏡🏢 07 November
Data Science & AI/ML
Washington, United States 🇺🇸
2025 Data Engineer Internship
On-site 🏢 09 September
Data Science & AI/ML
Luxembourg 🇱🇺
Software Dev Engineer - Machine Learning Apps, Accelerator, Annapurna ML
On-site 🏢 27 September
Data Science & AI/ML
Seattle, United States 🇺🇸 Cupertino, United States 🇺🇸
Machine Learning Engineer Intern - 2025
On-site 🏢 22 November
Data Science & AI/ML
Shanghai, China 🇨🇳 Beijing, China 🇨🇳
Research Intern - Reliable and Safe AI Agents
Hybrid 🏡🏢 23 October
Data Science & AI/ML
Washington, United States 🇺🇸
Data Science Intern, Generative AI Infrastructure (Meraki)
On-site 🏢 31 October
Data Science & AI/ML
California, United States 🇺🇸
Data Science: Internship Opportunities - Redmond
Hybrid 🏡🏢 23 August
Data Science & AI/ML
Washington, United States 🇺🇸
Research Intern - AI, Machine Learning, Statistics
Hybrid 🏡🏢 30 October
Data Science & AI/ML
Massachusetts, United States 🇺🇸 New York, United States 🇺🇸
Research Intern - Deep Learning Group
Hybrid 🏡🏢 25 November
Data Science & AI/ML
Washington, United States 🇺🇸
Corporate Innovation Support (INTERNSHIP)
Hybrid 🏡🏢 04 September
Data Science & AI/ML
Milano, Italy 🇮🇹
Research Intern - AI (Agentic Systems & Interaction)
Hybrid 🏡🏢 26 November
Data Science & AI/ML
Washington, United States 🇺🇸 New York, United States 🇺🇸
Product Analytics Intern - Oracle Internship Program
On-site 🏢 21 November
Data Science & AI/ML
Singapore 🇸🇬
2025 Software Development Engineer Intern - Machine Learning (m/w/d)
On-site 🏢 01 November
Data Science & AI/ML
Berlin, Germany 🇩🇪
AI Computing Architect Intern - 2025
On-site 🏢 13 November
Data Science & AI/ML
Shanghai, China 🇨🇳
Data Engineer Co-Op – Construction & Engineering Global Industry Unit
On-site 🏢 24 October
Data Science & AI/ML
United States 🇺🇸
Research Scientist, Efficient Deep Learning - New College Grad 2025
On-site 🏢 28 October
Data Science & AI/ML
California, United States 🇺🇸
PhD Research Intern, AI for Climate & Weather Simulation - Summer 2025
On-site 🏢 25 November
Data Science & AI/ML
Helsinki, Finland 🇫🇮 Roskilde, Denmark 🇩🇰 Munich, Germany 🇩🇪
Product Analyst Intern, AI Infrastructure - 2025
On-site 🏢 08 November
Data Science & AI/ML
Shanghai, China 🇨🇳
Software Engineer - Analytics Cloud
On-site 🏢 28 August
Data Science & AI/ML
United States 🇺🇸
Business Intelligence Engineer Summer Internship – 2025 (US)
On-site 🏢 27 August
Data Science & AI/ML
Seattle, United States 🇺🇸
Research Intern - FATE NYC (Fairness, Accountability, Transparency, and Ethics in AI)
Hybrid 🏡🏢 14 November
Data Science & AI/ML
New York, United States 🇺🇸
Data Science Intern - Oracle Internship Program
On-site 🏢 20 November
Data Science & AI/ML
Singapore 🇸🇬
Research Intern - Artificial Specialized Intelligence
Hybrid 🏡🏢 07 October
Data Science & AI/ML
British Columbia, Canada 🇨🇦
Research Intern - Generative AI
Hybrid 🏡🏢 18 October
Data Science & AI/ML
Washington, United States 🇺🇸
Research Scientist, ML Systems - New College Grad 2025
Remote 🌎🌍🌏 17 October
Data Science & AI/ML
California, United States 🇺🇸 Massachusetts, United States 🇺🇸 Texas, United States 🇺🇸 Washington, United States 🇺🇸
Research Intern - Artificial Intelligence and Machine Learning
Hybrid 🏡🏢 11 November
Data Science & AI/ML
New York, United States 🇺🇸 Massachusetts, United States 🇺🇸
AI Algorithms SW Engineer (RDSS Intern)
On-site 🏢 25 November
Data Science & AI/ML
Hsinchu, Taiwan 🇹🇼 Taipei, Taiwan 🇹🇼
Research Scientist, Deep Learning and Computer Vision (New College Graduate)
On-site 🏢 16 October
Data Science & AI/ML
Taipei, Taiwan 🇹🇼 Hsinchu, Taiwan 🇹🇼
Research Intern - AI Reasoning
Hybrid 🏡🏢 28 November
Data Science & AI/ML
Washington, United States 🇺🇸
Research Intern - Artificial Specialized Intelligence
Hybrid 🏡🏢 07 November
Data Science & AI/ML
Washington, United States 🇺🇸
Machine Learning Engineer Intern, Summer 2025
On-site 🏢 06 November
Data Science & AI/ML
California, United States 🇺🇸
Applied Scientist, Shanghai AI Lab
On-site 🏢 20 November
Data Science & AI/ML
Shanghai, China 🇨🇳
Share this job, spread the word!
Similar jobs
Research Intern - Machine Learning
Hybrid 🏡🏢 13 November
Data Science & AI/ML
Washington, United States 🇺🇸
Image and Data Processing Libraries Intern
Remote 🌎🌍🌏 17 October
Data Science & AI/ML
Warsaw, Poland 🇵🇱 Remote, Poland 🇵🇱
Arch Solution Intern, HPC and AI Project
On-site 🏢 03 September
Data Science & AI/ML
Beijing, China 🇨🇳 Shanghai, China 🇨🇳