Customer-obsessed science
-
June 13, 2024: The fight against hallucination in retrieval-augmented generation models starts with a method for accurately assessing it.
-
June 13, 2024: As in other areas of AI, generative models and foundation models — such as vision-language models — are a hot topic.
-
June 07, 2024: Although work involving large language models predominates, classical and more-general techniques remain well represented.
-
-
February 15, 2024: In addition to its practical implications, recent work on “meaning representations” could shed light on some old philosophical questions.
-
April 16, 2024: First model to work across a wide range of products uses a second U-Net encoder to capture fine-grained product details.
-
March 18, 2024: Tokenizing time series data and treating it like a language enables a model whose zero-shot performance matches or exceeds that of purpose-built models.
-
February 20, 2024: Generative AI supports the creation, at scale, of complex, realistic driving scenarios that can be directed to specific locations and environments.
-
January 17, 2024: Representing facts using knowledge triplets rather than natural language enables finer-grained judgments.
-
Resources, Conservation and Recycling, 2024: The Circular Economy (CE) has been proposed as a strategy to promote the efficient use of resources, maximizing the benefits derived from materials and products through value recovery strategies, and minimizing waste generation. However, ambiguity remains in defining what makes a product circular and its characteristics when adapting the CE concept for application at the product level. More clarity about the…
-
FORC 2024: We study the problem of collecting a cohort or set that is balanced with respect to sensitive groups when group membership is unavailable or prohibited from use at deployment time. Specifically, our deployment-time collection mechanism does not reveal significantly more about the group membership of any individual sample than can be ascertained from base rates alone. To do this, we study a learner that…
-
2024: How do we transfer the relevant knowledge from ever larger foundation models into small, task-specific downstream models that can run at much lower costs? Standard transfer learning using pre-trained weights as the initialization transfers limited information and commits us to often massive pre-trained architectures. This procedure also precludes combining multiple pre-trained models that learn complementary…
-
2024: Deep learning-based Natural Language Processing (NLP) models are vulnerable to adversarial attacks, where small perturbations can cause a model to misclassify. Adversarial Training (AT) is often used to increase model robustness. However, we have discovered an intriguing phenomenon: deliberately or accidentally miscalibrating models masks gradients in a way that interferes with adversarial attack search…
-
ACL Findings 2024: Large language models (LLMs) have demonstrated remarkable open-domain capabilities. LLMs tailored for a domain are typically trained entirely on domain corpus to excel at handling domain-specific tasks. In this work, we explore an alternative strategy of continual pre-training as a means to develop domain-specific LLMs over an existing open-domain LLM. We introduce FinPythia-6.9B, developed through domain-adaptive…
Resources
-
We look for talent from around the world: applied scientists, data scientists, economists, research scientists, scholars, academics, PhDs, and interns.
-
We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities. Learn more about each program and how to apply below.
-
Supporting research at academic institutions and non-profit organizations in areas that align with our mission to advance customer-obsessed science.