Customer-obsessed science
-
June 13, 2024: The fight against hallucination in retrieval-augmented-generation models starts with a method for accurately assessing it.
-
June 13, 2024: As in other areas of AI, generative models and foundation models — such as vision-language models — are a hot topic.
-
June 07, 2024: Although work involving large language models predominates, classical and more-general techniques remain well represented.
-
June 16 - 21, 2024
-
June 17 - 21, 2024
-
July 14 - 18, 2024
-
February 15, 2024: In addition to its practical implications, recent work on “meaning representations” could shed light on some old philosophical questions.
-
April 16, 2024: First model to work across a wide range of products uses a second U-Net encoder to capture fine-grained product details.
-
March 18, 2024: Tokenizing time series data and treating it like a language enables a model whose zero-shot performance matches or exceeds that of purpose-built models.
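A minimal sketch of what "tokenizing a time series" can mean: scale the series, then quantize each value into a fixed vocabulary of bins so a language model can consume it as a token sequence. The mean-absolute scaling, bin range, and vocabulary size here are illustrative assumptions, not details taken from the work above.

```python
import numpy as np

def tokenize_series(series, num_bins=100, low=-3.0, high=3.0):
    """Scale a series by its mean absolute value, then quantize each
    scaled value into one of `num_bins` discrete tokens."""
    scale = np.abs(series).mean() or 1.0        # avoid division by zero
    scaled = series / scale
    # uniform bin edges over [low, high]; out-of-range values are clipped
    edges = np.linspace(low, high, num_bins - 1)
    tokens = np.digitize(np.clip(scaled, low, high), edges)
    return tokens, scale

tokens, scale = tokenize_series(np.array([1.0, 2.0, 4.0, 8.0]))
```

The returned `scale` is kept so that tokens predicted by the model can be mapped back to real values at inference time.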
-
February 20, 2024: Generative AI supports the creation, at scale, of complex, realistic driving scenarios that can be directed to specific locations and environments.
-
January 17, 2024: Representing facts using knowledge triplets rather than natural language enables finer-grained judgments.
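To make the triplet idea concrete: a fact stated as free text becomes a (subject, relation, object) tuple, and a claim can then be judged element by element rather than sentence by sentence. The fact, relation name, and comparison function below are hypothetical illustrations, not the paper's actual schema.

```python
# A fact as free text vs. as a (subject, relation, object) triplet.
fact_text = "Marie Curie was born in Warsaw."
fact_triplet = ("Marie Curie", "born_in", "Warsaw")

def triplet_match(claim, reference):
    """Compare a claimed triplet to a reference element-wise,
    yielding a finer-grained verdict than whole-sentence matching."""
    return tuple(c == r for c, r in zip(claim, reference))

# Subject and relation agree; only the object is wrong.
verdict = triplet_match(("Marie Curie", "born_in", "Paris"), fact_triplet)
```

A sentence-level check would simply call the whole claim false; the triplet comparison localizes the error to the object slot.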
-
ACL 2024: The development of large language models (LLMs) has shown progress on reasoning, though studies have largely considered either English or simple reasoning tasks. To address this, we introduce a multilingual structured reasoning and explanation dataset, termed xSTREET, that covers four tasks across six languages. xSTREET exposes a gap in base LLM performance between English and non-English reasoning tasks.
-
ACL 2024: Retrieval is a widely adopted approach for improving language models by leveraging external information. As the field moves toward multimodal large language models, it is important to extend pure text-based methods to incorporate other modalities in retrieval as well, for applications across the wide spectrum of machine learning tasks and data types. In this work, we propose multi-modal retrieval with
-
ACL 2024: Current knowledge-editing approaches struggle to effectively propagate updates to interconnected facts. In this work, we delve into the barriers that hinder the appropriate propagation of updated knowledge within these models for accurate reasoning. To support our analysis, we introduce a novel reasoning-based benchmark, ReCoE (Reasoning-Based Counterfactual Editing dataset), which covers six common reasoning
-
Exploring ordinality in text classification: A comparative study of explicit and implicit techniques. ACL 2024: Ordinal Classification (OC) is a widely encountered challenge in Natural Language Processing (NLP), with applications in various domains such as sentiment analysis, rating prediction, and more. Previous approaches to tackle OC have primarily focused on modifying existing or creating novel loss functions that explicitly account for the ordinal nature of labels. However, with the advent of Pretrained Language
-
2024: In our study, we present bifurcated attention, a method developed for language model inference in single-context batch sampling settings. This approach aims to reduce redundant memory IO costs, a significant factor in latency at high batch sizes and long context lengths. Bifurcated attention achieves this by dividing the attention mechanism during incremental decoding into two distinct GEMM operations,
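A minimal NumPy sketch of the split described above, under the assumption that every sample in the batch shares one context KV cache while holding its own incrementally decoded KV. The function name, shapes, and softmax merge are illustrative; the abstract specifies only that attention is divided into two GEMMs.

```python
import numpy as np

def bifurcated_attention(q, k_ctx, v_ctx, k_inc, v_inc):
    """Single-context batch sampling: the batch shares one context cache
    (k_ctx, v_ctx) but each sample has its own decoded KV (k_inc, v_inc).
    Splitting attention into two matmuls lets the shared context be read
    from memory once, rather than once per sample."""
    d = q.shape[-1]
    scale = 1.0 / np.sqrt(d)
    # GEMM 1: all queries against the single shared-context cache
    logits_ctx = (q @ k_ctx.T) * scale                       # [b, ctx]
    # GEMM 2: each query against its own decoded tokens (batched)
    logits_inc = np.einsum('bd,btd->bt', q, k_inc) * scale   # [b, t]
    # merge with one softmax over both segments
    logits = np.concatenate([logits_ctx, logits_inc], axis=-1)
    w = np.exp(logits - logits.max(-1, keepdims=True))
    w /= w.sum(-1, keepdims=True)
    n_ctx = k_ctx.shape[0]
    return w[:, :n_ctx] @ v_ctx + np.einsum('bt,btd->bd', w[:, n_ctx:], v_inc)
```

The result is numerically identical to standard attention over a context cache replicated per sample; only the memory-access pattern changes.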
Resources
-
We look for talent from around the world: applied scientists, data scientists, economists, research scientists, scholars, academics, PhDs, and interns.
-
We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.
-
Supporting research at academic institutions and non-profit organizations in areas that align with our mission to advance customer-obsessed science.