Data Lake Analytics

An on-demand analytics job service to power intelligent action

Data Lake Analytics is the first cloud analytics service in which you can easily develop and run massively parallel data transformation and processing programs in U-SQL, R, Python, and .NET over petabytes of data. With no infrastructure to manage, you can process data on demand, scale instantly, and pay only per job.

Start in seconds, scale instantly, pay per job

Our on-demand service will have you processing big data jobs in seconds. There is no infrastructure to worry about because there are no servers, virtual machines, or clusters to wait for, manage, or tune. You can instantly scale the analytics units (processing power) from one to thousands for each job. You pay only for the processing used per job.

Develop massively parallel programs with simplicity

U-SQL is a simple, expressive, and extensible language that allows you to write code once and have it automatically parallelized for the scale you need. You can process petabytes of data for diverse workload categories such as querying, ETL, analytics, machine learning, machine translation, image processing, and sentiment analysis by leveraging existing libraries written in .NET languages, R, or Python. For example, watch the U-SQL video where we detect the type of objects in one million images using a built-in U-SQL cognitive library.

Debug and optimize your big data programs with ease

Debugging failures in cloud distributed programs is now as easy as debugging a program in your personal environment. Our execution environment actively analyzes your programs as they run and offers recommendations to improve performance and reduce cost. For example, if you requested 1,000 analytics units (AUs) for your program and only 50 AUs were needed, the system would recommend that you use only 50 AUs, resulting in a 20x cost saving.
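
The 20x figure follows from simple arithmetic if a job's cost is assumed to scale linearly with the number of AUs it reserves and the time it runs. The sketch below illustrates that assumption only. The function names and the price value are hypothetical and are not part of any Data Lake Analytics API or price list.

```python
def job_cost(analytics_units: int, hours: float, price_per_au_hour: float) -> float:
    """Cost of one job under an ASSUMED linear model: AUs x hours x unit price."""
    if analytics_units < 1:
        raise ValueError("a job needs at least one analytics unit")
    return analytics_units * hours * price_per_au_hour


def savings_factor(requested_aus: int, needed_aus: int) -> float:
    """How many times cheaper the job is when only the needed AUs are reserved.

    Assumes the job takes the same time either way, because the extra AUs
    were idle and did not speed it up.
    """
    if needed_aus < 1 or requested_aus < needed_aus:
        raise ValueError("expected 1 <= needed_aus <= requested_aus")
    return requested_aus / needed_aus


# The example from the text: 1,000 AUs requested, 50 AUs actually needed.
HYPOTHETICAL_PRICE = 2.0  # placeholder unit price, not a real rate
over_provisioned = job_cost(1000, hours=1.0, price_per_au_hour=HYPOTHETICAL_PRICE)
right_sized = job_cost(50, hours=1.0, price_per_au_hour=HYPOTHETICAL_PRICE)
print(f"Savings factor: {savings_factor(1000, 50):.0f}x")
print(f"Cost ratio:     {over_provisioned / right_sized:.0f}x")
```

Under this model the unit price cancels out of the ratio, so the saving depends only on how many of the requested AUs were actually used.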

Virtualize your analytics

Act on all your data with optimized data virtualization of your relational sources, such as Azure SQL Database and Azure SQL Data Warehouse. Queries are automatically optimized by moving processing close to the source data rather than moving the data, which maximizes performance and minimizes latency.

Enterprise-grade security, auditing, and support

Extend your on-premises security and governance controls to the cloud to meet your security and regulatory compliance needs. Capabilities such as single sign-on (SSO), multi-factor authentication, and seamless management of millions of identities are built in through Azure Active Directory. Role-based access control and the ability to audit all processing and management operations are on by default. We guarantee a 99.9% enterprise-grade SLA and 24/7 support for your big data solution.

Related products and services

Data Lake Store

Hyperscale repository for big data analytics workloads

HDInsight

Provision cloud Hadoop, Spark, R Server, HBase, and Storm clusters

Get started with Data Lake Analytics