Posted On: Jun 4, 2021

Starting today, Amazon EMR Studio is available in US East (Ohio, N. Virginia), US West (Oregon), Canada (Central), Europe (Ireland, Frankfurt, London, and Stockholm), and Asia Pacific (Mumbai, Seoul, Singapore, Sydney, and Tokyo) regions.

EMR Studio is an integrated development environment (IDE) that makes it easy for data scientists and data engineers to develop, visualize, and debug big data and analytics applications written in R, Python, Scala, and PySpark. EMR Studio provides fully managed Jupyter Notebooks, and tools like Spark UI and YARN Timeline Service to simplify debugging. EMR Studio uses AWS Single Sign-On and allows you to log in directly with your corporate credentials without logging into the AWS console.

To get started, as an administrator, you can either create and configure EMR Studios from the EMR console, or automate the Studio creation by specifying the configurations and dependencies in a CloudFormation template. You can use the AWS SSO console to enable AWS SSO, choose from supported identity providers including Okta, Azure AD, OneLogin, Ping Identity, and Microsoft AD, and use the EMR console to assign users and groups to EMR Studio.