Exploratory data analysis, feature engineering, and operationalizing your data flow into your ML pipeline with Amazon SageMaker Data Wrangler

By Dustin Ward

Amazon Web Services FeedExploratory data analysis, feature engineering, and operationalizing your data flow into your ML pipeline with Amazon SageMaker Data Wrangler According to The State of Data Science 2020 survey, data management, exploratory data analysis (EDA), feature selection, and feature engineering accounts for more than 66% of a data scientist’s time (see the following…

Amazon QuickSight now supports Amazon Elasticsearch Service, and adds new box plot and filled map visuals

By Dustin Ward

Amazon Web Services FeedAmazon QuickSight now supports Amazon Elasticsearch Service, and adds new box plot and filled map visuals Amazon QuickSight dashboards can now visualize data from Amazon Elasticsearch Service. Amazon Elasticsearch Service is a fully managed service that makes it easy for you to deploy, secure, and run Elasticsearch cost effectively at scale. Authors…

Identifying training bottlenecks and system resource under-utilization with Amazon SageMaker Debugger

By Dustin Ward

Amazon Web Services FeedIdentifying training bottlenecks and system resource under-utilization with Amazon SageMaker Debugger At AWS re:Invent 2020, AWS released the profiling functionality for Amazon SageMaker Debugger. In this post, we expand on the importance of profiling deep neural network (DNN) training, review some of the common performance bottlenecks you might encounter, and demonstrate how…

Working with timestamp with time zone in your Amazon S3-based data lake

By Dustin Ward

Amazon Web Services FeedWorking with timestamp with time zone in your Amazon S3-based data lake With a data lake built on Amazon Simple Storage Service (Amazon S3), you can use the purpose-built analytics services for a range of use cases, from analyzing petabyte-scale datasets to querying the metadata of a single object. AWS analytics services…

Making sense of your health data with Amazon HealthLake

By Dustin Ward

Amazon Web Services FeedMaking sense of your health data with Amazon HealthLake We’re excited to announce Amazon HealthLake, a new HIPAA-eligible service for healthcare providers, health insurance companies, and pharmaceutical companies to securely store, transform, query, analyze, and share health data in the cloud, at petabyte scale. HealthLake uses machine learning (ML) models trained to…