Detecting sensitive data in DynamoDB with Macie

By Dustin Ward

Amazon Web Services Feed: Amazon Macie is a fully managed data security and data privacy service that uses machine learning and pattern matching to discover and protect your sensitive data in Amazon Web Services (AWS). It gives you the ability to automatically scan for sensitive data and get an…

Exploratory data analysis, feature engineering, and operationalizing your data flow into your ML pipeline with Amazon SageMaker Data Wrangler

By Dustin Ward

According to The State of Data Science 2020 survey, data management, exploratory data analysis (EDA), feature selection, and feature engineering account for more than 66% of a data scientist’s time (see the following…

Working with timestamp with time zone in your Amazon S3-based data lake

By Dustin Ward

With a data lake built on Amazon Simple Storage Service (Amazon S3), you can use the purpose-built analytics services for a range of use cases, from analyzing petabyte-scale datasets to querying the metadata of a single object. AWS analytics services…

Amazon QuickSight now supports Amazon Elasticsearch Service, and adds new box plot and filled map visuals

By Dustin Ward

Amazon QuickSight dashboards can now visualize data from Amazon Elasticsearch Service. Amazon Elasticsearch Service is a fully managed service that makes it easy for you to deploy, secure, and run Elasticsearch cost-effectively at scale. Authors…

Identifying training bottlenecks and system resource under-utilization with Amazon SageMaker Debugger

By Dustin Ward

At AWS re:Invent 2020, AWS released the profiling functionality for Amazon SageMaker Debugger. In this post, we expand on the importance of profiling deep neural network (DNN) training, review some of the common performance bottlenecks you might encounter, and demonstrate how…