{"id":1098,"date":"2019-07-09T11:21:29","date_gmt":"2019-07-09T11:21:29","guid":{"rendered":"https:\/\/www.testpreptraining.com\/tutorial\/?page_id=1098"},"modified":"2020-05-01T10:12:46","modified_gmt":"2020-05-01T10:12:46","slug":"determine-the-tools-and-techniques-required-for-analysis","status":"publish","type":"page","link":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/","title":{"rendered":"Determine the Tools and Techniques Required for Analysis"},"content":{"rendered":"\n<h4 class=\"wp-block-heading\"><strong>Amazon Athena<\/strong><\/h4>\n\n\n\n<ul class=\"wp-block-list\"><li>It is an interactive query service <\/li><li>Easily analyze data in S3 using standard SQL. <\/li><li>It is serverless<\/li><li>No infrastructure to manage<\/li><li>Pay only for the queries that you run.<\/li><\/ul>\n\n\n\n<p><strong>Running AWS Athena <\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Point to data in S3<\/li><li>De\ufb01ne the schema<\/li><li>start querying using standard SQL. <\/li><li>Most\nresults are delivered within seconds. <\/li><li>No\nneed for complex ETL jobs to prepare data for analysis. <\/li><li>Anyone with SQL skills can quickly analyze\nlarge-scale datasets.<\/li><\/ul>\n\n\n\n<p>Well integrated with AWS Glue Data Catalog, <\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>to create a unified metadata repository across various\nservices<\/li><li>crawl data sources to discover schemas <\/li><li>populate Catalog with new and modified table <\/li><li>maintain schema versioning<\/li><li>Can also use Glue\u2019s ETL capabilities.<\/li><\/ul>\n\n\n\n<p><strong>Amazon EMR<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>It is a managed Hadoop framework<\/li><li>Simplifies running big data frameworks &#8211; Apache\nHadoop, Apache Spark, HBase, Presto, and Flink&nbsp;\non AWS <\/li><li>Process and analyze vast amounts of data. <\/li><li>Uses Apache Hive and Apache Pig, to process data\nfor analytics and BI. <\/li><li>Use to transform and move large amounts of data\ninto and out of other AWS data stores and databases. <\/li><li>Can interact with data in other AWS data stores\nlike S3, DynamoDB.<\/li><\/ul>\n\n\n\n<p><strong>EMR Notebooks<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Is based on the popular Jupyter Notebook<\/li><li>provide a development and collaboration environment for ad hoc querying and exploratory analysis.<\/li><\/ul>\n\n\n\n<p><strong>Amazon CloudSearch<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>It is a managed service <\/li><li>To set up, manage, and scale a search solution\nfor website or application. <\/li><li>Supports 34 languages <\/li><li>Supported search features <ul><li>Highlighting<\/li><\/ul><ul><li>Autocomplete<\/li><\/ul><ul><li>geospatial\nsearch<\/li><\/ul><\/li><\/ul>\n\n\n\n<p><strong>Amazon Elasticsearch Service<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Used to deploy, secure, operate, and scale Elasticsearch <\/li><li>Elasticsearch is used to search, analyze, and visualize data in real-time. <\/li><li>Access APIs and real-time analytics capabilities <\/li><li>Useful for <ul><li>log analytics<\/li><\/ul><ul><li>full-text search<\/li><\/ul><ul><li>application monitoring<\/li><\/ul><ul><li>clickstream analytics<\/li><\/ul><\/li><li>Integrations with Kibana and Logstash<\/li><li>Integrates with other AWS services Amazon VPC, AWS KMS, Amazon Kinesis Data Firehose, AWS Lambda, AWS IAM, Amazon Cognito, and Amazon CloudWatch.<\/li><\/ul>\n\n\n\n<p><strong>Amazon Kinesis<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Used to collect, process, and analyze real-time,\nstreaming data <\/li><li>Easily get timely insights and react quickly to\nnew information. <\/li><li>Offers flexibility to choose tools. <\/li><li>Ingest real-time data such <\/li><li>Can process and analyze data as it arrives and\nrespond instantly instead of waiting<\/li><li>Currently o\ufb00ers four services<ul><li>Kinesis\nData Firehose<\/li><\/ul><ul><li>Kinesis\nData Analytics<\/li><\/ul><ul><li>Kinesis\nData Streams<\/li><\/ul><ul><li>Kinesis\nVideo Streams<\/li><\/ul><\/li><\/ul>\n\n\n\n<p><strong>Amazon Redshift<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>It is a fast, scalable data warehouse <\/li><li>Used to analyze all data across data warehouse\nand data lake. <\/li><li>Integrates with machine learning, parallel query\nexecution, and columnar storage. <\/li><li>Setup and deploy a new data warehouse in minutes<\/li><li>Run queries across petabytes in Redshift, and\nexabytes in data lake.<\/li><\/ul>\n\n\n\n<p><strong>Amazon QuickSight<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>It is a fast, cloud-powered business\nintelligence (BI) service <\/li><li>Used to deliver insights. <\/li><li>Create and publish interactive dashboards <\/li><li>Dashboards accessible from browsers or mobile\ndevices. <\/li><li>Embed dashboards into applications for\nself-service analytics<\/li><li>Easily scales without any software to install or\ninfrastructure to manage.<\/li><\/ul>\n\n\n\n<p><strong>AWS Data Pipeline<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>It is a web service <\/li><li>Used to reliably process and move data <\/li><li>Move between di\ufb00erent AWS services, on-premises\ndata sources, at speci\ufb01ed intervals. <\/li><li>Regularly access data where it\u2019s stored,\ntransform and process it at scale<\/li><li>Transfer the results to AWS services.<\/li><li>Easily create complex data processing workloads\nthat are <ul><li>fault\ntolerant<\/li><\/ul><ul><li>repeatable<\/li><\/ul><ul><li>highly\navailable<\/li><\/ul><\/li><\/ul>\n\n\n\n<p><strong>AWS Glue<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Fully managed ETL service <\/li><li>Easily prepare and load data for analytics. <\/li><li>Create and run an ETL job in AWS Management Console.\n<\/li><li>Point to data stored on AWS, data and associated\nmetadata is discovered in Glue Data Catalog. <\/li><li>Once cataloged, data is immediately searchable, queryable, and available for ETL. <\/li><\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Amazon Athena It is an interactive query service Easily analyze data in S3 using standard SQL. It is serverless No infrastructure to manage Pay only for the queries that you run. Running AWS Athena Point to data in S3 De\ufb01ne the schema start querying using standard SQL. Most results are delivered within seconds. No need&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":1031,"menu_order":16,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"footnotes":""},"categories":[2],"tags":[167,166,156,34],"class_list":["post-1098","page","type-page","status-publish","hentry","category-amazon-aws","tag-amazon-kinesis","tag-aws-data-pipeline","tag-aws-glue","tag-aws-redshift"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Determine the Tools and Techniques Required for Analysis - Testprep Training Tutorials<\/title>\n<meta name=\"description\" content=\"Determine the tools and techniques required for analysis tutorial, notes\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Determine the Tools and Techniques Required for Analysis - Testprep Training Tutorials\" \/>\n<meta property=\"og:description\" content=\"Determine the tools and techniques required for analysis tutorial, notes\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/\" \/>\n<meta property=\"og:site_name\" content=\"Testprep Training Tutorials\" \/>\n<meta property=\"article:modified_time\" content=\"2020-05-01T10:12:46+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/\",\"url\":\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/\",\"name\":\"Determine the Tools and Techniques Required for Analysis - Testprep Training Tutorials\",\"isPartOf\":{\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#website\"},\"datePublished\":\"2019-07-09T11:21:29+00:00\",\"dateModified\":\"2020-05-01T10:12:46+00:00\",\"description\":\"Determine the tools and techniques required for analysis tutorial, notes\",\"breadcrumb\":{\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.testpreptraining.ai\/tutorial\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AWS Certified Big Data Specialty\",\"item\":\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Determine the Tools and Techniques Required for Analysis\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#website\",\"url\":\"https:\/\/www.testpreptraining.ai\/tutorial\/\",\"name\":\"Testprep Training Tutorials\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.testpreptraining.ai\/tutorial\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#organization\",\"name\":\"Testprep Training\",\"url\":\"https:\/\/www.testpreptraining.ai\/tutorial\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.testpreptraining.com\/tutorial\/wp-content\/uploads\/2020\/07\/tpt-logo-6.png\",\"contentUrl\":\"https:\/\/www.testpreptraining.com\/tutorial\/wp-content\/uploads\/2020\/07\/tpt-logo-6.png\",\"width\":583,\"height\":153,\"caption\":\"Testprep Training\"},\"image\":{\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#\/schema\/logo\/image\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Determine the Tools and Techniques Required for Analysis - Testprep Training Tutorials","description":"Determine the tools and techniques required for analysis tutorial, notes","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/","og_locale":"en_US","og_type":"article","og_title":"Determine the Tools and Techniques Required for Analysis - Testprep Training Tutorials","og_description":"Determine the tools and techniques required for analysis tutorial, notes","og_url":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/","og_site_name":"Testprep Training Tutorials","article_modified_time":"2020-05-01T10:12:46+00:00","twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/","url":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/","name":"Determine the Tools and Techniques Required for Analysis - Testprep Training Tutorials","isPartOf":{"@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#website"},"datePublished":"2019-07-09T11:21:29+00:00","dateModified":"2020-05-01T10:12:46+00:00","description":"Determine the tools and techniques required for analysis tutorial, notes","breadcrumb":{"@id":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/determine-the-tools-and-techniques-required-for-analysis\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.testpreptraining.ai\/tutorial\/"},{"@type":"ListItem","position":2,"name":"AWS Certified Big Data Specialty","item":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/"},{"@type":"ListItem","position":3,"name":"Determine the Tools and Techniques Required for Analysis"}]},{"@type":"WebSite","@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#website","url":"https:\/\/www.testpreptraining.ai\/tutorial\/","name":"Testprep Training Tutorials","description":"","publisher":{"@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.testpreptraining.ai\/tutorial\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#organization","name":"Testprep Training","url":"https:\/\/www.testpreptraining.ai\/tutorial\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#\/schema\/logo\/image\/","url":"https:\/\/www.testpreptraining.com\/tutorial\/wp-content\/uploads\/2020\/07\/tpt-logo-6.png","contentUrl":"https:\/\/www.testpreptraining.com\/tutorial\/wp-content\/uploads\/2020\/07\/tpt-logo-6.png","width":583,"height":153,"caption":"Testprep Training"},"image":{"@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#\/schema\/logo\/image\/"}}]}},"_links":{"self":[{"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/pages\/1098","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/comments?post=1098"}],"version-history":[{"count":3,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/pages\/1098\/revisions"}],"predecessor-version":[{"id":1321,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/pages\/1098\/revisions\/1321"}],"up":[{"embeddable":true,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/pages\/1031"}],"wp:attachment":[{"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/media?parent=1098"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/categories?post=1098"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/tags?post=1098"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}