{"id":1064,"date":"2019-07-09T10:51:20","date_gmt":"2019-07-09T10:51:20","guid":{"rendered":"https:\/\/www.testpreptraining.com\/tutorial\/?page_id=1064"},"modified":"2020-05-01T09:56:36","modified_gmt":"2020-05-01T09:56:36","slug":"aws-collection-system","status":"publish","type":"page","link":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/","title":{"rendered":"Select a Collection System that handles the frequency of data change and type of data being ingested"},"content":{"rendered":"\n<p>Data ingestion and synchronization into a big data\nenvironment is harder than most people think.&nbsp;\nLoading large volumes of data at high speed and managing the incremental\ningestion and synchronization of data at scale into an on-premises or cloud data\nlake can present significant technical challenges.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Amazon Kinesis Firehose<\/h2>\n\n\n\n<p>Amazon Kinesis Firehose is a fully managed service for\ndelivering real-time streaming data directly to Amazon S3. Kinesis Firehose\nautomatically scales to match the volume and throughput of streaming data, and\nrequires no ongoing administration. Kinesis Firehose can also be configured to\ntransform streaming data before it\u2019s stored in Amazon S3. Its transformation\ncapabilities include compression, encryption, data batching, and Lambda\nfunctions.<\/p>\n\n\n\n<p>Kinesis Firehose can compress data before it\u2019s stored\nin Amazon S3. It currently supports GZIP, ZIP, and SNAPPY compression formats.\nGZIP is the preferred format because it can be used by Amazon Athena, Amazon\nEMR, and Amazon Redshift. Kinesis Firehose encryption supports Amazon S3\nserver-side encryption with AWS Key Management Service (AWS KMS) for encrypting\ndelivered data in Amazon S3. You can choose not to encrypt the data or to\nencrypt with a key from the list of AWS KMS keys that you own (see the section\nEncryption with AWS KMS). 
Kinesis Firehose can concatenate multiple incoming\nrecords, and then deliver them to Amazon S3 as a single S3 object. This is an\nimportant capability because it reduces both Amazon S3 transaction costs and\nthe transactions-per-second load.<\/p>\n\n\n\n<p>Finally, Kinesis Firehose can invoke Lambda functions to transform incoming source data and deliver it to Amazon S3. Common transformations include converting Apache log and syslog formats to standardized JSON or CSV. The resulting JSON and CSV data can then be queried directly using Amazon Athena. If you use a Lambda data transformation, you can optionally back up the raw source data to another S3 bucket.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"575\" height=\"251\" src=\"https:\/\/www.testpreptraining.ai\/tutorial\/wp-content\/uploads\/2019\/07\/select-a-collection-system-that-handles-the-frequency-of-data-change-and-type-of-data-being-ingested.png\" alt=\"Select an AWS collection system that handles the frequency of data change and type of data being ingested\" class=\"wp-image-1182\"\/><\/figure><\/div>\n\n\n\n<h2 class=\"wp-block-heading\">AWS Snowball<\/h2>\n\n\n\n<p>You can use AWS Snowball to securely and efficiently\nmigrate bulk data from on-premises storage platforms and Hadoop clusters to S3\nbuckets. After you create a job in the AWS Management Console, a Snowball\nappliance will be automatically shipped to you. After the Snowball arrives,\nconnect it to your local network, install the Snowball client on your\non-premises data source, and then use the Snowball client to select and\ntransfer the file directories to the Snowball device. The Snowball client uses\n256-bit AES encryption. Encryption keys are never shipped with the Snowball\ndevice, so the data transfer process is highly secure. 
After the data transfer\nis complete, the Snowball\u2019s E Ink shipping label will automatically update.\nShip the device back to AWS. When AWS receives the device, your data is transferred\nfrom the Snowball device to your S3 bucket and stored as S3 objects in their\noriginal, native format. Snowball also has an HDFS client, so data may be\nmigrated directly from Hadoop clusters into an S3 bucket in its native format.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">AWS Storage Gateway<\/h2>\n\n\n\n<p>AWS Storage Gateway can be used to integrate legacy on-premises data processing platforms with an Amazon S3-based data lake. The File Gateway configuration of Storage Gateway offers on-premises devices and applications a network file share via an NFS connection. Files written to this mount point are converted to objects stored in Amazon S3 in their original format without any proprietary modification. This means that you can easily integrate applications and platforms that don\u2019t have native Amazon S3 capabilities (such as on-premises lab equipment, mainframe computers, databases, and data warehouses) with S3 buckets, and then use tools such as Amazon EMR or Amazon Athena to process this data.<\/p>\n\n\n\n<p>\nLink for free practice test &#8211; <a href=\"https:\/\/www.testpreptraining.ai\/aws-certified-big-data-specialty-free-practice-test\">https:\/\/www.testpreptraining.ai\/aws-certified-big-data-specialty-free-practice-test<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Data ingestion and synchronization into a big data environment is harder than most people think.&nbsp; Loading large volumes of data at high speed and managing the incremental ingestion and synchronization of data at scale into an on-premises or cloud data lake can present significant technical challenges. 
Amazon Kinesis Firehose Amazon Kinesis Firehose is a&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":1031,"menu_order":3,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"footnotes":""},"categories":[2],"tags":[],"class_list":["post-1064","page","type-page","status-publish","hentry","category-amazon-aws"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Select a Collection System that handles the frequency of data change and type of data being ingested - Testprep Training Tutorials<\/title>\n<meta name=\"description\" content=\"select a AWS collection system that handles the frequency of data change and type of data being ingested tutorial, brief notes, dumps, summary and pdf.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Select a Collection System that handles the frequency of data change and type of data being ingested - Testprep Training Tutorials\" \/>\n<meta property=\"og:description\" content=\"select a AWS collection system that handles the frequency of data change and type of data being ingested tutorial, brief notes, dumps, summary and pdf.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/\" \/>\n<meta property=\"og:site_name\" content=\"Testprep Training Tutorials\" \/>\n<meta property=\"article:modified_time\" content=\"2020-05-01T09:56:36+00:00\" \/>\n<meta property=\"og:image\" 
content=\"https:\/\/www.testpreptraining.ai\/tutorial\/wp-content\/uploads\/2019\/07\/select-a-collection-system-that-handles-the-frequency-of-data-change-and-type-of-data-being-ingested.png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/\",\"url\":\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/\",\"name\":\"Select a Collection System that handles the frequency of data change and type of data being ingested - Testprep Training Tutorials\",\"isPartOf\":{\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#website\"},\"datePublished\":\"2019-07-09T10:51:20+00:00\",\"dateModified\":\"2020-05-01T09:56:36+00:00\",\"description\":\"select a AWS collection system that handles the frequency of data change and type of data being ingested tutorial, brief notes, dumps, summary and pdf.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.testpreptraining.ai\/tutorial\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AWS Certified Big Data 
Specialty\",\"item\":\"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Select a Collection System that handles the frequency of data change and type of data being ingested\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#website\",\"url\":\"https:\/\/www.testpreptraining.ai\/tutorial\/\",\"name\":\"Testprep Training Tutorials\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.testpreptraining.ai\/tutorial\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#organization\",\"name\":\"Testprep Training\",\"url\":\"https:\/\/www.testpreptraining.ai\/tutorial\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.testpreptraining.com\/tutorial\/wp-content\/uploads\/2020\/07\/tpt-logo-6.png\",\"contentUrl\":\"https:\/\/www.testpreptraining.com\/tutorial\/wp-content\/uploads\/2020\/07\/tpt-logo-6.png\",\"width\":583,\"height\":153,\"caption\":\"Testprep Training\"},\"image\":{\"@id\":\"https:\/\/www.testpreptraining.ai\/tutorial\/#\/schema\/logo\/image\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Select a Collection System that handles the frequency of data change and type of data being ingested - Testprep Training Tutorials","description":"select a AWS collection system that handles the frequency of data change and type of data being ingested tutorial, brief notes, dumps, summary and pdf.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/","og_locale":"en_US","og_type":"article","og_title":"Select a Collection System that handles the frequency of data change and type of data being ingested - Testprep Training Tutorials","og_description":"select a AWS collection system that handles the frequency of data change and type of data being ingested tutorial, brief notes, dumps, summary and pdf.","og_url":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/","og_site_name":"Testprep Training Tutorials","article_modified_time":"2020-05-01T09:56:36+00:00","og_image":[{"url":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-content\/uploads\/2019\/07\/select-a-collection-system-that-handles-the-frequency-of-data-change-and-type-of-data-being-ingested.png"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. 
reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/","url":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/","name":"Select a Collection System that handles the frequency of data change and type of data being ingested - Testprep Training Tutorials","isPartOf":{"@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#website"},"datePublished":"2019-07-09T10:51:20+00:00","dateModified":"2020-05-01T09:56:36+00:00","description":"select a AWS collection system that handles the frequency of data change and type of data being ingested tutorial, brief notes, dumps, summary and pdf.","breadcrumb":{"@id":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/aws-collection-system\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.testpreptraining.ai\/tutorial\/"},{"@type":"ListItem","position":2,"name":"AWS Certified Big Data Specialty","item":"https:\/\/www.testpreptraining.ai\/tutorial\/aws-certified-big-data-specialty\/"},{"@type":"ListItem","position":3,"name":"Select a Collection System that handles the frequency of data change and type of data being ingested"}]},{"@type":"WebSite","@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#website","url":"https:\/\/www.testpreptraining.ai\/tutorial\/","name":"Testprep Training 
Tutorials","description":"","publisher":{"@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.testpreptraining.ai\/tutorial\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#organization","name":"Testprep Training","url":"https:\/\/www.testpreptraining.ai\/tutorial\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#\/schema\/logo\/image\/","url":"https:\/\/www.testpreptraining.com\/tutorial\/wp-content\/uploads\/2020\/07\/tpt-logo-6.png","contentUrl":"https:\/\/www.testpreptraining.com\/tutorial\/wp-content\/uploads\/2020\/07\/tpt-logo-6.png","width":583,"height":153,"caption":"Testprep Training"},"image":{"@id":"https:\/\/www.testpreptraining.ai\/tutorial\/#\/schema\/logo\/image\/"}}]}},"_links":{"self":[{"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/pages\/1064","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/comments?post=1064"}],"version-history":[{"count":6,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/pages\/1064\/revisions"}],"predecessor-version":[{"id":5067,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/pages\/1064\/revisions\/5067"}],"up":[{"embeddable":true,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/pages\/1031"}],"wp:attachment":[{"href":"https:\/\/www.testpreptraining.ai\/tutorial\/w
p-json\/wp\/v2\/media?parent=1064"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/categories?post=1064"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.testpreptraining.ai\/tutorial\/wp-json\/wp\/v2\/tags?post=1064"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}