AWS Glue

In this article we will delve into the fascinating world of AWS Glue, exploring its origins, its impact on society and its relevance today. From its first manifestations to its evolution over time, AWS Glue has left an indelible mark on various aspects of daily life. Along these lines, we will carefully analyze the different aspects that make AWS Glue a topic of universal interest, as well as the implications it has in different contexts and cultures. It doesn't matter if you are an expert on the subject or if you are just beginning to discover its importance, this article will give you a complete and enriching insight into AWS Glue.

AWS Glue
Developer(s)Amazon.com
Initial releaseAugust 2017 (2017-08)
Operating systemCross-platform
Available inEnglish
Websiteaws.amazon.com/glue Edit this on Wikidata

AWS Glue is an event-driven, serverless computing platform provided by Amazon as a part of Amazon Web Services. It is a computing service that runs code in response to events and automatically manages the computing resources required by that code. It was introduced in August 2017.

The primary purpose of Glue is to scan other services in the same Virtual Private Cloud (or equivalent accessible network element even if not provided by AWS), particularly S3. The jobs are billed according to compute time, with a minimum count of 1 minute. Glue discovers the source data to store associated meta-data (e.g. the table's schema of field names, types lengths) in the AWS Glue Data Catalog (which is then accessible via AWS console or APIs).

Languages supported

Scala and Python are officially supported as of 2020.

Catalog interrogation via API

The catalog can be read in AWS console (via browser) and via API divided into topics including:

  • Database API
  • Table API
  • Partition API
  • Connection API
  • User-Defined Function API
  • Importing an Athena Catalog to AWS Glue

See also

References

  1. ^ "Introducing AWS Glue: A Simple, Flexible, and Cost-Effective Extract, Transfer, and Load (ETL) Service".
  2. ^ "AWS Services List". ParkMyCloud. Retrieved October 6, 2020.
  3. ^ "AWS Glue: crawlers and use cases". 5 January 2022. Retrieved July 13, 2022.
  4. ^ "AWS Glue version 2.0 featuring 10x faster job start times and 1-minute minimum billing duration". AWS. August 10, 2020. Retrieved October 6, 2020.
  5. ^ "AWS Glue API Documentation". AWS. Retrieved October 6, 2020.
  6. ^ "AWS Glue Now Supports Scala in Addition to Python". AWS. January 12, 2018. Retrieved October 6, 2020.
  7. ^ "Catalog API". AWS. Retrieved October 8, 2020.

External links