Guidewire_Cloud_Data_Access_Data_Sheet
Guidewire_Cloud_Data_Access_Data_Sheet
Cloud Data Access is hosted in Guidewire Cloud on the Amazon Web Services (AWS)
platform. Each instance communicates with a single InsuranceSuite database that is also
deployed in Guidewire Cloud. Cloud Data Access executes an initial bulk load to capture
existing historical data in the InsuranceSuite database and loads this data into Amazon
S3. After this initial load, it captures incremental changes.
Guidewire Data Platform uses Apache Kafka to build a real-time data pipeline. It streams
data (initial and incremental) into a Kafka topic. Additionally, schema changes are
Guidewire Data Platform
automatically identified and marked with metadata as the data streams into Kafka.
Subsequently, Cloud Data Access automatically accounts for these changes.
© 2020 Guidewire Software, Inc. For more information about Guidewire’s trademarks,
visit http://guidewire.com/legal-notices. Document Published: 2020-07-21
Guidewire Cloud Data Access
Cloud Data Access then executes an Apache Spark job on an Amazon EMR cluster to in
an Amazon S3 bucket. Cloud Data Access guarantees that there are no duplicate records.
It also supports additive, nondestructive changes to schema by identifying the schema
change in the source database and assigning a unique schema fingerprint.
—Novarica For each table in the InsuranceSuite database, Cloud Data Access creates a table-specific
folder in S3. In each table’s folder, it groups data by the schema’s fingerprint. In each
fingerprint folder, it further groups data according to the time that it completed writing
data into S3. For each incremental fetch, Cloud Data Access writes newly extracted data
into new folders that are named with the time stamp when it completed writing this
new data into S3.
Cloud Data Access also creates and maintains a manifest.json file at the same level as
table-specific folders. This file contains metadata about the replication process to
understand the S3 content that can be safely consumed. The manifest is continuously
updated to reflect the successful delivery of new incremental data into an S3 folder.
About Guidewire Data Platform After Cloud Data Access has successfully written data into an S3 folder, customers can
download this data into their self-managed environments to support any downstream
The Guidewire Data Platform is a P&C data integration use cases, such as reporting and data warehousing. Customers can use
insurance–specific data repository and factory their own application to download the data or can leverage the Guidewire Cloud Data
that continuously collects data from internal Access Client reference implementation, which has been made available as on open-
and external sources and provides analytical source GitHub repository.
insights across the insurance lifecycle.
Guidwire configures and manages the data pipeline, and customers can access only the
contents of their own S3 bucket. Cloud Data Access secures the S3 bucket with Amazon
Identity and Access Management (IAM) roles and bucket policies along with Amazon Key
Management Service (KMS) to protect your data.
Guidewire Cloud Data Access gathers InsuranceSuite data at the right scale and tempo
to best position insurers in today’s competitive environment. It provides secure data
Guidewire is the platform P&C insurers trust to engage, innovate, and access at a low latency to augment your enterprise data strategy and extract new value.
grow efficiently. We combine digital, core, analytics, and AI to deliver our
platform as a cloud service. More than 380 insurers, from new ventures
to the largest and most complex in the world, run on Guidewire. © 2020 Guidewire Software, Inc. For more information about Guidewire’s trademarks,
For more information, contact us at [email protected]. visit http://guidewire.com/legal-notices. Document Published: 2020-07-21