Section 5: Manage data

Introduction

Effective test data management, which includes data mining, data generation, and data maintenance, is critical for quality engineering. Test data serves a variety of purposes. Not only should it be available on-demand in our environments, but it should also be associated with each possible test and reflect the interdependence of numerous systems, as well as be combined for specific test cases.

Given the current regulatory requirements for test data management, it is critical to have a standardized mechanism for self-service, virtualization, synthetic data generation, and masking production data, as well as demand management, governance, and metrics for measuring and monitoring the health of testing activities.

So how do numerous organizations manage their test data?

Generally, it is a lengthy process. Following the creation of a copy of production data with multiple subsets, sensitive information is temporarily masked, and the data is copied to various testing environments.

This strategy presents two distinct challenges.

To begin, test data impairs the data's quality. Compliance concerns arise as a result of the fact that production data is not masked. Production data are subject to some variation. When data reaches quality assurance, it is often insufficient, invalid, or out of date.

Second, test data impairs the speed and agility of testing. Quality assurance teams rely on an upstream team. Testers are forced to sift through unwieldy copies of production data to locate the required combinations. When a test is modified, the scripts and queries associated with it become unusable. As a result of data loss, cannibalization, or consumption, there is a dearth of data reuse.

This section discusses how artificial intelligence can help mitigate the inherent risks associated with test data management by leveraging the concept of synthetic test data to ensure adequate coverage of test data, to comply with privacy regulations prohibiting the use of production data in test environments, and to shorten the time required to create representative test data for end-to-end testing. These machine learning-powered platforms can generate any type of data while retaining its characteristics and relationships. Data is anonymized in accordance with applicable laws.

In this section

Chapter 1: The rise of synthetic data

Chapter 2: An overview of synthetic data generation methods

Chapter 3: Coverage requires a command of test data

Chapter 4: Use case: a synthetic data platform for quality engineering

We respect your privacy

We use cookies to improve your experience on our website. They help us to improve site performance, present you relevant advertising and enable you to share content in social media.

You may accept all cookies, or choose to manage them individually. You can change your settings at any time by clicking Cookie Settings available in the footer of every page.

For more information related to the cookies, please visit our cookie policy.

Cookies	Description
Registered visitor cookie	Cookie given to each registered user.
Registered visitor functionality cookie	Cookies used to remember the unique identifier given to each registered user.
Social plug-in content sharing cookie	Cookies set by services such as Facebook Connect or Twitter Button, which allow social networks users to share the content of our websites on social networks.
Unregistered visitor cookie	Cookies used to give to unregistered users a unique identifier in order to recognize them and to analyze how they use the website.
Analytic cookie	Cookies used to store URLs of the previous page visited, enabling to track users navigating from inside or from outside the website. If you click on a Sogeti advertisement on a non-Sogeti website, a cookie may be used to log which website you are on, in order to ensure our advertisements are served effectively and to measure whether our advertisements are viewed. Google Analytics: cookies set by Google analytics are used for web analytical purpose, but are not used to track individual users. For further information on how Google Analytics collects and uses information on our behalf and the right to use such cookies, please refer to the Google Analytics products and services privacy statement. If you object to your Personal Data being collected by Google Analytics, you may download and install the Google Analytics Opt-out Browser Add-on. Pardot: cookies set by Pardot are used to track users on our website. Visits are tracked for known users only. Unknown users are recorded as anonymous users. Please refer to Pardot privacy policy for any further information on their use and your rights related to the use of such cookies.

Section 5: Manage data

Download the "Section 5: Manage data" as a PDF

Use the site navigation to visit other sections and download further PDF content

In this section