Bringing NLG into Software Development

In this short chapter, we demonstrate a sample of software engineering cases that can benefit from Natural Language Generation (NLG). This is a fledgling technology with numerous potential uses, one that we are just beginning to study.

In May 2020, OpenAI unveiled GPT-3, a 175-billion-parameter language model that was then the world's largest neural network. GPT-3 was trained on a very large share of the text publicly available on the Internet and outperformed state-of-the-art models on a range of natural language processing (NLP) tasks, including translation, question answering, and cloze tests. Its main breakthrough is eliminating the need for task-specific fine-tuning. One of the tasks GPT-3 is very good at is text generation, which in our case means code generation.

GPT-3 can be used without any task-specific training or examples (zero-shot learning), but its already excellent performance improves further when the prompt includes one example (one-shot learning) or a handful of examples (few-shot learning).

After establishing the language task, we compile a small collection of handpicked samples. Typical machine learning requires a large dataset that must then be annotated. Language models, however, have a constrained input window: when using the OpenAI API to GPT-3, the text prompt and the generated completion together must fit within 2048 tokens (about 1500 words). Within this constraint, we must communicate very effectively what we want from the model (the 'prompt'), guided by the new craft of 'prompt engineering'. This requires only a very small collection of examples, in contrast to standard machine learning, where data preparation takes between 60% and 80% of the total project time.
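The budget constraint described above can be sketched in a few lines of Python. This is a minimal illustration, not the solution used in our projects: the function names, the Q/A layout, and the word-based token estimate (derived only from the "2048 tokens, about 1500 words" ratio) are all assumptions, and a real implementation would count tokens with the model's actual tokenizer.

```python
from typing import List, Tuple

CONTEXT_LIMIT = 2048           # tokens shared by prompt and completion (from the text)
TOKENS_PER_WORD = 2048 / 1500  # rough ratio implied by "2048 tokens, about 1500 words"


def estimate_tokens(text: str) -> int:
    """Crude estimate from a whitespace word count (assumption, not a real tokenizer)."""
    return int(len(text.split()) * TOKENS_PER_WORD) + 1


def build_few_shot_prompt(
    instruction: str,
    examples: List[Tuple[str, str]],
    query: str,
    max_completion_tokens: int = 256,
) -> str:
    """Add handpicked examples in order until the token budget would be exceeded."""
    budget = CONTEXT_LIMIT - max_completion_tokens
    tail = f"Q: {query}\nA:"
    parts = [instruction]
    used = estimate_tokens(instruction) + estimate_tokens(tail)
    for question, answer in examples:
        block = f"Q: {question}\nA: {answer}"
        cost = estimate_tokens(block)
        if used + cost > budget:
            break  # stop before overflowing the input window
        parts.append(block)
        used += cost
    parts.append(tail)
    return "\n\n".join(parts)


# Hypothetical handpicked samples for a "request to Python" task.
examples = [
    ("Add 2 and 3", "print(2 + 3)"),
    ("Reverse the string s", "print(s[::-1])"),
]
prompt = build_few_shot_prompt(
    "Translate each request into Python.", examples, "Square the number n"
)
```

The prompt ends with an open "A:" so that the model's completion is the answer to the final question. Examples that do not fit are dropped rather than truncated, since a half example can mislead the model.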

The GPT-3 API finds the pattern in the prompt that best fits our purpose. In the abstract, it applies a kind of 'fuzzy matching' against a very large model: it seeks out the closest match to the prompt, and its behavior can be influenced by several settings.
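One such setting exposed by the OpenAI completion API is the sampling temperature. The sketch below shows the general idea of temperature scaling on made-up scores for three candidate tokens; it is a toy illustration of the principle, not a description of GPT-3's internals, and the function name and scores are assumptions.

```python
import math
from typing import List


def softmax_with_temperature(scores: List[float], temperature: float) -> List[float]:
    """Turn raw scores into probabilities; a lower temperature sharpens the distribution."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


toy_scores = [2.0, 1.0, 0.5]  # made-up scores for three candidate tokens
focused = softmax_with_temperature(toy_scores, 0.2)  # nearly always picks the top candidate
diverse = softmax_with_temperature(toy_scores, 2.0)  # spreads probability across candidates
```

For code generation, a low temperature is usually the safer starting point, because the closest match is wanted rather than a creative variation.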

We combine Natural Language Understanding (NLU) with Natural Language Generation (NLG) in two domains. Although we are generating structured code, this still falls under the umbrella of NLG.

Use case: Test case generation

The figure below illustrates how GPT-3 works with SQL. The format is question and answer: we ask GPT-3 to perform an action and it determines the best match. The critical engineering lies in the prompt design, which selects the appropriate examples to include in the request to GPT-3. The actual solution, which we cannot demonstrate owing to IP protection, is similar: users upload test cases and Python code is generated.

In our situation, we provide test cases as input and receive code as output.
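As a rough illustration of this flow, the sketch below renders an uploaded test case as the question half of a question-and-answer prompt and trims the model's reply to a single answer. It is not the protected solution: the `UploadedCase` structure, the field names, and the "\nQ:" stop marker are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class UploadedCase:
    """Hypothetical shape of a test case uploaded by a user."""
    title: str
    steps: List[str]
    expected: str


def render_question(case: UploadedCase) -> str:
    """Format one test case as the open question at the end of a Q/A prompt."""
    steps = "\n".join(f"{i}. {step}" for i, step in enumerate(case.steps, start=1))
    return (
        "Q: Write Python code for this test case.\n"
        f"Title: {case.title}\n"
        f"Steps:\n{steps}\n"
        f"Expected: {case.expected}\n"
        "A:"
    )


def trim_completion(completion: str, stop: str = "\nQ:") -> str:
    """Keep only the first answer if the model runs on into a new question."""
    return completion.split(stop, 1)[0].strip()


case = UploadedCase(
    title="Valid login",
    steps=["Open the login page", "Enter valid credentials", "Submit the form"],
    expected="The dashboard is shown",
)
question = render_question(case)
```

In practice, this question would be appended to a few handpicked test-case and code pairs in the same layout, so that the model sees the pattern before it sees the new case.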

Use case: Test coverage generation

Code coverage is one of the most critical aspects of agile projects, since it enables us to build CI/CD pipelines. While programmers enjoy writing code, they are not as enthusiastic about writing test cases. Typically, these code coverage test cases are written in the same language as the underlying software. GPT-3 is well suited to this type of test code generation, because its training data includes GitHub, among other sources. This significantly increases developer productivity, because all developers have to do is check that the test coverage code supplied by GPT-3 is accurate. The work turns into a debugging exercise rather than a code coverage programming task.
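The checking step can be partly automated by running the generated tests against the code they are meant to cover. The sketch below is a minimal example under stated assumptions: the function name is hypothetical, the "generated" test code is a hand-written stand-in for GPT-3 output, and `exec` is used only for brevity (generated code should be run in a sandbox, never in a trusted process).

```python
import unittest


def run_generated_tests(source_under_test: str, generated_test_code: str) -> unittest.TestResult:
    """Run generated unittest classes against the code under test in one shared namespace."""
    namespace: dict = {"__name__": "generated_tests", "unittest": unittest}
    # WARNING: exec runs arbitrary code; isolate this step in a sandbox in real use.
    exec(source_under_test, namespace)    # defines the functions being tested
    exec(generated_test_code, namespace)  # defines the TestCase subclasses
    loader = unittest.TestLoader()
    suite = unittest.TestSuite()
    for obj in list(namespace.values()):
        if (
            isinstance(obj, type)
            and issubclass(obj, unittest.TestCase)
            and obj is not unittest.TestCase
        ):
            suite.addTests(loader.loadTestsFromTestCase(obj))
    result = unittest.TestResult()
    suite.run(result)
    return result


code = "def add(a, b):\n    return a + b\n"
# Stand-in for model output: one correct test and one wrong expectation.
generated = (
    "class TestAdd(unittest.TestCase):\n"
    "    def test_add(self):\n"
    "        self.assertEqual(add(2, 3), 5)\n"
    "    def test_wrong(self):\n"
    "        self.assertEqual(add(2, 2), 5)\n"
)
result = run_generated_tests(code, generated)
```

A failing generated test does not say which side is wrong; the developer still has to decide whether the test or the code under test is at fault, which is the debugging exercise described above.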

Learnings from a few customer implementations

One of the most important lessons we've learned from customer implementations is how much templates vary from team to team. Because our Key-Value-Pair model must be trained on specific templates in order to perform well, obtaining sufficient training data (samples for each template type) is a significant difficulty, even more so when projects adopt newer templates.

When these GPT-3-based solutions are demonstrated to customers, they are occasionally oversold, or customers come to believe that the models do everything. This is not true. GPT-3 frequently generates deprecated code and makes deprecated library calls. Occasionally, it simply does not work properly and produces rubbish. Designing the appropriate prompt is a skill that must be acquired through extensive use of GPT-3 and experimentation with how to configure the model parameters. As of July 2021, GPT-3 is not integrated with any cloud systems, which poses security and integration difficulties.
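Some of these failures can be caught before a developer ever reads the output. The sketch below screens generated Python for syntax errors and for calls on a deny list, without executing anything. The function names are hypothetical and the deny list is purely illustrative; a real project would maintain its own list for the libraries it uses.

```python
import ast
from typing import List, Set

# Illustrative deny list (assumption); maintain a project-specific one in practice.
DEPRECATED_CALLS: Set[str] = {"imp.load_module", "os.popen2", "unittest.makeSuite"}


def dotted_name(node: ast.AST) -> str:
    """Rebuild 'a.b.c' from nested Attribute/Name nodes; return '' for anything else."""
    parts = []
    while isinstance(node, ast.Attribute):
        parts.append(node.attr)
        node = node.value
    if isinstance(node, ast.Name):
        parts.append(node.id)
        return ".".join(reversed(parts))
    return ""


def screen_generated_code(code: str) -> List[str]:
    """Return a list of problems found by static inspection only."""
    try:
        tree = ast.parse(code)
    except SyntaxError as err:
        return [f"syntax error on line {err.lineno}: {err.msg}"]
    problems = []
    for node in ast.walk(tree):
        if isinstance(node, ast.Call):
            name = dotted_name(node.func)
            if name in DEPRECATED_CALLS:
                problems.append(f"line {node.lineno}: deprecated call {name}")
    return problems


report = screen_generated_code("import imp\nmodule = imp.load_module('x')\n")
```

This check only matches calls written with the full dotted name, so it will miss aliased imports; it is a first filter, not a replacement for review.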

Newer models such as CuBERT (Code Understanding BERT) are constantly being developed, and it is critical to use them for NLP/NLU tasks to increase quality.

About the author

Rajeswaran Viswanathan

Rajeswaran Viswanathan is the head of the AI Center of Excellence in India. He has published many papers and articles. He is the author of proportion, a comprehensive R package for inference on single binomial proportion and Bayesian computations. It is mostly used for categorical data analysis. He has a passion for teaching and mentoring the next generation of data scientists.

About Capgemini

Capgemini is a global leader in partnering with companies to transform and manage their business by harnessing the power of technology. The Group is guided every day by its purpose of unleashing human energy through technology for an inclusive and sustainable future. It is a responsible and diverse organisation of 325,000 team members in nearly 50 countries. With its strong 55-year heritage and deep industry expertise, Capgemini is trusted by its clients to address the entire breadth of their business needs, from strategy and design to operations, fueled by the fast-evolving and innovative world of cloud, data, AI, connectivity, software, digital engineering and platforms. The Group reported in 2021 global revenues of €18 billion.

Get the Future You Want | www.capgemini.com

