What greenwashing is, and how we can use analytics to detect It


· 7 min read
Greenwashing is the practice of making misleading claims about the environmental benefits of a product or a service to communicate a false image of sustainability.
This act of embellishment or hiding falsehood has become a common challenge as companies seek the attention of environmentally conscious consumers.
In this article, we will delve into the world of greenwashing to explain its manifestations and how to use data analytics to detect and prevent these unethical practices.
I discovered Greenwashing when I conducted my first supply chain sustainability project.
As a Supply Chain Solution Manager, my task was to estimate the environmental footprint of the manufacturing and logistics operations of our customers.
And it was surprising to see the claims of some of their competitors considering that they were producing and selling similar products.
How can a company selling disposable plastic products can claim to be carbon neutral?
The objective of this article is to show you how analytics tools can help you to detect this kind of false claims.
Greenwashing is a portmanteau word of ‘green’ and ‘whitewashing’.
This dishonest practice is used by organizations to create a false impression of environmental responsibility.
The objective is to capitalize on the growing demand for eco-friendly products from customers and investors.
The most common forms of greenwashing include,
When you see an advertisement for a naturally sourced recycled t-shirt, consider:
With life cycle assessment (LCA), you have a data-driven method to evaluate these impacts considering the entire life cycle of the product and avoid this kind of trap.
The idea is to estimate the environmental impact of sourcing, producing and using a specific product or service.
This requires collecting and processing data from multiple sources using Business Intelligence tools.
Several high-profile cases have brought greenwashing into the spotlight.
The second one can be easily debunked using basic supply chain analytics and publicly available data.
Understanding the various forms and implications of greenwashing is crucial to implement proactive measures to tackle this problem.
While regulatory bodies and conscious consumers play a significant role in this fight, data analytics can be an additional boost to automate fraud detection.
The idea is to use…
In the following sections, we will explore how to use these tools to promote a more transparent and sustainable corporate world.
Identifying greenwashing is a complex task given the complexity of its manifestation and the overwhelming volume of information available.
Data analytics can provide powerful tools capable of filtering large datasets to identify patterns and anomalies to extract valuable insights.
In the following sections, we will explore how to use these solutions using an example of potential fraud.
A primary application of NLP in greenwashing detection is sentiment analysis.
Let us consider the example of major oil companies.
They regularly publish sustainability reports and press releases that highlight their commitment to environmental protection.
The data at our disposal consists of these pdf documents available on their websites.
A primary application of NLP in greenwashing detection is sentiment analysis.
Let us consider the example of major oil companies.
They regularly publish sustainability reports and press releases that highlight their commitment to environmental protection.
The data at our disposal consists of these pdf documents available on their websites.
💡 How to detect greenwashing?
If the statements carry overly positive sentiments not reflected in the actual environmental performance metrics, it could be a sign of greenwashing.
For instance, in the example above
There is a contradiction between the actual sustainability performance and the narrative sold in the report.
Change point analysis identifies the points in a data sequence where the statistical properties change.
For instance, a major automobile manufacturer reports a sudden decrease in its CO2 emissions.
The available data would include a time series of the company’s reported emissions and manufacturing outputs.
💡 How to detect greenwashing?
Change point analysis can detect if these reductions correspond to
The available data would include a time series of the company’s reported emissions and manufacturing outputs.
💡 How to detect greenwashing?
Change point analysis can detect if these reductions correspond to
I have used a CO2 emissions dummy data set and applied the Python library ruptures:

It has detected a major change in the 9th year, which we should investigate.
This is an initial assessment, the reduction may be due to the impact of actual initiatives.
And you can verify it in the detailed initiatives shared in the sustainability report.
💡 Check the code to get this visual,
Regression analysis can help establish relationships between different variables.
For instance, a major fashion brand reports sustainability expenditure (Euros) and waste production levels (Tons).
💡 How to detect greenwashing?
A regression model can identify whether increases in sustainability expenditure are leading to proportionate decreases in waste production.
If not, this could be an indication of greenwashing and this requires advanced investigations
Network analysis helps to understand relationships between entities in a network.
A company in the electronics sector might claim that its products are sourced from sustainable and ethical suppliers.
💡 How to detect greenwashing?
The data here would include the company’s supplier network and third-party reports on supplier practices.
Using network analysis, we can scrutinize the sustainability KPIs (ESG scores for instance) of the suppliers and the connections between them.
If there are nodes in the network with questionable sustainability practices, this could imply potential greenwashing.
As we look towards future ESG regulations, the connection between greenwashing and data analytics is set to take a significant turn.
As the awareness of sustainability among customers and investors is growing, corporations will find it risky to hide behind vague or misleading sustainability claims.
Therefore, greenwashing will face significant challenges in an increasingly data-driven world.
Instead of building false claims, companies can use advanced analytics to design and implement initiatives that will provide concrete results.
For instance, Sustainable Supply Chain Optimization is a data-driven approach combining cost reductions and footprint reduction.
Let’s imagine that your company is producing and selling items all around the world.
Where to locate your factories and distribution centers?
This is an optimization model considering,
What is the most sustainable (and economically viable ) combination?
This article is also published on the author's blog. illuminem Voices is a democratic space presenting the thoughts and opinions of leading Sustainability & Energy writers, their opinions do not necessarily represent those of illuminem.
Alveena Rizwan

Public Governance · Ethical Governance
Gokul Shekar

Energy Transition · Energy
Kevin Trenberth

Climate Change · Mitigation
Responsible Investor

ESG · Sustainable Finance
Sustainable Views

ESG · Sustainable Finance
Responsible Investor

Sustainable Finance · ESG