Just another day in Paradise?

The International Consortium of Investigative Journalists (ICIJ) is using Talend Data Fabric as part of its Paradise Papers investigation into a leak of 13.4 million encrypted documents from two offshore tax havens and 19 secret jurisdictions that protect the financial dealings of the world’s political players and corporate giants.

ICIJ used Talend to load more than 1.4 TB of unstructured data into a Neo4j graph database, with the Linkurious graph visualization platform used to organize and access the information. The data includes emails and Excel, CSV, and PDF documents containing text and images about companies and people who use a hidden system built to avoid paying tax. ICIJ also used other open source tools to support its “Knowledge Center” and make the information searchable by reporters.
 
“Talend is our preferred solution when it comes to cleaning, transforming, and integrating the data we receive. It works as a crucial mechanism for enabling us to build a robust database,” said Pierre Romera, CTO at ICIJ. “Working with open source tools like Talend ensures security and reliability of data as our extensive network of investigative journalists review terabytes of files. Backed by an extensive community of contributors, open source solutions enable us to benefit from the latest innovations in data processing, extraction, and visualization.”
 
 
“Moving to the cloud was obvious due to the nature of our mission and the large volume of data we process. Cloud technology offers the scalability we need when we need it, so we can easily manage our workload. With a robust power for processing and security, AWS was the most suitable choice for us,” explained Pierre.
 
The 13.4 million tell-tale documents were obtained by the German newspaper Süddeutsche Zeitung, which received data from two offshore services firms in countries ranging from Bermuda to Singapore, as well as from 19 corporate registries around the world. For about a year, ICIJ worked with hundreds of journalists and media partners to expose this new lead, which has had a significant impact on well-known individuals and large organizations.
“Since ICIJ revealed the Panama Papers leak in 2016 for which they won the Pulitzer Prize, we have seen how much data management and processing technologies can impact our society,” said Ciaran Dynes, SVP of Products, Talend. “We are pleased to support in-depth investigative journalism and those seeking meaningful insights from data.”
 