Just another day in Paradise?

International Consortium of Investigative Journalists (ICIJ) is using Talend Data Fabric as part of its Paradise Papers investigation, a leak involving 13.4 million encrypted documents from two offshore tax havens and 19 secret jurisdictions protecting the financial dealings of the world’s political players and corporate giants.

  • 6 years ago Posted in
ICIJ used Talend to load more than 1.4 TB of unstructured data into Neo4j graph database, which leverages the Linkurious graph visualization platform to organize and access the information. The data includes emails, Excel, CSV and PDF documents with text and images about companies and people who are using a hidden system built for avoiding tax payment. ICIJ also used other open source tools to support their “Knowledge Center” and make the information searchable by reporters.
 
“Talend is our preferred solution when it comes to cleaning, transforming, and integrating the data we receive. It works as a crucial mechanism for enabling us to build a robust database,” said Pierre Romera, CTO at ICIJ. “Working with open source tools like Talend ensures security and reliability of data as our extensive network of investigative journalists review terabytes of files. Backed by an extensive community of contributors, open source solutions enable us to benefit from the latest innovations in data processing, extraction, and visualization.”
 
 
“Moving to the cloud was obvious due to the nature of our mission and the large volume of data we process. Cloud technology offers the scalability we need when we need it, so we can easily manage our workload. With a robust power for processing and security, AWS was the most suitable choice for us,” explained Pierre.
 
The 13.4M tell-tale documents were obtained by German newspaper S?ddeutsche Zeitung that received data from two offshore services firms in countries ranging from Bermuda to Singapore, as well as 19 corporate registries around the world. For about a year, ICIJ worked with hundreds of journalists and media partners on exposing this new lead, which has had a significant impact on well-known individuals and large organizations.
“Since ICIJ revealed the Panama Papers leak in 2016 for which they won the Pulitzer Prize, we have seen how much data management and processing technologies can impact our society,” said Ciaran Dynes, SVP of Products, Talend. “We are pleased to support in-depth investigative journalism and those seeking meaningful insights from data.”
 
The Ataccama Data Trust Report 2025 identifies poor data quality as a critical obstacle to AI...
The web intelligence industry has decisively turned to artificial intelligence as the main method...
Built for teams using Snowflake, Google BigQuery and Databricks, Analyst Studio unlocks and unifies...
Integrations with Cloudera, Dremio, and GoodData deliver advanced capabilities for utilising data...
New partnership enables Qlik to reach tens of thousands of partners through TD SYNNEX’s extensive...
Hammerspace and Cloudian have formed a partnership to deliver a cutting-edge solution for managing...
Beacon, NY, Dec 20, 2024– DocuWare unveils its AI-powered Intelligent Document Processing...
Hitachi Vantara survey finds data demands to triple by 2026, highlighting critical role of data...