Spark

Catalogue techno Spark

Apache Spark is a fast and powerful data processing system that enables users to process and analyze massive volumes of data in a distributed way. It offers a unified platform for batch, real-time, streaming and machine learning data processing. Read more Category embedeed technologiesexternal techno Applications ModelizationPreparation Contexts 2.4 Java/Scala 11 Stable 2.4 Python 3.7 […]

Informatica

Catalogue techno Informatica

Informatica is a data management and enterprise integration platform that offers a comprehensive suite of tools and capabilities to help organizations effectively manage and transform their data.   Read more Category external techno Applications Preparation Techno Catalogue

Trifacta

Catalogue techno Trfacta

Trifacta is a cloud-based data preparation and analysis platform. It enables users to efficiently transform, cleanse and prepare their raw data for further analysis and use. Read more Category external techno Applications Preparation Techno Catalogue

Dataiku

Catalogue techno Dataiku

Dataiku is a data science and predictive analytics platform that facilitates collaboration between data teams and business users. It enables organizations to manage the entire lifecycle of data projects, from data preparation and cleansing to the creation of predictive models and the production of analyses. Read more Category external techno Applications Modelization Contexts Datasets V11.0 […]

Azure Databricks

Catalogue techno Azure Databricks

Azure Databricks is a fully managed data processing and analysis platform provided by Microsoft Azure. It is based on Apache Spark and offers a collaborative environment for data teams, enabling large-scale data processing workloads to be run quickly and efficiently. Read more Category external techno Applications Modelization Techno Catalogue