Spark
Apache Spark is a fast and powerful data processing system that enables users to process and analyze massive volumes of data in a distributed way. It offers a unified platform for batch, real-time, streaming and machine learning data processing. Read more Category embedeed technologiesexternal techno Applications ModelizationPreparation Contexts 2.4 Java/Scala 11 Stable 2.4 Python 3.7 […]
Informatica
Informatica is a data management and enterprise integration platform that offers a comprehensive suite of tools and capabilities to help organizations effectively manage and transform their data. Read more Category external techno Applications Preparation Techno Catalogue
Trifacta
Trifacta is a cloud-based data preparation and analysis platform. It enables users to efficiently transform, cleanse and prepare their raw data for further analysis and use. Read more Category external techno Applications Preparation Techno Catalogue
Dataiku
Dataiku is a data science and predictive analytics platform that facilitates collaboration between data teams and business users. It enables organizations to manage the entire lifecycle of data projects, from data preparation and cleansing to the creation of predictive models and the production of analyses. Read more Category external techno Applications Modelization Contexts Datasets V11.0 […]
Azure Databricks
Azure Databricks is a fully managed data processing and analysis platform provided by Microsoft Azure. It is based on Apache Spark and offers a collaborative environment for data teams, enabling large-scale data processing workloads to be run quickly and efficiently. Read more Category external techno Applications Modelization Techno Catalogue