Spark
Apache Spark is a fast and powerful data processing system that enables users to process and analyze massive volumes of data in a distributed way. It offers a unified platform for batch, real-time, streaming and machine learning data processing. Read more Category embedeed technologiesexternal techno Applications ModelizationPreparation Contexts 2.4 Java/Scala 11 Stable 2.4 Python 3.7 […]
Informatica
Informatica is a data management and enterprise integration platform that offers a comprehensive suite of tools and capabilities to help organizations effectively manage and transform their data. Read more Category external techno Applications Preparation Techno Catalogue
Trifacta
Trifacta is a cloud-based data preparation and analysis platform. It enables users to efficiently transform, cleanse and prepare their raw data for further analysis and use. Read more Category external techno Applications Preparation Techno Catalogue
GCP Cloud Run
GCP Cloud Run is a serverless computing service from Google Cloud Platform (GCP). It enables developers to run containerized applications in an automated and scalable way, according to demand. Read more Category external techno Applications Modelization Contexts Copy service EXPERIMENTAL New service EXPERIMENTAL Techno Catalogue
GCP Dataflow
GCP Dataflow is a streaming and batch data processing service from Google Cloud Platform (GCP). It enables developers to create scalable data pipelines to ingest, transform and analyze data in real time or in batches. Read more Category external techno Applications Preparation Contexts Clone job EXPERIMENTAL New job EXPERIMENTAL Techno Catalogue
GCP Cloud Functions
GCP Cloud Functions is a serverless computing service offered by Google Cloud Platform (GCP). It enables developers to run individual functions in response to specific events, without worrying about managing the underlying infrastructure. Read more Category external techno Applications Preparation Contexts Default EXPERIMENTAL Techno Catalogue
GCP Transfer
GCP Transfer is a service offered by Google Cloud Platform (GCP) that enables you to transfer and synchronize data between different sources and destinations, securely and scalably. Read more Category external techno Applications Preparation Versionning Amazon S3 transfer jobs EXPERIMENTAL GCS transfer jobs EXPERIMENTAL Techno Catalogue
Azure Machine Learning Services
Azure Machine Learning Services is a cloud service provided by Microsoft Azure that enables developers and data scientists to create, deploy and manage large-scale machine learning models. It offers a comprehensive set of tools and features to support the full machine learning lifecycle, from data preparation to model release. Read more Category external techno Applications […]
Azure Functions
Azure Functions is a serverless computing service provided by Microsoft Azure. It enables developers to run code in the cloud in response to specific events, without worrying about managing the underlying infrastructure. Read more Category external techno Applications Preparation Techno Catalogue
Azure Data Factory
Azure Data Factory is a data management service from Microsoft Azure. It enables users to create, plan and orchestrate hybrid, cloud-based data flows, leveraging a variety of Azure data sources and services. Read more Category external techno Applications Preparation Techno Catalogue
Dataiku
Dataiku is a data science and predictive analytics platform that facilitates collaboration between data teams and business users. It enables organizations to manage the entire lifecycle of data projects, from data preparation and cleansing to the creation of predictive models and the production of analyses. Read more Category external techno Applications Modelization Contexts Datasets V11.0 […]
Azure Databricks
Azure Databricks is a fully managed data processing and analysis platform provided by Microsoft Azure. It is based on Apache Spark and offers a collaborative environment for data teams, enabling large-scale data processing workloads to be run quickly and efficiently. Read more Category external techno Applications Modelization Techno Catalogue