Spark

Catalogue techno Spark

Apache Spark is a fast and powerful data processing system that enables users to process and analyze massive volumes of data in a distributed way. It offers a unified platform for batch, real-time, streaming and machine learning data processing. Read more Category embedeed technologiesexternal techno Applications ModelizationPreparation Contexts 2.4 Java/Scala 11 Stable 2.4 Python 3.7 […]

Informatica

Catalogue techno Informatica

Informatica is a data management and enterprise integration platform that offers a comprehensive suite of tools and capabilities to help organizations effectively manage and transform their data.   Read more Category external techno Applications Preparation Techno Catalogue

Trifacta

Catalogue techno Trfacta

Trifacta is a cloud-based data preparation and analysis platform. It enables users to efficiently transform, cleanse and prepare their raw data for further analysis and use. Read more Category external techno Applications Preparation Techno Catalogue

GCP Cloud Run

Catalogue techno GCP Cloud Run

GCP Cloud Run is a serverless computing service from Google Cloud Platform (GCP). It enables developers to run containerized applications in an automated and scalable way, according to demand. Read more Category external techno Applications Modelization Contexts Copy service EXPERIMENTAL New service EXPERIMENTAL Techno Catalogue

GCP Dataflow

Catalogue techno GCP Data Flow

GCP Dataflow is a streaming and batch data processing service from Google Cloud Platform (GCP). It enables developers to create scalable data pipelines to ingest, transform and analyze data in real time or in batches. Read more Category external techno Applications Preparation Contexts Clone job EXPERIMENTAL New job EXPERIMENTAL Techno Catalogue

GCP Cloud Functions

Catalogue techno GCP Cloud Functions

GCP Cloud Functions is a serverless computing service offered by Google Cloud Platform (GCP). It enables developers to run individual functions in response to specific events, without worrying about managing the underlying infrastructure. Read more Category external techno Applications Preparation Contexts Default EXPERIMENTAL Techno Catalogue

GCP Transfer

Catalogue techno GCP Data Transfer

GCP Transfer is a service offered by Google Cloud Platform (GCP) that enables you to transfer and synchronize data between different sources and destinations, securely and scalably. Read more Category external techno Applications Preparation Versionning Amazon S3 transfer jobs EXPERIMENTAL GCS transfer jobs EXPERIMENTAL Techno Catalogue

Azure Machine Learning Services

Catalogue techno Azure machine learning services

Azure Machine Learning Services is a cloud service provided by Microsoft Azure that enables developers and data scientists to create, deploy and manage large-scale machine learning models. It offers a comprehensive set of tools and features to support the full machine learning lifecycle, from data preparation to model release. Read more Category external techno Applications […]

Azure Functions

Catalogue techno Azure fonctions

Azure Functions is a serverless computing service provided by Microsoft Azure. It enables developers to run code in the cloud in response to specific events, without worrying about managing the underlying infrastructure. Read more Category external techno Applications Preparation Techno Catalogue

Azure Data Factory

Catalogue techno Azure Data Factory

Azure Data Factory is a data management service from Microsoft Azure. It enables users to create, plan and orchestrate hybrid, cloud-based data flows, leveraging a variety of Azure data sources and services. Read more Category external techno Applications Preparation Techno Catalogue

Dataiku

Catalogue techno Dataiku

Dataiku is a data science and predictive analytics platform that facilitates collaboration between data teams and business users. It enables organizations to manage the entire lifecycle of data projects, from data preparation and cleansing to the creation of predictive models and the production of analyses. Read more Category external techno Applications Modelization Contexts Datasets V11.0 […]

Azure Databricks

Catalogue techno Azure Databricks

Azure Databricks is a fully managed data processing and analysis platform provided by Microsoft Azure. It is based on Apache Spark and offers a collaborative environment for data teams, enabling large-scale data processing workloads to be run quickly and efficiently. Read more Category external techno Applications Modelization Techno Catalogue