Popular repositories Loading
-
dist-keras
dist-keras Public archiveDistributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
-
spark-dashboard
spark-dashboard PublicSpark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.
-
SparkPlugins
SparkPlugins PublicCode and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics syst…
-
hdfs-metadata
hdfs-metadata PublicTool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks and nodes.
-
grafana-mimir-cardinality-dashboards
grafana-mimir-cardinality-dashboards PublicGrafana Mimir dashboards used for cardinality exploration
-
SparkDLTrigger
SparkDLTrigger PublicCode and links to the data for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"
Repositories
- opentelemetry-collector-contrib Public Forked from open-telemetry/opentelemetry-collector-contrib
Contrib repository for the OpenTelemetry Collector
cerndb/opentelemetry-collector-contrib’s past year of commit activity - spark-dashboard Public
Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.
cerndb/spark-dashboard’s past year of commit activity - SparkDLTrigger Public
Code and links to the data for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"
cerndb/SparkDLTrigger’s past year of commit activity - SparkTraining Public
Material for the course "Introduction to Apache Spark APIs for Data Processing" https://sparktraining.web.cern.ch/
cerndb/SparkTraining’s past year of commit activity - NotebooksExamples Public
This repository contains Jupyter notebook examples, intended to be linked with the SWAN Gallery
cerndb/NotebooksExamples’s past year of commit activity - SparkPlugins Public
Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics systems with user-provided monitoring probes.
cerndb/SparkPlugins’s past year of commit activity - sparkMeasure Public
This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics.
cerndb/sparkMeasure’s past year of commit activity - jdbc-connector-for-apache-kafka Public Forked from Aiven-Open/jdbc-connector-for-apache-kafka
Aiven's JDBC Sink and Source Connectors for Apache Kafka®
cerndb/jdbc-connector-for-apache-kafka’s past year of commit activity