InterHop is proud to announce the launch of LibreDataHub.org
LibreDataHub is a ubiquitous, opensource and secure data analysis platform.

LibreDataHub is a meta-project that integrates several open-source projects into a convenient, secure, turnkey solution.

LibreDataHub is easy to install on a Linux server.

Presentation

With the organization of the datathon, InterHop is proud to announce the launch of its newest project: LibreDataHub.org

LibreDataHub provides free tools for data warehousing, decentralized AI, statistics, machine learning (ML) and Deep Learning.

LibreDataHub is a scalable, modular, opensource and collaborative IT meta-project. LibreDataHub is assembled by healthcare data processing experts from the InterHop association.

LibreDataHub can be freely installed on a Linux machine (Debian, for example).

LibreDataHub is a project developed and maintained by the InterHop association.

Technologies used

Architecture diagram

Shiny Proxy

Shiny Proxy is a java application for deploying applications. Resources (CPU and RAM) are shared between users.

The principle: the user connects to Shinyproxy, clicks on the application he wishes to run, a docker container of the application is launched and the application is displayed in the user’s browser.

The following applications are installed within LibreDataHub’s Shiny Proxy:

  • Metabase
  • LinkR
  • Jupyter Notebook
  • RStudio
  • CloudBeaver
  • SchemaSpy

Our special thanks go to the contributors to the Shiny Proxy project (including the openanalytics team), who made the creation of LibreDataHub possible.

Metabase

Metabase is a versatile Business Intelligence (BI) tool for data visualization, suitable for a wide range of users, for data exploration, analysis and decision support.

LinkR

At the heart of LibreDataHub is LinkR, an open-source web application developed by InterHop.

LinkR enables users to access, manipulate and analyze healthcare data with low-code tools, i.e. without the need for in-depth programming knowledge. LinkR uses the common data model OMOP to facilitate code exchange between multiple centers.

It provides both a graphical interface for clinicians and a programming environment for data scientists, making it ideal for collaborative healthcare projects.

Jupyter

Jupyter is a web application used for programming in over 40 languages, including Python, Julia, Ruby, R and Scala. Jupyter lets you create notebooks, i.e. programs containing both Markdown text and code in Julia, Python, R… These code notebooks are used in data science to explore and analyze data.

CloudBeaver

CloudBeaver is a lightweight web application for working with different types of databases, all through a single, secure cloud solution accessible via a browser.

Schema Spy

SchemaSpy is designed to simplify the understanding and documentation of database schemas. It generates detailed reports and interactive diagrams of database structure, helping analysts and researchers to easily navigate complex databases. This tool is particularly useful for understanding the relationships between different data tables and ensuring data integrity.

Grafana

For data visualization and real-time monitoring, LibreDataHub integrates Grafana, an open-source web application that lets users create dynamic, customizable dashboards to visualize data. LibreDataHub uses Graphana to monitor its technical infrastructure.

Here’s the result of a dashboard collected during the datathon.

DuckDB

LibreDataHub also integrates DuckDB, an opensource database management system designed for data analysis.

Its support for columnar storage formats such as Parquet allows seamless integration with other LibreDataHub components, such as Jupyter Notebooks or LinkR, enabling high-performance data queries directly in the search environment.

Development

LibreDataHub.org is a project run by the InterHop association.

The source code for the LibreDataHub platform is available online: https://framagit.org/interhop/libre-data-hub

We welcome any suggestions for improvement or reports of problems: https://framagit.org/groups/interhop/libre-data-hub/-/issues

The InterHop team is on hand to host LibreDataHub on HDS servers, for user feedback within the association’s network and for help with RGPD.