InterHop is proud to announce the launch of LibreDataHub.org
LibreDataHub is a ubiquitous, open-source and secure data analysis platform.
LibreDataHub is a meta-project that integrates several open-source projects into a convenient, secure, turnkey solution.
LibreDataHub is easy to install on a Linux server.
Presentation
With the organization of the datathon, InterHop is proud to announce the launch of its newest project: LibreDataHub.org
LibreDataHub provides free tools for data warehousing, decentralized AI, statistics, machine learning (ML) and deep learning.
LibreDataHub is a scalable, modular, open-source and collaborative IT meta-project. It is assembled by healthcare data processing experts from the InterHop association.
LibreDataHub can be freely installed on a Linux machine (Debian, for example).
LibreDataHub is a project developed and maintained by the InterHop association.
Technologies used
Architecture diagram
Shiny Proxy
Shiny Proxy is a Java application for deploying applications. Resources (CPU and RAM) are shared between users.
The principle: the user connects to Shiny Proxy and clicks on the application they wish to run. A Docker container for that application is launched, and the application is displayed in the user’s browser.
The following applications are installed within LibreDataHub’s Shiny Proxy:
- Metabase
- LinkR
- Jupyter Notebook
- RStudio
- CloudBeaver
- SchemaSpy
Special thanks go to the contributors to the Shiny Proxy project (including the Open Analytics team), who made the creation of LibreDataHub possible.
Metabase
Metabase is a versatile Business Intelligence (BI) tool for data visualization, suitable for a wide range of users, for data exploration, analysis and decision support.
LinkR
At the heart of LibreDataHub is LinkR, an open-source web application developed by InterHop.
LinkR enables users to access, manipulate and analyze healthcare data with low-code tools, i.e. without the need for in-depth programming knowledge. LinkR uses the OMOP common data model to facilitate code exchange between multiple centers.
It provides both a graphical interface for clinicians and a programming environment for data scientists, making it ideal for collaborative healthcare projects.
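To illustrate why a common data model makes code exchange easier, here is a minimal sketch in Python. It is not LinkR code: it assumes two hypothetical centers, uses the standard-library `sqlite3` module as a stand-in database, and models only three columns of the OMOP `person` table (`person_id`, `gender_concept_id`, `year_of_birth`). The helper `make_center` and the sample rows are invented for the example. Because both centers share the same table and column names, one query runs unchanged on each.

```python
import sqlite3

# Standard OMOP concept IDs for gender.
FEMALE, MALE = 8532, 8507

# One query, written once, shared between centers.
SHARED_QUERY = """
SELECT gender_concept_id, COUNT(*) AS n
FROM person
WHERE year_of_birth < 1960
GROUP BY gender_concept_id
ORDER BY gender_concept_id
"""


def make_center(rows):
    """Create an in-memory database holding a tiny subset of the OMOP person table.

    Hypothetical helper: a real center would expose its own OMOP database.
    """
    con = sqlite3.connect(":memory:")
    con.execute(
        "CREATE TABLE person ("
        "person_id INTEGER PRIMARY KEY, "
        "gender_concept_id INTEGER, "
        "year_of_birth INTEGER)"
    )
    con.executemany("INSERT INTO person VALUES (?, ?, ?)", rows)
    return con


# Invented sample data for two centers.
center_a = make_center([(1, FEMALE, 1950), (2, MALE, 1972), (3, FEMALE, 1941)])
center_b = make_center([(1, MALE, 1955), (2, MALE, 1938), (3, FEMALE, 1990)])

# The same query text runs on both centers without modification.
results = {
    name: con.execute(SHARED_QUERY).fetchall()
    for name, con in [("A", center_a), ("B", center_b)]
}
print(results)
```

Each center keeps its data locally; only the query text needs to travel between sites.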
Jupyter
Jupyter is a web application used for programming in over 40 languages, including Python, Julia, Ruby, R and Scala. Jupyter lets you create notebooks, i.e. programs containing both Markdown text and code in Julia, Python, R… These code notebooks are used in data science to explore and analyze data.
CloudBeaver
CloudBeaver is a lightweight web application for working with different types of databases, all through a single, secure cloud solution accessible via a browser.
SchemaSpy
SchemaSpy is designed to simplify the understanding and documentation of database schemas. It generates detailed reports and interactive diagrams of database structure, helping analysts and researchers to easily navigate complex databases. This tool is particularly useful for understanding the relationships between different data tables and ensuring data integrity.
Grafana
For data visualization and real-time monitoring, LibreDataHub integrates Grafana, an open-source web application that lets users create dynamic, customizable dashboards to visualize data. LibreDataHub uses Grafana to monitor its technical infrastructure.
Here is a dashboard captured during the datathon.
DuckDB
LibreDataHub also integrates DuckDB, an open-source database management system designed for data analysis.
Its support for columnar storage formats such as Parquet allows seamless integration with other LibreDataHub components, such as Jupyter Notebooks or LinkR, enabling high-performance data queries directly in the research environment.
Development
LibreDataHub.org is a project run by the InterHop association.
The source code for the LibreDataHub platform is available online: https://framagit.org/interhop/libre-data-hub
We welcome any suggestions for improvement or reports of problems: https://framagit.org/groups/interhop/libre-data-hub/-/issues
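The InterHop team is on hand to host LibreDataHub on HDS-certified servers (HDS, "Hébergeur de Données de Santé", is the French certification for health data hosting), to share user feedback within the association’s network, and to help with GDPR compliance.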