Data Science Programme

Coordinators: Prof. Dimitris Plexousakis, Prof. Nicolas Spyratos

The Data Science research programme (created in late 2017) aims at exploiting the computational infrastructure and expertise of the research labs of the Institute in managing, analyzing and visualizing large volumes of data with the purpose of exploring the value chain related to big data. Emphasis is placed on the development of techniques for the collection, storage, processing, analysis and interpretation of large volumes of heterogeneous data that are dynamic, interlinked, and concern various domains including biology, medicine, particle physics, astronomy, remote sensing, environmental studies, finance etc. The programme intends to gradually create a large-scale computational infrastructure, along with methods and tools that will cater to the needs of the research laboratories of ICS and the other institutes of FORTH for big data management and analysis. The programme exploits the computational infrastructure of the CARV and ISL laboratories including software tools for managing and analyzing large volumes of data (e.g., Apache Spark, Flink, Blazegraph, Cassandra).

The following major issues are part of the strategic direction of the Data Science programme:

  • Storage, indexing and querying very large volumes of heterogeneous data
  • Data provenance and summarization
  • Large-scale data integration
  • Data mining and causal discovery
  • Information extraction from the Web
  • Linked data management, querying and updating
  • Data quality assurance
  • Metadata extraction from low-level signals
  • Statistical machine learning
  • Real-time visualization
  • Large-scale analytics for decision support
  • Resource-constrained reasoning with large volumes of data
  • Applications in medicine, biology, astrophysics

 

Participating lCS laboratories: Information Systems Laboratory (ISL) (coordinator), Computer Architecture and VLSI Systems (CARV) Laboratory, Computational BioMedicine Laboratory (CBML), Human Computer Interaction (HCI) Laboratory, Signal Processing Laboratory (SPL), Telecommunications and Networks Laboratory (TNL).