HALiance Project

The HALiance project aims at redesigning HAL’s core services and aligning them with recommendations and excellence criteria defined by the Ministry of Higher education and research as part of the National Plan for Open Science and within the framework of international initiatives in favour of open science (COAR, EOSC, cOAlition S). It builds on the renewal of hardware and software infrastructure, and aims to address the international stakes with regard to excellence, technological agility and reinforced interconnection.

The project is divided into 9 workpackages.

WP1 - Hardware infrastructure

Objectives: Upgrade and secure the hardware infrastructure of HAL
Deliverables:
  • The hardware infrastructure of HAL is able to adapt to a significant increase of the data to store and to treat
  • The backup system is redundant

 

Achievements :

 

2022

 

Installation of the HAProxy ALOHA load balancer

Change of a switch (replaced with an optical switch)

Creation of the Solr cluster

Installation of a MySQL cluster for future use

Technical investigation of a redundant backup solution

2023 Recruitment of a systems engineer

New NAS storage server

Switch to HaProxy to guarantee high availability

Improved network performance (installation of optical switches and 10 Gb cards)

Preparation of data backup in a second datacentre

WP2 - Software

Objectives: Migrate, secure and open the HAL software code
Deliverables:
  • A new application development environment is deployed
  • The HAL source code is open and published

 

Achievements :

 

2022

 

Study of architectures and migration

Creation of a product backlog

2023 Recruitment of a developer

Start of the application migration project (service)

WP3 - Metadata extraction and alignment with HAL reference data

Objectives: Extracting metadata and identifiers in the deposited files and automatically enrich the HAL database
Deliverables:
  • The extraction of named-entities within the pdf files of publications is optimised and automated (Authors, institutions, funders and projects, licences, infrastructures ; Research software and research data Citations)
  • The named-entities are automatically aligned with HAL reference data

 

Achievements :

 

2022

 

Feasibility studies for improved extraction of funding and licensing information from the full text

Definition of specifications for the automatic author affiliation service

Exchanges with partners Inria and Science-Miner

2023 Enrichment of auréHAL with the ROR identifier, in collaboration with the Open Science Barometer (BSO) team

Preliminary study and prototype for the automatic retrieval of funding data from PDF files (partnership with Science-Miner for the development of the Grobid application)

Preparation of the redesign of the automatic author affiliation service

WP4 - Organising and documenting the life cycle of bibliographic metadata imported to HAL

Objectives: Organize and document the lifecycle of bibliographic metadata imported into HAL
Deliverables:
  • The source of metadata is documented
  • Metadata traceability is documented (metadata life cycle)
  • Conflict management rules are defined and implemented
  • Synchronization of the imported metadata with those of HAL database

 

Achievements :

 

2022

 

Definition of specifications (metadata traceability, conflict management rules)

Exchanges with partner IN2P3

2023

The actions in this work package depend on the progress of WP2

WP5 - Preprint review and curation

Objectives: HAL to preprint review services and display the publication cycle
Deliverables:
  • HAL is automatically notified of a preprint review and updated versions of the preprint
  • The different statuses of the preprint are known and displayed (reviewed, recommended, accepted for publication, open peer review, etc.)
  • Researchers who deposit preprints in HAL can access to external reviewing services

Actions in this work package are a continuation of the HALOWIN project. This project has been extended to the end of 2023

WP6 - Selective publications' harvesting

Objectives: Implementing a new method for populating HAL by collecting scientific publications (full text)
Deliverables:
  • A mechanism identifies the scholarly publications’ full text which can be imported into HAL and feeds HAL
  • Tools for deduplication, enrichment and version management are available
  • Researchers have web interfaces to validate or not the import of their publications into HAL
  • Conference papers on SciencesConf are automatically imported into HAL

 

Achievements :

 

2022

 

Use of the corpus produced by INIST as part of the CorHAL project

Start-up of implementation of the service

Recruitment of a developer

2023

Launch of the deposit suggestions service: user interface, monitoring back office, data import workflow in HAL

WP7 - Link publications - research data

Objectives: Link publications and their research data
Deliverables:
  • An automated solution locates and associates to the publication deposited in HAL the identifier and the citation of the associated research data
  • A service helps to deposit a research dataset associated with a publication and transfer it to the appropriate data repository
  • Interoperability of HAL with Nakala (SHS) and Dataverse repositories (Recherche.data.gouv for ex)

 

Achievements :

 

2022

 

Exchanges with partner INRAe in the framework of the national repository recherche.data.gouv
2023 Integration of the COAR Notify protocol into HAL

 

WP8 - Supporting and involve users communities

Objectives: Ensure the visibility and the appropriation of the new services by its users
Deliverables:
  • Users are associated to the conception of the new services
  • Functional and technical documentation is available
  • A multi-modal training offer is proposed to the users
  • Users are informed of the general progress of the project

 

Achievements :

 

2022

 

Definition of a communication plan

Definition of an action plan for user engagement

2023 ROR alignment campaign (WP3)

Gathering user feedback for the new suggestions service (WP6)

Drafting of user documentation on the suggestions service

Publication of blog posts on the CCSD website

WP9 - FAIR principles

Objectives: Ensure consistency with FAIR principles
Deliverables:
  • A Core Trust Seal certification process is initiated
  • FAIR indicators are implemented

 

Achievements :

 

2022

 

Recruitment of a data steward to take charge of Core Trust Seal certification
2023 Recruitment of a data steward

Internal audit for CoreTrustSeal certification, gathering of information for report writing

Drafting of about.hal.science pages

Project sheet

Funding

Equipment for research P.I.A.3 – ESR/EquipEx+

Project reference

21-ESRE-0047

Duration

5,5 years

Investment

3,4 M€

Project kick-off

January 2022

Consortiums

CNRS, Inria, IN2P3, INRAE

Partners

Equipex+ Commons
(OpenEdition, Metopes, Huma-Num)