Job offer
Associated Inria centers
Contract type
Context
<p>This Master 2 internship is part of the DXP (Data Exchange Platform) project, carried out within IPCEI-CIS (Important Project of Common European Interest – Next Generation Cloud Infrastructure and Services) and involving Amadeus and three Inria research teams (Loreley, CEDAR and MAGELLAN). The project aims to design and develop an open-source management solution for a federated and distributed data exchange platform (DXP), operating in an open, scalable, and massively distributed environment (cloud-edge continuum).</p>
<p>The internship will take place at the Inria Center of the University of Lorraine, in the Loreley team. It will be supervised by Claudia-Lavinia Ignat, Research Director at Inria in Nancy, and Stefania Dumbrava, Assistant Professor at ENSIIE/Inria Paris/Télécom Sud-Paris.</p>
<p>The Inria Center of the University of Lorraine is one of Inria's nine centers and has twenty project teams, located in Nancy, Strasbourg and Saarbrücken. It brings together over 400 people of 45 different nationalities, including scientists and research and innovation support staff. The Inria Center is a major and recognized player in the field of digital sciences. It is at the heart of a rich R&D and innovation ecosystem: highly innovative SMEs, large industrial groups, competitiveness clusters, research and higher education players, laboratories of excellence, technological research institutes, etc.</p>
Assigned mission
<p>Modern data management increasingly involves distributed, collaborative environments where data is jointly produced and shared across sites. At the same time, SQL/PGQ makes it possible to define property graphs as views over relational data, unifying relational querying with graph pattern matching and exposing graphs as relational views ([Angles18], [Wolde23], [Han24]). Replication techniques such as CRDTs show how independently evolving replicas can converge under concurrent updates while preserving invariants ([Preguiça19], [Balegas18], [Yu20], [Rault22], [Ignat24]). Centralized incremental view maintenance is well studied ([Budiu22], [Battiston24], [Thomassen23]), and there are proposals for key-like constraints and schemas for property graphs ([Angles21], [Angles23]). However, most systems still replicate raw tables or documents rather than treating such views as the primary unit of synchronization. This internship investigates, at a small, controlled proof-of-concept scale, how graph views defined over relational data can themselves be replicated and synchronized.</p>
<p><strong>References</strong><br /> <br /> [Angles18] Renzo Angles: The Property Graph Database Model. AMW, 2018.<br /> [Angles21] Renzo Angles, Angela Bonifati, Stefania Dumbrava, George Fletcher, Keith W. Hare, Jan Hidders, Victor E. Lee, Bei Li, Leonid Libkin, Wim Martens, Filip Murlak, Josh Perryman, Ognjen Savkovic, Michael Schmidt, Juan F. Sequeda, Slawek Staworko, Dominik Tomaszuk: PG-Keys: Keys for Property Graphs. SIGMOD Conference 2021: 2423–2436.<br /> [Angles23] Renzo Angles, Angela Bonifati, Stefania Dumbrava, George Fletcher, Alastair Green, Jan Hidders, Bei Li, Leonid Libkin, Victor Marsault, Wim Martens, Filip Murlak, Stefan Plantikow, Ognjen Savkovic, Michael Schmidt, Juan Sequeda, Slawek Staworko, Dominik Tomaszuk, Hannes Voigt, Domagoj Vrgoc, Mingxi Wu, Dusan Zivkovic: PG-Schema: Schemas for Property Graphs. Proc. ACM Manag. Data 1(2): 198:1–198:25 (2023).<br /> [Balegas18] Valter Balegas, Sérgio Duarte, Carla Ferreira, Rodrigo Rodrigues, Nuno M. Preguiça: IPA: Invariant-preserving Applications for Weakly Consistent Replicated Databases. Proc. VLDB Endow. 12(4): 404–418 (2018).<br /> [Battiston24] Ilaria Battiston, Karan Kathuria, Peter Boncz: OpenIVM: A SQL-to-SQL Compiler for Incremental Computations. SIGMOD Companion, 2024.<br /> [Budiu22] Mihai Budiu, Frank McSherry, Leonid Ryzhyk, Val Tannen: DBSP: Automatic Incremental View Maintenance for Rich Query Languages. arXiv:2203.16684, 2022.<br /> [Clark22] Stanley Clark, Nikolay Yakovets, George Fletcher, Nicola Zannone: ReLOG: A Unified Framework for Relationship-Based Access Control over Graph Databases. DBSec 2022: 303–315.<br /> [Han24] Sunwoo Han, Zachary G. Ives: Implementation Strategies for Views over Property Graphs. SIGMOD, 2024.<br /> [Ignat24] Claudia-Lavinia Ignat, Victorien Elvinger, Habibatou Ba: SynQL: A CRDT-Based Approach for Replicated Relational Databases with Integrity Constraints. DAIS, 2024.<br /> [Preguiça19] Nuno M. 
Preguiça, Carlos Baquero, Marc Shapiro: Conflict-Free Replicated Data Types (CRDTs). Encyclopedia of Big Data Technologies, 2019.<br /> [Rault22] Pierre-Antoine Rault, Claudia-Lavinia Ignat, Olivier Perrin: Distributed Access Control for Collaborative Applications Using CRDTs. PaPoC@EuroSys 2022: 33–38.<br /> [Thomassen23] J. Thomassen, W. Yu: Eventually-Consistent Replicated Relations and Updatable Views. In: Abelló, A. et al. New Trends in Database and Information Systems (ADBIS 2023). Communications in Computer and Information Science, vol. 1850. Springer, 2023.<br /> [Wolde23] Daniel Wolde, Gábor Szárnyas, Peter Boncz: DuckPGQ: Bringing SQL/PGQ to DuckDB. Proc. VLDB Endow. 16(12): 4034–4037 (2023).<br /> [Yu20] Weihai Yu, Claudia-Lavinia Ignat: Conflict-Free Replicated Relations for Multi-Synchronous Database Management at Edge. IEEE SMDS, 2020.</p>
Main activities
<p>The project will focus on a collaborative data management scenario based on a graph-relational model, in which a graph view is defined over a set of relational tables, together with a set of basic integrity constraints, and replicated across multiple sites. The intern will help define an abstract replication model that treats such views as the primary unit of synchronization between a small number of replicas. This model will describe how a restricted class of updates is propagated and reconciled under concurrency, and under which assumptions basic constraints are preserved or can be restored after conflicts. Building on this model, the intern will design and implement a lightweight prototype on top of an existing data management system. This prototype will simulate several replicas, apply simple but well-specified replication and merge policies, and maintain materialized graph views in the presence of updates.<br /> <br /> Depending on the results and mutual interest, this work can lead to a PhD project.</p>
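<p>As an illustration only, the kind of prototype described above could start from a sketch like the following. It is not the project's design: the last-writer-wins merge policy, the <code>person</code> and <code>knows</code> tables, the <code>Replica</code> class and the rule of hiding edges with a missing endpoint are all assumptions chosen for brevity, and the graph view is recomputed from scratch rather than maintained incrementally.</p>

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Version:
    stamp: tuple   # (logical counter, replica id): total order on writes
    value: object  # dict of column values, or None as a deletion tombstone


@dataclass
class Replica:
    rid: str
    clock: int = 0
    rows: dict = field(default_factory=dict)  # (table, key) -> Version

    def write(self, table, key, value):
        """Local insert/update (value is a dict) or delete (value is None)."""
        self.clock += 1
        self.rows[(table, key)] = Version((self.clock, self.rid), value)

    def merge(self, other):
        """Assumed policy: last writer wins, per row, by largest stamp."""
        for k, v in other.rows.items():
            mine = self.rows.get(k)
            if mine is None or v.stamp > mine.stamp:
                self.rows[k] = v
        self.clock = max(self.clock, other.clock)

    def graph_view(self):
        """Derive the graph view: person rows are nodes, knows rows are edges.
        Edges whose endpoints are not live nodes are hidden (assumed rule)."""
        live = {k: v.value for k, v in self.rows.items() if v.value is not None}
        nodes = {key for (table, key) in live if table == "person"}
        edges = {
            (row["src"], row["dst"])
            for (table, _), row in live.items()
            if table == "knows" and row["src"] in nodes and row["dst"] in nodes
        }
        return nodes, edges


# Two replicas start from the same state, then diverge concurrently.
a, b = Replica("A"), Replica("B")
a.write("person", 1, {"name": "Ana"})
a.write("person", 2, {"name": "Bo"})
b.merge(a)
a.write("knows", "e1", {"src": 1, "dst": 2})  # A adds an edge 1 -> 2
b.write("person", 2, None)                    # B concurrently deletes person 2
a.merge(b)
b.merge(a)
```

<p>After both merges the two replicas hold the same rows and therefore derive the same graph view. The dangling edge is kept in the base table but hidden from the view, which is one possible way to restore a referential constraint after a conflict; other policies (for example, re-creating the deleted node) would be equally valid choices to study.</p>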
Skills
<p>The internship targets a Master 2 (or equivalent) student in Computer Science with an interest in databases, distributed systems, and graph data management. The student should be comfortable with programming and have prior exposure to relational databases and SQL. Familiarity with graph databases or graph query languages is a plus but not mandatory. The topic is research-oriented and well suited to a student considering a PhD in this area.</p>
Reference
2025-09651
Field of activity