Hoai Le Nguyen, PhD student in the COAST team, will defend his thesis on Thursday, January 14th a at 9am.
His defense is entitled “Study of conflicts in Collaborative Editing” and will be held in English.
Early researches about Collaborative Editing (CE) (in the 1990s and the early 2000) focused on describing different characteristics of CE based on interviewing people who had participated in some CE projects. Some recent researches about CE started analyzing the logs of CE activities to study how people edit together with support of modern CE tools such as Git version control systems and Google Docs.
From the general view point, the process of CE is the continuous synchronization of ‘multiple, parallel streams of activity’ of collaborators. If the synchronization takes place less often, for example in the development of a software project based on Git version control system, the work mode is called ‘asynchronous’. And if the synchronization takes place within a small interval, for example in editing a shared document in ShareLaTex, the work mode is called ‘synchronous’. The longer the divergence is, more conflicts are likely to happen during the synchronization. Resolving conflicts is costly, especially after a long period of divergence. Understanding how often conflicts happen and how users resolve conflict in real CE projects is important to ensure good performance and user experience in collaborative editing. In the first part of this thesis, we study the collaboration traces of four large open source projects in Git version control system. We analyze different types of textual conflicts that arise during the development and how developers resolve these types of conflict. In particular regarding ‘adjacent-lines conflicts’, we found that users mostly resolve them by applying changes from both sites. Besides, we also analyze how often users use ‘roll-back to previous version’ as a way to resolve merge conflicts.
The process of CE based on online collaborative editor is more specific. It can be split into several ‘sessions’ of editing which are performed by a single author or several authors. They are denoted as ‘single-authored sessions’ and ‘co-authored sessions’ respectively. This fragmentation process requires a predefined ‘interval’ or ‘maximum time gap’ which is not yet well defined in previous studies. In the second part of this thesis, we analyze the logs of CE works of students of an Engineering School using ShareLaTeX which were collected and anonymized for privacy purpose. By examining different ‘maximum time gaps’ from 30 seconds to 15 minutes on the logs we found that we can define a suitable ‘maximum time gap’ to split CE activities into sessions by evaluating the distribution of the ‘external-distance’. Besides, we analyse the editing activities inside each ‘co-author session’. We borrow a [30 seconds, 10 characters] time-position window to examine these ‘potential conflict’ cases. The result shows that people rarely edit closely in both time-position. However, conflicts are more likely to happen in these cases.
Keywords: collaborative editing, conflict, version control, real-time collaborative editors
Mme Sophie CHABRIDON, Professeur, Télécom SudParis – FRANCE
M Stefano ZACCHIROLI, Maître de Conférences, Université de Paris – FRANCE
M Horatiu CIRSTEA, Professeur, Université de Lorraine – FRANCE
Mme Hala SKAF-MOLLI, Maître de Conférences, Université de Nantes – FRANCE
M François CHAROY, Professeur, Université de Lorraine – FRANCE
Mme Claudia IGNAT, Chargée de recherche, Inria Nancy-Grand Est – FRANCE