Abstract

The most used format for submitting and publishing papers in the academic domain is Portable Document Format (PDF), since its possibility of being rendered in the same way independently from the device used for visualising it. However, the PDF format has some important issues as well: it lacks of interactivity, it is not machine-readable, it has seen as a monolithic artefact in the Web, and it is not offering an acceptable accessibility to people with disabilities. In order to address these issues, recently some journals, conferences, and workshops have started to accept also HTML as Web-first submission/publication format. However, most of the people are not able to produce a well-formed HTML5 article from scratch, and they would, thus, need an appropriate interface, e.g. a wordprocessor, for creating such HTML-compliant scholarly article.

So as to provide a solution to the aforementioned issue, in this paper we introduce the RASH Javascript Editor (a.k.a. RAJE), which is a multiplatform wordprocessor for writing scholarly article in HTML natively. In particular it generates documents compliant with the RASH format, that is a subset of HTML5 developed for sharing scholarly articles with appropriate Web technologies. In particular, RAJE allows authors to write research papers by means of a user-friendly interface hiding the complexities of HTML5. We also discuss the outcomes of an user study where we ask several people from the Academia to write a scientific paper using RAJE.

RASH version: https://rawgit.com/gspinaci/raje-doceng2017/master/raje-doceng2017.html

Introduction

Currently the most used format for submitting and publishing academic research articles is the Portable Document Format (PDF) [7]. One of the main strengths if this format is its hard formatting style that ensures to be read and visualised by any device in the same way. However, PDF has also important drawbacks, such as the lack of interactivity, the fact that its monolithic structure is not appropriate for sharing its content on the Web as a common webpage, and its known difficulties regarding the accessiblity of its content by people with disabilities [8]. In order to create an article, authors need to create a document that will be exported as PDF. These documents are usually written with word processor (e.g. Microsoft Word and OpenOffice) or by means of appropriate markup languages such as LaTeX. While in the first case the rendering of the page (and, thus, of the final PDF to export) is what the wordprocessor show in the screen, LaTeX obliges its user to compile the document in order to understand how it will be rendered in PDF.

In the last years, the use of PDF as submission/publication format has been criticised by several people and movements (e.g. Linked Research), and several discussion has raised the issue of using another (Web-first, accessible, and open) format for perparing and sharing scholarly works, i.e. HTML [4]. Following this idea, several conferences and journals have started to experiment the use of HTML as submission format. For instance the 15th and the 16th International Semantic Web Conferece, the 13th and the 14th Extended Semantic Web Conference, the 20th International Conference on Knowledge Management and Knowledge Engineering, the Poster and Demo session at 13th International Conference on Semantic Systems, and journals like Data Science published by IOS Press, are among those that have started to adopt HTML.

One of the most used and appreciated 2 formats for creating HTML-based scholarly papers is Research Articles in Simplified HTML (RASH) [1]. RASH is a markup language that restricts the use of HTML elements to only 32 elements, and enables one to include also RDF statements [10]. In addition, it uses the the new Digital Publishing WAI-ARIA Module 1.0 [9] for specifying accessible structural semantics to the various document parts. While the framework that has beed developed for RASH includes a lot of conversion tools from wordprocessor formats (e.g. the Open Document Format used by Open Office and the Open XML format used by Microsfot Word), no native editor for facilitating the creation of Web-first RASH article has been proposed yet.

In this paper we introduce the RASH Javascript Editor (RAJE), a What You See Is What You Get (WYSIWYG) wordprocessor for writing scholarly articles in HTML by means of the RASH format. In particular RAJE allows authors to write research papers in HTML natively by means of a user-friendly interface, instead of writing raw markup with a IDE, text editors or external wordprocessors. It guarantees users the benefits of a wordprocessor combined with the ones given by a HTML-based format which are interactiveness, accessibility and easiness to be processed by machine. In addition, RAJE uses the GitHub API so as to allow authors to store their articles online, to keep track of changes by means of the GitHub services, and to share the articles with others. This paper has been written by using RAJE itself, and converted into the appropriate layout requested by the conference by means of the RASH Online Conversion Service (ROCS ) [11].

The rest of the paper is organised as follows. In Section 2 we describe some of the most important HTML-based editors. In Section 3 we introduce RAJE, and we provide high level principles and low level details. In Section 4 we present the outcomes of a user testing session we have run by involving people in the Academia so as to understand the level of preceived usability of RAJE when used for writing a scholarly article. Finally, in Section 5 we conclude the paper sketching out some future developments.

Related works

In the past several works have been developed for allowing an easy creation of HTML-based scholarly papers. In this section we introduce some of the most relevant ones, by discussing their main characteristics.

Authorea is an online platform to write and edit scientific papers. Every article is an HTML document and Git repository simultaneously. The main goal for the development of Authorea was to fix collaborative problems that could came out during the creation of technical, scholarly and scientific writings, so as to allow authors to take advantages of using Git versioning system and keeping track of every single displayed change. In addition, every article in accessible from anywhere, by using any device connected to the Internet, and no TeX installation is required for producing the final PDF layout necessary for the submission.In addition, Authorea allows one to write mathematical notations, tables, plots and figures in each LaTeX and MathML.

Dokieli [12] is a client side editor for decentralised article publishing, annotations and social interactions. Since dokieli follows a pure decentralization principle, authors can publish the paper where they prefer. In particular, dokieli complies with the Solid protocol, based on the Linked Data Platform W3C standard [13] [14], so as to allows one to edit the HTML-based scholarly article in the browser and to save it directly to server storage.

Fidus Writer is an open source WYSIWYG collaborative HTML-based word processor made for academics who need to use citations and formulas within papers. All the articles created with it can be exported in more ways: as a website, a paper, or an ebook. Independently from the particular export, the editor allows authors to focus on the content of the paper, while the final layout can be choose once the content is finalised. FidusWriter supports LaTeX for adding footnotes and citations directly inside the documents, and allows a real-time collaboratiion with other users.

TinyMCE is a WYSIWYG HTML-based editor that can be instantiated within an existing HTML page. It is a huge project that involves more than 150 contributors with almost 6500 commits on Github. Its include facilities for formatting, table insertion, image editing, customizable themes, and it is accesible for users with disabilities – in fact it follows WAI-ARIA guidelines [15] so as to make HTML documents compatible with screen readers such as JAWS and NVDA. It is already integrated with frameworks such as JQuery and Angular.js, or inside common content management systems such as Wordpress or Joomla.

The RASH Javascript Editor

In this section we provide a brief introduction about RASH and its Framework, and then we discuss the RASH Javascript Editor (RAJE) by preseting its features and by discussing some details about its implementation.

The RASH Framework

The Research Articles in Simplified HTML (RASH) format is a markup language that restricts the use of HTML elements to only 32 elements for writing academic research articles. It allows authors to add RDF statements [10] to the document by means of the element script with the attribute type set to application/rdf+xml, text/turtle or to application/ld+json. In addition, RASH strictly follows the Digital Publishing WAI-ARIA Module 1.0 [9] for expressing structural semantics on various markup elements used.

A RASH document begins as a simple (X)HTML5 document, by specifying the generic HTML DOCTYPE followed by the document element html with the usual namespace http://www.w3.org/1999/xhtml specified. The element html contains the element head for defining metadata of the document according to the DCTERMS and PRISM standards, and the element body for including the whole content of the document. On the one hand, the element head of a RASH document must/should include some information about the paper, i.e., the paper title (element title), at least one author and other related information (i.e., affiliations, keywords, categories, by using the elements meta and link). On the other hand, the element body mainly contains textual elements (e.g., paragraphs, emphases, links, and quotations) for describing the content of the paper, and other structural elements (e.g., abstract, sections, references, and footnotes) used to organised the paper in appropriate blocks and to present specific complex structures (e.g., figures, formulas, and tables).

RASH is accompanied by its Framework, which is a set of specifications and writing/conversion/extraction tools for writing articles in RASH. In particular the RASH Framework includes tools for checking the validity of RASH documents, visualising RASH documents on browsers with different layouts, for converting RASH documents into LaTeX and ODT/DOCX files into RASH, and for automatically annotate RASH elements with their actual (structural) semantics according to the Document Components Ontology (DoCO) [16].

RAJE: high-level overview

RAJE is a stand-alone multi-platform software used to generate RASH document. Its sources and the related binary files are available on GitHub a licensed according to the ISC License, as all the other tools included in the RASH Framework. Before starting the development of the tools we have decided a series of desiderata and needs that such an editor should properly address, as introduced in the following list:

being totally-compatible with RASH and HTML5;
relying on the existing tools developed for visualising RASH documents, included in the RASH Framework;
simplifying the writing of the document keeping things as natural as possible if one has to edit the document by means of her favourite wordprocessor;
not enabling the user to personalise the actual rendering of it according to specific layouts – the focus is on the content only;
following the best practices already used by existing wordprocessors such as Microsoft Word and Open Office for creating structures typical of scholarly articles (lists, tables, figures, references, etc.);
allowing the creation of mathematical formulas by means of an appropriate interface, hiding the complexities of the language used for storing such formulas with Web-first formats, e.g. MathML;
interacting with GitHub for easing the storing, sharing, and preservation of the articles.

In the following subsections we provide a brief discussion about how the typical structures of a scholarly article can be created by means of RAJE.

Article metadata

The metadata of a RASH document are stored inside the head element in the form of meta or link. These metadata are rendered by the browser as shown in Figure 1 and include: the title (mandatory), the author list (mandatory), the ACM subject categories (optional), and the keywords (optional). RAJE permits to act on any of these metadata.

img/header.png — **Figure 1.** Header of a rendered RASH document.

Every element in this metadata section can be easily single-edited by clicking it, and the cursor will be placed where requested.

The title is usually accompanied with a subtitle. To insert the subtitle, one can simply press enter at the end of the title. Once added, the subtitle can be filled up with text, and the author can proceed to the next editable element pressing enter again.

Any author listed has its own button set for adding a new author after it or for removing the selected one. It is possible to specify more affiliation for the same author by pressing pressing the enter key once the cursor is placed in the affiliation area, just after the author's name. This action will add a new placeholder that can be editer by typing new text or removed by pressing enter again as shown in Figure 2.

img/author.png — **Figure 2.** Adding new affiliation.

Categories and keywords can be also edited by clicking on the specific section, and pressing the enter key allows one to add additional keyword/categories.

Mathematical formulas

From the version 0.6, RASH has started to support the use of different languages for writing mathematical expressions, which include MathML, LaTeX and AsciiMath. All these formulas are actually rendered by means of the MathJax processor, while they will be stored in MathML in the final RASH source. This flexibility of using several formats allowed us to create an intuitive and easy-to-use environment in RAJE in which users write down formulas using only AsciiMath as input – since, from preliminary tests we run, AsciiMath seemed to be the language easier to learn, write and remember.

However, even if AsciiMath is the easier language for mathematical formulas that can be adopted in the contest of RASH documents, it could be difficult anyway to remember all the kinds of operation it allows one to specify. In order to address this issue, RAJE also includes several buttons (labelled Operations, Misc, Relations, Symbols, and Formulas, as shown in Figure 3) that allow one to select an operation among a list of preset instructions. The way all these buttons are presented to the user has been guided by considering the OpenOffice formula editor.

https://rawgit.com/gspinaci/spinaci-rajedoc2016/master/img/formulaeditor.png — **Figure 3.** The formulas editor in RAJE.

Storing the article online with GitHub

Since we wanted to have a system that allows to store and share the article on the Web, we implemented a module that allows authors to push the article on a GitHub repository by means of the GiyHub API. GitHub is a web-based version control repository that offers all of the distributed version control and source code management features of Git, and enables access control and several collaboration facilities (e.g. bug tracking, feature requests, task management). In this context, any article created by RAJE can be seen as a repository on GitHub.

The first time an author pushes the article on GitHub, a new repository is created with all the code (i.e. the main text content in HTML) and assets (CSSs, Javascript files, etc.) needed for guaranteeing a correct visualisation of the article. The repository URL is created according to the schema https://github.com/{author_username}/{repository_name}, where {author_username} is replaced with the author's GitHub username, while the {repository_name} is the title of the article.

During the editing phase, if a user wants to store online the content of a document, he must be first logged in with its own Github account, which certifies its identity. Then user has the option to save the document online, on Github servers. This action can be executed in two ways: the shortcut ctrl+shift+s (cmd+shift+s if OSX) or the button "push" under the dropdown with author's Github profile avatar on the top-right of the RAJE interface. A success message is shown once the commit finished successfully.

Technical overview -- TODO: Silvio

In this setion we discuss the used technologies and the adopted development process, in order to build up the tool. RAJE is based on Electron (a.k.a. Atom Shell) which allows developers to generate multi-platform softwares using only web-based technologies: HTML, CSS and JavaScript. Next other integrated modules and script are described, such as Tim Down's rangy.

Electron and APIs

Electron is a Open Source framework created by Github developers, based on the already famous node.js API. It is avery lively project, it count near 13.000 commits divided beetwen the 500 total contributors.Electron is pratically a browser wrapper. It utilize Chromium as browser to create multi platform softwares.

Thanks to this framework is guaranteed the development with web-technologies like HTML, CSS and JavaScript.Two are the important processes in any Electron-based software: The Main process and the Renderer process.The Main process is a node.js script, that includes every kind of necessary information and setting to guarantee the smooth progress of the software, e.g. it creates and shows the windows, uses node.js modules and can acces to databases.Instead the Renderer process is a script imported directly in the document with the classical syntax . It allows RAJE to utilize NPM packages (the manager of node.js modules) and can communicate in an Async or Sync way with the main process to exchange messages.

File System APIs

RAJE is a software Because of the FS api. If it was only a JavaScript file, like rash.js, cannot be possible locally save articles. Only few browsers allows scripts to access the FS and write files and directory [2].

The File System APIs are required, expecially for a software that can execute CRUD operations on files, indeed. The fs library is one of the most important library, because it is placed among default node.js modules, it contains some very useful methods such as readDir (get a the list of files contained in a directory), writeFile (write a file with the given content) and readFile (get the content of the selected file) which are asynchronous. RAJE sometimes uses also them sync counterpart (e.g. readFileSync).

When an author wants to generate a new article, RAJE creates its directory and move inside the whole required assets bundle (which are CSS stylesheets and JavaScript scripts) using few FS functions.

Github api

The APIs to communicate with Github are implemented using a wrapper: Octonode, which grants a lot of actions on contents, users and repositories. Octonode is built on the Github API v3, and it provides also OAuth2 login. With few code lines is possible to integrate an Electron project with the module which provides connection with the protocol OAuth2. To set up a project and arrange to have login, first of all is necessary that the developer application is already created and be in possession of the two necessary codes: client ID and secret ID.As soon the application is created, it is now possible to use some code lines to authenticate users with the protocol OAuth2.

The Octonode library also serves a different method to authenticate with Github. In RAJE the authentication is implemented as follows. All needed functions are stored inside a variable, created when an author pushes the right button to be logged in, at this moment also the client_token is generated. In order to obtain the client_token the author need to authorize RAJE to access its personal information as shown in Figure 4. This window will describe what kind of permissions RAJE needs, it can read all public information about the user (such as email, name, biography and so on) and about public repositories (the ones which are importantfor RAJE).

img/authorize.png — **Figure 4.** Authorize application modal.

Once authorizations are given by the use, a message is sent to Github servers, and a result comes back in return. Normally Github needs an URL callback, so the token can be directly sent there, where it will be elaborated. RAJE, instead, as a browser can handle the navigation events. It can listen for the callback and extract the result code from URL. Then RAJE execute a POST request, to get the access_token, with the function in Listing 1.

function requestGithubToken(githubOptions, code) {apiRequests.post('https://github.com/login/oauth/access_token', {client_id: githubOptions.client_id,client_secret: githubOptions.client_secret,code: code,}).end(function (err, response) {if (response && response.ok) {/** Save github token to settings */storage.setGithubSettings({ 'token': response.body.access_token }, (err) => {if (err) throw errgetUserInfo()})} else {// Error - Show messages.console.log(err)}})
}

Listing 1. Function to request the access_token.

The method in Listing 1 uses a node.js module called superagent, an high level wrapper for ajax requests. With it module RAJE can make a POST request, passing all needed informations in order to retrieve the token. In this snippet the token is saved with the electron_storage module, i.e. a simple environment JSON storage for the current machine.

The login workflow is the following:

The author express the will to be logged in, pressing the login button (placed in the right-top corner).
RAJE shows the authorization modal.
The author eccept authorization requirements.
Github send the URL callback containing the code.
RAJE extract the code and request the token.
Github send the token in return.
RAJE stores the token for next requests.

When the token is stored, every future requests (such as create a repository or push a commit) are instantly served because everything is saved inside the variable client.

When a new article is opened, a local folder is created with the document and all the needed assets, and when authors want to PUSH changes to a non existent repository, it is created. Every time a PUSH is requested the SHA and the content of file to update are required but if no repositories are founded, also in this case RAJE needs to create it.

Inside the client variable we can find the method .me() which returns an instance of the logged user. Object me has inside all allowed operations to users, and among them, the creation of a new repository. With the method client.me().repo({},callback) in our aid, a new repository is created, the first parameter is an object that contains properties like name, description and other less importantinformation.

Once the repository is created we need to push also the assets, which are Javascript scripts, CSS style files and fonts (required by FontAwesome). The current versione of RAJE needs to synchronize these files one after another (synchronous paradigm). Those two are the most important examples of integration with the big API of Github, but they give us the proof of what it can be possible to do.

Web based technology

As already mentioned in the previous sections, RAJE in built on Electron, which uses web technologies as Javascript and CSS to give at client-side behaviours and styles. The splash window3 is simply a html file with its own Javascript and CSS, rendered by the Main process. Instead the editor itself is the HTML file stored inside the article folder. When it is visualised by a browser (Chromium in our case) it has rash.js script that turn the document into RASH-rendered form. In the same way adding raje.js a RASH document can be turned into a editable document with toolbar and other useful elements.

For this reason, the core of this editor is not the software itself, but insted is the imported script. Software gives some important wiring behaviors (FS and Github APIs). The editor behaviour is inside the document (stored as a script), but if someone try to open the file with a normal browser, he can view only a normal RASH file, because the editor will be shown and added only if the document is opened with RAJE. In other words, this is very useful to send read-only documents or show it with rawgit.

Editor's core: raje.js

All the RAJE project flow around the raje.js script. It is built concatenating more scripts (each with its own behaviour and purpose). They are:

init.js
caret.js
const.js
raje.js
shortcuts.js
toolbar.js
derash.js
rendered.js

Init.js is the initialization script, it initializes the variables (like bodyContent 4 or edit_state), extends JQuery object adding more new functions, handle the creation of figures and call the $(document).ready() function to set up the whole editor. Two important functions added to the JQuery object are: setEditable() and setNotEditable(), both act on the document editability.

Very important is what happen to the sections: them are detached from the body and after attached to a new container, i.e. the section with contenteditable attribute sets to true (it is added right below the header).

The second script, that is caret.js. It provides some utility methods about the caret and its position. It contains functions to check if the caret is inside some elements e.g. if is inside the editor, header or an heading (h1,h2 and so on), or to create a selection that wraps entirely the node where the caret is. All the methods here are based on rangy.

Some constants to store numbers and string called multiple times are wrapped inside the const.js script, in order to aware of magic numbers anti-pattern.

Then there is raje.js, a set of actions to add elements directly onto the body. More of them uses the contenteditable APIs (e.g. to add undo/redo) or the method document.execCommand('insertHTML') that add a HTML string to the caret. All figure elements and cross-references are intended as objects, a new element is a new object. There are also methods to add sections of any kind, from normal ones to specials.

All shortcuts are stored in shortcuts.js script. All of them are binded inside an init function called when the document is ready. The shortcuts are all implemented using the Mousetrap module, and needed to trigger a different behaviour from the normal one. Over that the enter key press event, may triggers different behaviour based on where the caret is.

The entire graphic elements set is inside toolbar.js. Here there are variables which contain HTML strings that need to be added after. Among them there is the toolbar and all other modals, each with its own method to show it.

A rendered document is different from the stored one, and rash.js deals with the only transformation from the stored to the rendered article. RAJE need a kind of mechanism to do the specular thing, in otherwords the script derash.js. This comes in aid when we save the article, it creates and beautify the HTML string that is the file. When it creates the string, is completely minified for this reason inside this script there is a function which its objective is to mantain the multi line property and tabulation.

At least rendered.js contains everything to handle communication from and to the Main process, such as: Github login, save document locally, push updates etc.

Modules

@TimDown's rangy is very popular, and it has also a active community leaded by his founder Tim Down. On StackOverflow most of questions about contenteditable and selection management has a rangy-based answer. Every browser implements a naïve selection interface, and for a wider spectrum of methods, we adopted rangy because it grants a lot of useful operations. In a particular way, it works great to know where the caret is, or its position inside the parent. It can also allow to move the caret to start or the end of an element (this behaviour is saved inside caret.js).

Otherwise the submodule rangy-selectionsaverestore can save the current caret position to restore after. E.g. when the user wants to add an image inside the article, after clicking the button, a modal is shown. Here is where the editor needs to save the current selection, because next he probably change the focus to a button, a textbox or directly with a single click in anywhere else position and loose the previous reference where to insert the element. When he chose file and it is saved, RAJE restore the selection to move the caret where it was before, then it add the new element.

The other subpackage is rangy-textrange, and can allows to move the caret ahead or behind of some position (tipically characters). This behaviour is used to exit from inline elements. Inside code elements the spaces are allowed, instead if the author press enter he will exit from it (and the caret will be moved one character ahead).

All the shortcuts are attached directly to the section#rashEditor, because it have the mousetrap class, which is the one recognised by Mousetrap. This script is really easy yo use, is only about to connect somehow a character or a sequence of them. Everythig is referred to a Mousetrap class, and only the function bind is called. The signature means that it accepts a string (that is the sequence to trigger the event) and a callback called when the event is triggered. Mousetrap is used to bind ctrl+b or cmd+b (the plus indicates that the two buttons need to be pressed at the same time) to handle the bold behaviour.

Evaluation

Now, after the description of what RAJE is and how it is implemented, the last step is about testing the software to highlight critical issues and its level of usability. The test survey is made with eSurv.org, findable here. This test is a DUT, i.e. Discount Usability Test. These kind of tests are made involving only few people: it is possible to find up to 80% of the main bugs involving only 3 or 5 people [5][6]. In this section, we present the evaluation of RAJE based on the analysis of the survey's results.

Settings

There were two prerequisites that needed to be satisfied: the RAJE executable must be downloaded and unzipped by testers, and they need a Github account. The testers were required to fill a form organised with the preliminary answers about name, affiliation and reseach area and some assertion categorised as expertise with:

Formats (e.g. DOC(X), ODT...)
Word processors and HTML-based editors
Research articles writing
Control version systems

In total were 17 assertion, with the following format: "I am expert in HTML documents", each needed to be answered with one of the following: Strongly Agree, Agree, Neutral, Disagree and Strongly Disagree. With this answers is possible to create profiles anonymous of the testers' background, which are used before to generate the results with the SUS answers.

In order to capture the usability, the first task was to write this PDF article using RAJE. To obtain this document we followed the normal RASH workflow. We created the test document using RAJE, and we used ROCS to convert the output document in LaTeX with the Springer LNCS style. The TeX document was then compiled in PDF. Inside the document there is the whole set of elements and actions that a user can execute. In the abstract section there are some few text elements and a list. In the next sections there are tables, figures, listings and formulas, it also contains footnotes and bibliographic references.

The second task aimed to evaluate the integration beetwen RAJE and Github, commiting and pushing the repository. To complete this task testers are needed to be logged in with their Github account. Then they have to push the created document, with a initial comment. When the article is saved online as a repository, testers have to navigate to the RawGit link and save it inside the document. After that the output folder must be compressed into a archive and send to my email, in order to evaluate the HTML given in output.

The last pages contains two final questionnaires about testers' experience and feedback, with the aim of collecting enough informations to pursuit their elaboration.

Results

The initiative had a substantial success: RAJE received 6 complete feedback about the usage of its prototype. These feedback were submitted by 6 testers from Canada, Sweden, Germany, Italy and England. These result are very useful to catch real feedback from people which have particular interest in this research area.

We also performed a quantitative analysis of the usability of RAJE, using the System Usability Scale (SUS) questionnaire. The scores are acceptable, though not very high, especially if we consider that all testers was pushed to use a complete new tool, without any kind of introductive tutorial before. RASH yielded a mean score of 60 points, slightly lower than the average SUS score (68).

TODO Show feedback answers

Discussion

TODO Discuss the SUS

img/editor_experience.png — **Figure 5.** Editor experience score.

img/format_experience.png — **Figure 6.** Format experience score.

img/vcs_experience.png — **Figure 7.** VCS experience score.

img/writing_experience.png — **Figure 8.** Writing articles experience score.

TODO Intro: what is the grounded analysis?

Category	Positive	Negative
Interface	9	4
Text editor functionalities	8	15
Github integration and sharing	1	1
Publishing and output	0	2
Tool efficiency	6	1
Other integrations	0	1

Table 1. RAJE Grounded analysis.

The Listing 1 (BUG: change in table in TeX) show us some important data, thanks to the grounded analysis. First of all most of RAJE's lacks are inside the macro category "text editor functionalities". It means that some minor (but important) bugs were faced by testers, and in some cases those bugs denied the normal flow of tool. Basical keyboard shortcuts such as ctrl+a to select the entire content of a single metadata or the tab key to navigate from a table cell to the next one was not implemented yet. Inserting elements could face some issues that did not allowed testers to insert tables, figure or formula (inline or block) inside the document, at the specified position. Moreover another important lack was the editing of bibliographic references, it was intended as raw text. Some testers are accustomed to use pre-arranged libraries as BibTeX because it styles the whole reference.

Instead interface category is mainly positive. The interface itself is pretty intuitive and minimal, toolbar icons and its familiarity with the most important wordprocessors made it user-friendly and easy to understand. The integration with Github is important to grant collaboration beetwen authors was positive in some case, but negative in other (in some cases bugs stopped the module which handled the integration with Github).

At last, the tool was categorised as efficient because it allowed testers to write less - or none - code lines (at difference from LaTeX, or any other markup language) and pursuit this writing task in less time than other ways to reproduce it.

Conclusion

In this paper we talked about how the Semantic Web community and few publishers, are starting to talk about HTML submissions. It is thought as solution for the problems of the PDF format described above. The advantages that HTML bring with him are the easiness to be understood by machines and the interactiveness granted by browsers. PDF files and HTML documents are easily created with visual editors and word processors. Most of them are WYSIWYG, that allow authors to editate directly the output.

HTML wraps a huge amount of elements which can have ambiguity. Two elements can be different, but visualised as same. RASH fix this problem. It is a subset of 32 HTML elements, which grants that every element is univocal. Another streght of RASH is its framework. Validation, visualization and conversion are all actions that RASH framework can do. But it needs a tool to semplify the creation of a RASH file, hiding all the technical facts about the markup structure. RAJE will be placed inside the RASH framework, in order to cover this need. RAJE is a HTML-based WYSIWYG editor, with generates RASH well-formed documents.We described the most popular and used HTML-based editors and word processors, the functionalities and development process in order to create RAJE, with JavaScript libraries, CSS styles and HTML files, and deploy the software packages.

Future development

In addition of fixing bugs/issues , some new features are needed to be added. The most interesting feature are the annotations, that will be integrated using hypothes.is. We also noticed the necessity of a table of content, to navigate in a better way long documents, to expand RAJE to allow chairs to create conferences and manage peer reviewing directly with RAJE, or its hypotetical web site. These are just few way in how we can next expand RAJE.

References

Peroni S., Osborne F., Di Iorio A., Nuzzolese A. G., Poggi F., Vitali F., Motta E. Research Articles in Simplified HTML: a Web-first format for HTML-based scholarly articles (2016) https://essepuntato.github.io/papers/rash-peerj2016.html
[X] [X] [X] [X] Di Iorio A., Nuzzolese A. G., Osborne F., Peroni S., Poggi F., Smith M., Vitali F., Zhao J. The RASH Framework: enabling HTML+RDF submissions in scholarly venues (2016) http://ceur-ws.org/Vol-1486/paper_72.pdf
Filesystem & FileWriter API http://caniuse.com/#feat=filesystem
Aalbersberg I. PDF versus HTML — which do researchers prefer? (2013) https://www.elsevier.com/connect/pdf-versus-html-which-do-researchers-prefer
Nielsen, J. (2009). Discount Usability: 20 Years. https://www.nngroup.com/articles/discount-usability-20-years/ (last visited March 10, 2017)
Nielsen J. Why You Only Need to Test with 5 Users (2000) https://www.nngroup.com/articles/why-you-only-need-to-test-with-5-users/ (last visited March 10, 2017)
Brady, E., Zhong, Y., Bigham, J.P. (2015). Creating accessible PDFs for conference proceedings. In Proceedings of the 12th Web for All Conference (W4A 2015): 34-37. ACM. DOI: https://doi.org/10.1145/2745555.2746665
Lazar, J., Allen, A., Kleinman, J., Malarkey, C. (2007). What frustrates screen reader users on the web: A study of 100 blind users. International Journal of Human-Computer Interaction, 22 (3): 247-269. http://dx.doi.org/10.1080/10447310709336964
Garrish, M., Siegman, T., Gylling, M., McCarron, S. (2016). Digital Publishing WAI-ARIA Module 1.0. W3C Candidate Recommendation, 15 December 2016. https://www.w3.org/TR/dpub-aria-1.0/
Cyganiak, R., Wood, D., Lanthaler, M. (2014). RDF 1.1 Concepts and Abstract Syntax. W3C Recommendation, 25 February 2014. https://www.w3.org/TR/rdf11-concepts/
Di Iorio, A., Gonzalez-Beltran, A., Osborne, F., Peroni, S., Poggi, F., Vitali, F. (2016). It ROCS! The RASH Online Conversion Service. In the Companion Volume of the Proceedings of the 25th International World Wide Web Conference (WWW 2016): 25-26. DOI: https://doi.org/10.1145/2872518.2889408
Capadisli, S., Guy, A., Verborgh, R., Lange, C., Auer, S., Berners-Lee, T. (2017). Decentralised Authoring, Annotations and Notifications for a Read-Write-Web with dokieli. To appear in the Proceedings of the 17th International Conference on Web Engineering (ICWE 2017). Canonical URL: http://csarven.ca/dokieli-rww
Capadisli, S., Guy, A., Lange, C., Auer, S., Berners-Lee, T. (2017). Linked Data Notifications. To appear in the Proceedings of the 14th Extended Semantic Web Conference 2017. http://csarven.ca/linked-data-notifications
Capadisli, S., Guy, A. (2017). Linked Data Notifications. W3C Proposed Recommendation, 21 March 2017. https://www.w3.org/TR/ldn/
Craig, J., Cooper, M. (2014). Accessible Rich Internet Applications (WAI-ARIA) 1.0. W3C Recommendation, 20 March 2014. https://www.w3.org/TR/wai-aria/
Constantin, A., Peroni, S., Pettifer, S., Shotton, D., Vitali, F. (2016). The Document Components Ontology (DoCO). In Semantic Web, 7 (2): 167-181. DOI: https://doi.org/10.3233/SW-150177

Footnotes

Also a little structural change in the structure can broke entirely the header and foreclose a correct RASH validation.^[back]

A list of venue that have adopted RASH as one of the HTML-based formats for the submissions is available at https://github.com/essepuntato/rash/#venues-that-have-adopted-rash-as-submission-format.^[back]

I.e. the first shown window, where authors can choose beetwen creating a new article or open an existing one.^[back]

This variable is used to save the html of the body. Every time some input events are triggered the variable is required to check if there are any content changes.^[back]