Hackathon 0x0c - Language Server Protocol (LSP)

Objective

Text editors are essential tools for software developers. These editors provide many useful features to make developers’ lives easier and improve their productivity.

There is a wide variety of text editors. Moreover, each language has its own characteristics and each company/project may decide to adopt different coding rules. For all these reasons, it is difficult to offer a configuration suitable for everyone.

In this hackathon, we tried to help developers configure their text editors correctly for optimal performance. For this purpose, using the Language Server Protocol (LSP) seemed like a good option.

LSP defines the protocol used between an editor or IDE and a language server that provides language features like autocomplete, go to definition, find all references etc.

– [Microsoft’s official page for Language Server Protocol(https://microsoft.github.io/language-server-protocol/)

With LSP, text editors communicate with a dedicated server for each specific language and benefit from its advanced features.

During this hackathon, we explored the possibilities of using LSP at Intersec. Our team consisted of Guillaume Chevallereau, Maxime Leblanc, Muyao Chen, and Yannick Li.

LSP for C

Our backend is mostly written in C. There are already many servers dedicated to C so we will reuse one that we can customize. A list of already implemented language servers is available here and here.

For the C language, we decided to use clangd implemented by the LLVM Project. clangd provides the main features: code completion, hover, jump to def, workspace symbols, find references and diagnostics.

clangd is a configurable tool. In order to work correctly, it needs to know the compile command required for a file. These compile commands are passed through a file named compile_commands.json.

At Intersec, we use the Waf framework as build system. Waf provides an extension to generate the compile command file but this extension is only available on recent versions, so features like block would not work correctly. We therefore added a modified version of the waf extension to our build system.

This version mainly handles blocks. To this end, block files (which are translated to C during the build) need specific flags while the generated C files must be discarded in the compilation database. The modified version is available here.

Python

There is not much to do for python besides choosing the python LSP implementation. We decided to use python-lsp-server from Spyder IDE team and community. Adding it to our poetry installation was enough.

IOP

The IOP (Intersec Object Packer) is a method to serialize structured data. Data structures are defined in a custom IOP description language, so we have to create our own dedicated LSP server.

The main features that could be implemented are: code completion, hover, jump to def, workspace symbols, find references and diagnostics.

Diagnostics (linting)

We started by focusing on diagnostics (also known as linting). Diagnostics enable you to see errors, warnings, hints and information right on the editor. This information is provided by the compiler but we ran into a problem with our IOP compiler. Currently, it stops after the first error encountered. So if we want to have a real-time display of all errors, we have to fix the IOP compiler so that it keeps compiling after it encounters the first error.

Allowing a compiler to continue listing correctly all errors is a complicated task as it requires ignoring previous errors while keeping its robustness.

Previously, when the compiler detected an error on an IOP file, it logged it and threw an error which stopped the compilation. (This is the safest and easiest way to implement a compiler).

To improve it without refactoring the whole compiler code, we decided to upgrade each parsing functions. So now, when an error is detected, the compiler still logs it, but sets an error flag and continues the parsing as if there was no error. The error flag is returned to the caller and the function ends. That way, we keep the same behavior as before. (This also allows us to update functions step by step).

However, we cannot handle all kinds of error. Especially syntaxical errors (like missing brackets…) as it can be tricky to detect the real end of the instruction. We thus wanted the compiler to end the parsing for this type of error.

Nevertheless, In this hackathon, we reused the old IOP compiler to show the first bug it found. The result is shown in the following screenshot.

IOP linter

In order to do that, we cloned the Microsoft’s LSP example and changed it to use our own compiler. The changes that we made are as follows:

Activate the LSP

LSP mode is activated on events. The most common event is the file language which is mainly determined by its extension. For example, a file with the extension *.c is considered as a C file. However, we can also choose the language in VS Code if the extension is not recognized automatically by the editor.

Since IOP is a new language that we have created, it is not recognized. We should create a new language package for IOP files to define all the syntax. Because this is not our main concern in the hackathon, we decided to simply declare all plaintext files as IOP files, as the editor will trigger our LSP server when it examines a text file which includes *.iop. If we want to really use the LSP server for IOP files, we have to properly declare a new language by following these steps .

lsp-sample/package.json

       "activationEvents": [
               "onLanguage:plaintext"
       ],