Creating a new diagnostic in quick-lint-js
A common task when contributing to quick-lint-js is to create a new diagnostic. In quick-lint-js, diagnostic is a warning or error reported while parsing the user's JavaScript code.
Creating a diagnostic involves four pieces of code:
- Diagnostic type and metadata
- Test for the diagnostic
- Detection and reporting of the diagnostic
- Documentation for the website
1. Diagnostic type and metadata
Diagnostic types are listed in
src/quick-lint-js/diag/diagnostic-types-2.h
. They look like this:
struct Diag_Missing_Body_For_Try_Statement {
[[qljs::diag("E0120", Diagnostic_Severity::error)]] //
[[qljs::message("missing body for try statement",
ARG(try_token))]] //
Source_Code_Span try_token;
};
Each diagnostic type is a C++
struct
with some custom C++ [[attributes]].
The C++ compiler ignores the attributes and just sees a class:
struct Diag_Missing_Body_For_Try_Statement {
Source_Code_Span try_token;
};
The custom attributes are processed by a separate tool as part of quick-lint-js's build system. We will discuss the custom attributes shortly.
Let's pick a name for our
diagnostic. The name needs to start with
Diag_
and be a legal C++ class name. We will
use this name later to report the diagnostic and to test for the
diagnostic:
struct Diag_Comparison_With_Empty_String {
[[qljs::diag(/* TODO */, /* TODO */)]] //
[[qljs::message(/* TODO */)]] //
/* TODO */
};
Each diagnostic in quick-lint-js has a unique diagnostic code as the
first argument to
[[qljs::diag]]
. A diagnostic code is the
letter E
followed by four decimal digits. Let's be lazy
and reuse an existing diagnostic code for now:
struct Diag_Comparison_With_Empty_String {
[[qljs::diag("E0001", /* TODO */)]] //
[[qljs::message(/* TODO */)]] //
/* TODO */
};
We also need to pick a diagnostic severity, either
Diagnostic_Severity::error
or
Diagnostic_Severity::warning
, as the
second argument to
[[qljs::diag]]
. Our new diagnostic is for a
possible issue, so let's pick
warning
:
struct Diag_Comparison_With_Empty_String {
[[qljs::diag("E0001", Diagnostic_Severity::warning)]] //
[[qljs::message(/* TODO */)]] //
/* TODO */
};
Each diagnostic is a class (struct
). The
class needs to store at least one
Source_Code_Span
member variable so the
editor knows where to put the
squigglies. We should think about
where the squigglies should go and name our member variable
appropriately to make the reporting and testing code easier to read.
Since our diagnostic is about string comparisons, let's name the
member variable comparison_operator
. We write
the member variables after
the C++ attributes:
struct Diag_Comparison_With_Empty_String {
[[qljs::diag("E0001", Diagnostic_Severity::warning)]] //
[[qljs::message(/* TODO */)]] //
Source_Code_Span comparison_operator;
};
Each diagnostic needs a message specified by one or more
[[qljs::message]]
attributes. Most diagnostics are simple and just have a simple string.
Other diagnostics might have formatting or multiple strings. Our
diagnostic is simple, so let's just write a single string with no
formatting. Don't forget to mention the
Source_Code_Span
member variable we defined
inside our Diag_
class:
struct Diag_Comparison_With_Empty_String {
[[qljs::diag("E0001", Diagnostic_Severity::warning)]] //
[[qljs::message("comparing against empty strings is silly",
ARG(comparison_operator))]] //
Source_Code_Span comparison_operator;
};
After adding the diagnostic type to diagnostic-types-2.h
,
build quick-lint-js. You should get a build error like the following:
diagnostic-types-2.h:655:20: error: diag code "E0001" already in use; try this unused diag code: "E0331" diagnostic-types-2.h:107:16: note: Diag_Assignment_Before_Variable_Declaration used code "E0001" here
A build check is telling us that the error code we chose
(E0001
) is already in use. Let's change our
Diag_
class in
diagnostic-types-2.h
to use the unused diagnostic code
suggested by the check:
struct Diag_Comparison_With_Empty_String {
[[qljs::diag("E0331", Diagnostic_Severity::warning)]] //
[[qljs::message("comparing against empty strings is silly",
ARG(comparison_operator))]] //
Source_Code_Span comparison_operator;
};
Now let's build quick-lint-js again and run the tests. We should get no failures, which means we didn't break anything:
ninja -C build quick-lint-js-test
ninja: Entering directory `build'
[103/103] Linking CXX executable test/quick-lint-js-test
./build/test/quick-lint-js-test --gtest_brief=1
Running main() from gmock_main.cc
[==========] 1980 tests from 151 test suites ran. (772 ms total)
[ PASSED ] 1980 tests.
The build scripts modified a few files for us. Make sure you include these files in your commit:
git status
git status
On branch dev3
Changes not staged for commit:
(use "git add <file>..." to update what will be committed)
(use "git restore <file>..." to discard changes in working directory)
modified: po/messages.pot
modified: src/quick-lint-js/diag/diagnostic-metadata-generated.cpp
modified: src/quick-lint-js/diag/diagnostic-types-2.h
modified: src/quick-lint-js/i18n/translation-table-generated.cpp
modified: src/quick-lint-js/i18n/translation-table-generated.h
modified: src/quick-lint-js/i18n/translation-table-test-generated.h
no changes added to commit (use "git add" and/or "git commit -a")
Now that we have created the diagnostic type, let's move on to writing a test.
2. Test for the diagnostic
All diagnostics must be tested with an automated test. To create a
test, copy-paste an existing test in a
test/test-parse-*.cpp
file and tweak it. Let's put our
test in
test/test-parse-warning.cpp
:
TEST_F(Test_Parse_Warning, warn_on_empty_string_literal_comparison) {
test_parse_and_visit_expression(
u8"a === ''"_sv, //
u8" ^^^ Diag_Comparison_With_Empty_String"_diag);
}
There are a few pieces of this test worth mentioning:
-
test_parse_and_visit_expression
-
quick-lint-js' parser can parse several things, including
statements, expressions, and TypeScript types. Our diagnostic is
specific to JavaScript expressions, so we call
test_parse_and_visit_expression
. -
u8"a == ''"_sv
-
The input source code we want to test. The
u8
prefix is required so the code is parsed as UTF-8. The_sv
suffix is required so that code containing null bytes is handled correctly. -
^^^
-
We need to tell
test_parse_and_visit_expression
where in the source code the diagnostic should be reported. This is represented using two parts inside a string: alignment (spaces) and a span (one or more^
s, or one`
). In our example, there are two leading spaces, so the diagnostic should start at the third character (byte offset 2). The span is three characters wide (^^^
), so the diagnostic covers three characters (up until byte offset 5). The alignment and span specify that the diagnostic should cover offsets 2, 3, and 4:a === '' ^^^ offsets 2, 3, and 4
-
Diag_Comparison_With_Empty_String
-
We need to tell
test_parse_and_visit_expression
which kind of diagnostic we expect. We do this by writing the diagnostic class's name after the span.If a diagnostic has multiple members (such as if the diagnostic has multiple messages), the member name must appear after the diagnostic class's name. See the documentation for
_diag
and NOTE[_diag-syntax] for details.
Build and run the test to make sure it fails. The failure says that we expected a diagnostic, but didn't get any. This makes sense because we haven't written the code to report the diagnostic yet:
ninja -C build quick-lint-js-test
ninja: Entering directory `build'
[2/2] Linking CXX executable test/quick-lint-js-test
./build/test/quick-lint-js-test --gtest_brief=1
Running main() from gmock_main.cc
/home/strager/Projects/quick-lint-js-sl/test/test-parse-warning.cpp:561: Failure
Value of: p.errors
Expected: has 1 element that has type Diag_Comparison_With_Empty_String
Actual: {}
[ FAILED ] Test_Parse_Warning.warn_on_empty_string_literal_comparison (0 ms)
main thread ID: 373982
[==========] 1981 tests from 151 test suites ran. (755 ms total)
[ PASSED ] 1980 tests.
3. Detection and reporting of the diagnostic
Now for the hard part: writing the production code. Most likely we will report our diagnostic in one of these files:
- src/quick-lint-js/fe/lex.cpp
- src/quick-lint-js/fe/parse.cpp or parse-*.cpp
- src/quick-lint-js/fe/variable-analyzer.cpp
But these files contain thousands of lines of code. How do we know where to put our new code?
One technique is to step through the code in a debugger:
$ gdb --args ./build/test/quick-lint-js-test --gtest_filter=Test_Parse_Warning.warn_on_empty_string_literal_comparison (gdb) b parse_and_visit_expression Breakpoint 1 at 0x754ae0: parse_and_visit_expression. (3 locations) (gdb) run Starting program: build/test/quick-lint-js-test --gtest_filter=Test_Parse_Warning.warn_on_empty_string_literal_comparison Running main() from gmock_main.cc Note: Google Test filter = Test_Parse_Warning.warn_on_empty_string_literal_comparison [==========] Running 1 test from 1 test suite. [----------] Global test environment set-up. [----------] 1 test from Test_Parse_Warning [ RUN ] Test_Parse_Warning.warn_on_empty_string_literal_comparison Breakpoint 1, quick_lint_js::Test_Parser::parse_and_visit_expression at test/./quick-lint-js/parse-support.h:97 97 this->parser_.parse_and_visit_expression(this->errors_); (gdb) c Continuing. Breakpoint 1, quick_lint_js::Parser::parse_and_visit_expression at src/./quick-lint-js/fe/parse.h:174 174 this->parse_and_visit_expression(v, precedence{}); (gdb) c Continuing. Breakpoint 1, quick_lint_js::Parser::parse_and_visit_expression at src/./quick-lint-js/fe/parse.h:550 550 Monotonic_Allocator &alloc = *this->expressions_.allocator(); (gdb) n 551 auto rewind_guard = alloc.make_rewind_guard(); (gdb) n 553 Expression *ast = this->parse_expression(v, prec); (gdb) n 555 auto disable_guard = alloc.disable(); (gdb) n 556 this->visit_expression(ast, v, variable_context::rhs); (gdb) step quick_lint_js::Parser::visit_expression at src/quick-lint-js/fe/parse-expression.cpp:36 36 auto visit_children = [&] { (gdb) n 41 switch (ast->kind()) { (gdb) n 67 visit_children(); (gdb) n 69 static_cast<Expression::Binary_Operator*>(ast)); (gdb) n 68 this->error_on_pointless_compare_against_literal(
error_on_pointless_compare_against_literal
looks like a good place to put our code.
Detecting when to report the diagnostic is up to you. But once you have the information you need, reporting a diagnostics is easy:
Source_Code_Span op_span = /* usually this->peek().span */;
this->diag_reporter_->report(Diag_Comparison_With_Empty_String{
.comparison_operator = op_span,
});
Build and test to prove that our code worked:
ninja -C build quick-lint-js-test
ninja: Entering directory `build'
[3/3] Linking CXX executable test/quick-lint-js-test
./build/test/quick-lint-js-test --gtest_brief=1
Running main() from gmock_main.cc
main thread ID: 375845
[==========] 1981 tests from 151 test suites ran. (785 ms total)
[ PASSED ] 1981 tests.
Huzzah! 🥳
But we're not done yet... We still have to write 💀 documentation 💀
4. Documentation for the website
Each diagnostic has associated documentation stored separately from the code. The docs are stored in docs/errors/ with one file per diagnostic. Let's write our documentation:
# E0331: comparing against empty strings is silly
Empty strings are error-prone, and comparing against empty strings is extra
error-prone:
```javascript
let x = prompt();
if (x === '') {
alert('You were supposed to type something!');
}
```
To fix this mistake, check the string's `length` property instead:
```javascript
let x = prompt();
if (x.length === 0) {
alert('You were supposed to type something!');
}
```
Alternatively, treat the string as a boolean:
```javascript
let x = prompt();
if (!x) {
alert('You were supposed to type something!');
}
```
(This is an example diagnostic for contributor documentation purposes. Comparing
against empty strings is totally fine, and the fact that quick-lint-js reports a
diagnostic in this case is sad.)
Some important parts of diagnostic documentation:
- title
-
The title of the document should include the diagnostic's code. The
diagnostic's message should follow the code. The title should
include the message mentioned in
diagnostic-types-2.h
, but it doesn't have to match exactly. Interpolation markers such as{1:headlinese}
should be omitted. - first example
-
The first code snippet should be fenced with
```javascript
or```typescript
(or another other support language). This code snippet demonstrates broken code and must cause quick-lint-js to report a diagnostic. A broken code snippet is required. - second example
- The second code snippet should also be fenced. This code snippet demonstrates working code and must not cause quick-lint-js to report any diagnostic. A working code snippet is required.
- extra examples
- You can include more code snippets after the second. Each of these extra code snippet must cause no diagnostics. Usually these code snippets show alternate ways of addressing the original issue. These extra examples are optional.