Language-specific tree-sitter major mode won't show up

Question

I'm using GNU Emacs 29.1 and have installed the tree-sitter Haskell grammar (libtree-sitter-haskell.so) via treesit-install-language-grammar, but haskell-ts-mode won't show up as a major mode under M-x.

However, rust-ts-mode shows up even though I don't have the corresponding grammar installed.

Why is this?

There is a rust-ts-mode.el file, and it is quite long. Do I need to implement a tree-sitter major mode for each language I want to use tree-sitter with? I thought that was what the language-specific grammar was for.

score 0 · Accepted Answer · answered Aug 25 '23 at 20:57

Yes, you need both the tree-sitter grammar and a major mode that, at the very least, calls treesit-parser-create when it starts. The first thing rust-ts-mode does is this:

  (when (treesit-ready-p 'rust)
    (treesit-parser-create 'rust)
    ;; ... the rest of the mode setup is elided here ...
    )

Your mode should do the same for Haskell, so that the grammar you installed gets loaded. How else would the treesit package know to load it?
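
As a rough illustration, a bare-bones mode for Haskell might look like the sketch below. The name my-haskell-ts-mode is hypothetical (it is not an existing package), and the sketch assumes Emacs 29 or later with a grammar available under the language symbol haskell. It only creates the parser, so on its own it gives no highlighting or indentation.

```elisp
;; Sketch only: `my-haskell-ts-mode' is a hypothetical name.
(require 'treesit)

(define-derived-mode my-haskell-ts-mode prog-mode "Haskell[ts]"
  "Minimal tree-sitter based major mode for Haskell (illustrative sketch)."
  ;; `treesit-ready-p' checks that tree-sitter support is available and
  ;; that the grammar for `haskell' can be loaded; it warns otherwise.
  (when (treesit-ready-p 'haskell)
    ;; Creating the parser is what actually loads the installed grammar.
    (treesit-parser-create 'haskell)
    ;; Let treesit wire up whatever font-lock, indentation and Imenu
    ;; settings the mode has configured (none yet in this sketch).
    (treesit-major-mode-setup)))

;; Optional: use the mode for .hs files.
(add-to-list 'auto-mode-alist '("\\.hs\\'" . my-haskell-ts-mode))
```

Once a definition like this has been evaluated, M-x my-haskell-ts-mode should be offered. M-x lists commands that are defined, not grammars that are installed, which is also why rust-ts-mode appears without its grammar: rust-ts-mode.el ships with Emacs 29.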

Note that a tree-sitter mode needs to do a number of other things as well. It needs to translate between the grammar's node types and Emacs font-lock settings, since that is what tells Emacs how to do syntax highlighting.
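
A minimal sketch of that translation, assuming the installed Haskell grammar has node types named comment and string (worth verifying with M-x treesit-explore-mode, since grammar versions differ). All my-haskell-ts-- names are hypothetical.

```elisp
(require 'treesit)

;; Assumption: the node types `comment' and `string' exist in the
;; installed Haskell grammar.
(defvar my-haskell-ts--font-lock-settings
  (treesit-font-lock-rules
   :language 'haskell
   :feature 'comment
   '((comment) @font-lock-comment-face)

   :language 'haskell
   :feature 'string
   '((string) @font-lock-string-face))
  "Hypothetical font-lock rules mapping grammar nodes to faces.")

(defun my-haskell-ts--setup-font-lock ()
  "Install the sketch font-lock rules; meant to be called from the mode body.
The settings take effect when the mode then calls `treesit-major-mode-setup'."
  (setq-local treesit-font-lock-settings my-haskell-ts--font-lock-settings)
  ;; Features are grouped into levels; `treesit-font-lock-level' decides
  ;; how many of these levels are enabled.
  (setq-local treesit-font-lock-feature-list '((comment) (string))))
```

A real mode would need many more rules (keywords, types, operators and so on), which is a large part of why files like rust-ts-mode.el are long.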

It may need to add syntax properties to the buffer, if Haskell gives multiple meanings to the same character. For example, in Rust < and > might just be the less-than and greater-than operators, or they might be paired delimiters around a list of generic type arguments. By adding the correct syntax properties to these characters in the buffer, the mode tells Emacs which ones are paired and which ones are operators. And of course, don’t forget to set up the syntax table for characters which are not ambiguous.
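
A sketch of both halves for Haskell, using the apostrophe as the ambiguous character: it can end an identifier such as foldl' or delimit a character literal. This is an assumption-laden illustration loosely modelled on the approach in rust-ts-mode, not a tested implementation. It assumes the grammar reports character literals as nodes of type "char", and every my-haskell-ts-- name is hypothetical.

```elisp
(require 'treesit)

(defvar my-haskell-ts--syntax-table
  (let ((table (make-syntax-table)))
    ;; Unambiguous characters are handled by the syntax table.
    (modify-syntax-entry ?\" "\"" table)   ; string delimiter
    (modify-syntax-entry ?\\ "\\" table)   ; escape character
    ;; Default for ': symbol constituent, so foldl' reads as one symbol.
    (modify-syntax-entry ?\' "_" table)
    table)
  "Hypothetical syntax table for the sketch mode.")

(defun my-haskell-ts--syntax-propertize (beg end)
  "Mark apostrophes that delimit character literals between BEG and END.
Assumes such literals are nodes of type \"char\" in the grammar."
  (goto-char beg)
  (while (search-forward "'" end t)
    (let* ((pos (match-beginning 0))
           (node (treesit-node-at pos)))
      (when (equal (treesit-node-type node) "char")
        ;; Override the table entry for this one character only.
        (put-text-property pos (1+ pos) 'syntax-table
                           (string-to-syntax "\""))))))

(defun my-haskell-ts--setup-syntax ()
  "Install the sketch syntax table and propertize function."
  (set-syntax-table my-haskell-ts--syntax-table)
  (setq-local syntax-propertize-function
              #'my-haskell-ts--syntax-propertize))
```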

It almost certainly needs to set a bunch of variables related to comments, so that a number of Emacs features, such as text wrapping and cursor motion, work correctly.
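
For Haskell line comments, those variables might be set roughly as follows. This is a sketch that ignores {- ... -} block comments, and the function name is hypothetical.

```elisp
(defun my-haskell-ts--setup-comments ()
  "Set comment variables for Haskell line comments (illustrative sketch)."
  ;; What M-; (`comment-dwim') inserts when starting a comment.
  (setq-local comment-start "-- ")
  ;; Line comments end at the newline, so there is no closing delimiter.
  (setq-local comment-end "")
  ;; Regexp matching a comment opener plus any following whitespace; used
  ;; by commands that fill or uncomment text.
  (setq-local comment-start-skip "--+[ \t]*"))
```

The syntax table also has to classify the comment delimiters for motion commands to skip comments correctly; that part is not shown here.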

Users will certainly expect it to set up some kind of automatic indentation. The treesit package doesn’t know how Haskell is indented, and the grammar doesn’t encode that sort of information even if it is helpful for determining when or how much to indent.
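
The treesit package does provide a rule-based engine for this through treesit-simple-indent-rules. The sketch below is deliberately naive and is not a usable Haskell indenter, since layout-sensitive indentation needs far more rules. It assumes the grammar's root node type is "haskell", and the my-haskell-ts-- names are hypothetical.

```elisp
(require 'treesit)

(defvar my-haskell-ts--indent-rules
  '((haskell
     ;; Each rule is (MATCHER ANCHOR OFFSET); the first match wins.
     ;; Assumption: the root node type is "haskell", so its direct
     ;; children are top-level declarations and start at column 0.
     ((parent-is "haskell") column-0 0)
     ;; Closing brackets line up with the line where the parent starts.
     ((node-is ")") parent-bol 0)
     ((node-is "]") parent-bol 0)
     ;; No node at point (for example an empty line): follow the parent.
     (no-node parent-bol 0)
     ;; Fallback: two columns past the start of the parent's line.
     (catch-all parent-bol 2)))
  "Hypothetical, deliberately naive indentation rules.")

(defun my-haskell-ts--setup-indent ()
  "Install the sketch rules; meant to be called from the mode body.
They take effect when the mode then calls `treesit-major-mode-setup'."
  (setq-local treesit-simple-indent-rules my-haskell-ts--indent-rules))
```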

Not every Emacs user uses Imenu, but most of them probably do. You should think about how to configure it. The treesit package does provide some help for this; in the simple case all you have to do is set treesit-simple-imenu-settings appropriately. Treesit then looks for matching nodes produced by your grammar and sends the data off to Imenu for you.
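
A sketch of the simple case. The node types "function" and "data_type" are assumptions about the Haskell grammar and may be named differently in the version you installed; the function name is hypothetical.

```elisp
(require 'treesit)

(defun my-haskell-ts--setup-imenu ()
  "Configure Imenu from tree-sitter nodes (illustrative sketch)."
  ;; Each entry is (CATEGORY REGEXP PRED NAME-FN).  REGEXP is matched
  ;; against node types.  With a nil NAME-FN, `treesit-defun-name' is
  ;; used, which relies on the mode setting `treesit-defun-name-function'.
  (setq-local treesit-simple-imenu-settings
              '(("Function" "\\`function\\'" nil nil)
                ("Type" "\\`data_type\\'" nil nil))))
```

As with font-lock and indentation, the setting is picked up when the mode body calls treesit-major-mode-setup afterwards.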

A really good language mode is a recursive fractal of complexity, but a tree-sitter grammar is a good start.

That is unfortunate. Thank you very much for your detailed answer. It helped clear up some misunderstandings of what language modes (should) actually do. — hubbledeepfield, Aug 26 '23 at 00:28
You’re welcome. What exactly created those misunderstandings? — db48x, Aug 26 '23 at 08:25
I guess I was just hoping that tree-sitter would provide a unified interface that font-lock could use out of the box. I haven't used Imenu before, so it makes sense that you would need to define the major definitions somewhere, as tree-sitter does not seem to do that. I am also fairly new to Emacs and just stopped using Doom, so I was never confronted with its inner workings. — hubbledeepfield, Aug 26 '23 at 13:25
