diff --git a/CHANGELOG.md b/CHANGELOG.md index fb93ed5b8..8d9b29699 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -18,6 +18,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0 - Corresponding stdlib functions for new TVM instructions from 2023.07 and 2024.04 upgrades: PR [#331](https://github.com/tact-lang/tact/pull/331) - `slice`, `rawSlice`, `ascii` and `crc32` built-in functions: PR [#787](https://github.com/tact-lang/tact/pull/787), PR [#799](https://github.com/tact-lang/tact/pull/799) - `Builder.storeMaybeRef`, `parseStdAddress` and `parseVarAddress` stdlib functions: PR [#793](https://github.com/tact-lang/tact/pull/793) +- The compiler development guide: PR [#833](https://github.com/tact-lang/tact/pull/833) ### Changed diff --git a/CONTRIBUTING.md b/CONTRIBUTING.md new file mode 100644 index 000000000..52f682eff --- /dev/null +++ b/CONTRIBUTING.md @@ -0,0 +1,178 @@ +# Hacking on Tact + +## General information + +The Tact smart contract programming language is a statically-typed smart contract programming language which is currently implemented as a transpiler into [FunC](https://docs.ton.org/develop/func), which in its turn compiles down to the [TVM](https://docs.ton.org/learn/tvm-instructions/tvm-overview) bitcode. This implementation strategy is likely to change in the future. + +The Tact compiler parses the input source code into an abstract syntax tree, type-checks it, generates FunC code, runs the FunC compiler, which produces the corresponding [Fift](https://docs.ton.org/develop/fift/overview) code and a TVM binary in the [BoC](https://docs.ton.org/develop/data-formats/cell-boc) format. + +Besides TVM binaries, the Tact compiler generates TypeScript "wrappers" to conveniently test Tact contracts locally in a simulated blockchain environment using, for instance, the standard de-facto [Sandbox](https://github.com/ton-org/sandbox) package. + +Additionally, it generates summaries for humans and machines in Markdown and JSON formats. +The summaries include information like + +- binary code size, +- [TL-B](https://docs.ton.org/develop/data-formats/tl-b-language) schemas for the program types including contract storage and message formats, +- TVM [exit codes](https://docs.ton.org/learn/tvm-instructions/tvm-exit-codes), +- trait inheritance and contract dependency diagrams. + +Currently, Tact does not have a (formal) language specification, so one needs to consult the [Tact docs](https://docs.tact-lang.org) and the tests in this repository. + +The list of known bugs can be obtained using the following GitHub request: . + +More detail about different part of the compiler including their corresponding entry points can be found below. + +## The Tact dependencies + +Tact is implemented in TypeScript. The minimum required version for Node.js is 22. + +The rest of the build and development dependencies are specified, as usual, in the [package.json](./package.json) file and the most important ones are described in the present document. + +Tact's pipeline uses a patched version of the FunC compiler vendored as a WASM binary with some JS wrappers, see the following files: + +- [src/func/funcfiftlib.js](./src/func/funcfiftlib.js) +- [src/func/funcfiftlib.wasm](./src/func/funcfiftlib.wasm) +- [src/func/funcfiftlib.wasm.js](./src/func/funcfiftlib.wasm.js) + +The message of the commit [`f777da3213e3b064a7f407b2569cfd546cca277e`](https://github.com/tact-lang/tact/commit/f777da3213e3b064a7f407b2569cfd546cca277e) explains how the patched version was obtained. We had to patch the FunC compiler because the corresponding [FunC compiler issue](https://github.com/ton-blockchain/ton/issues/971) is unresolved at the time of writing. + +## Building Tact + +The most up-to-date recipe to build Tact is described in [.github/workflows/tact.yml](./.github/workflows/tact.yml) GitHub Actions file. + +## Testing Tact implementation + +As our testing framework we use [Jest](https://jestjs.io). We use a combination of snapshot and expectation tests supported by Jest. + +Some tests are put in the same folder with the implementation and can be located in `*.spec.ts` files, +other tests are grouped into categories in the [src/test](./src/test) folder. The project map section has more information on tests relevant for each compiler component. + +### How to update test snapshots + +Updating all the test snapshots: + +```shell +yarn test -u +``` + +Updating a subset of the test snapshots can be done like so: + +```shell +yarn test -u spec-name-pattern1 spec-name-pattern2 +``` + +## Linting + +To pass CI, one needs to have a warning-free build. To run all the lints described below execute the following command in your terminal: + +```shell +yarn lint:all +``` + +### Linting the entire codebase with ESLint + +Running [ESLint](https://eslint.org) across the whole Tact codebase: + +```shell +yarn lint +``` + +### Spell-checking + +To spell-check the entire codebase with [CSpell](http://cspell.org) run: + +```shell +yarn spell +``` + +### Knip + +The [Knip](https://knip.dev) tool is used to check issues with the compiler dependencies and API. +It can be run with + +```shell +yarn knip +``` + +## Project map + +### Compiler driver + +Tact's command-line interface (CLI) is located in [bin/tact.js](./bin/tact.js). +Tact uses the [meow](https://github.com/sindresorhus/meow) CLI arguments parser. + +The main entry point for the Tact CLI is [src/node.ts](./src/node.ts) and [src/pipeline/build.ts](./src/pipeline/build.ts) is the platform-independent compiler driver which contains the high-level compiler pipeline logic described above. + +The Tact CLI gets Tact settings from a `tact.config.json` file or creates a default config for a single-file compilation mode. The format of `tact.config.json` files is specified in [schemas/configSchema.json](./schemas/configSchema.json). + +The so-called "pre-compilation" steps that include imports resolution, type-checking, building schemas for high-level Tact data structures to be serialized/deserialized as cells (this step is dubbed "allocation") are located in [src/pipeline/precompile.ts](src/pipeline/precompile.ts). + +Besides the terminal, the Tact compiler is supposed to work in browser environments as well. + +Some CLI tests can be found in [.github/workflows/tact.yml](./.github/workflows/tact.yml) GitHub Action file. + +### Parser + +The [src/grammar/grammar.ohm](./src/grammar/grammar.ohm) file contains the Tact grammar expressed in the PEG-like language of the [Ohm.js](https://ohmjs.org) parser generator. + +The helper file [src/grammar/grammar.ts](./src/grammar/grammar.ts) contains the logic that transforms concrete syntax trees produced with the help of the Ohm.js-generated parser into abstract syntax trees (ASTs) defined in [src/grammar/ast.ts](./src/grammar/ast.ts). The grammar.ts file also does a bit of grammar validation, like checking that function or constant attributes are not duplicated or that user identifiers do not start with certain reserved prefixes. + +The [src/grammar/test](./src/grammar/test) folder contains Tact files that are supposed to be parsed without any issues, and the [src/grammar/test-failed](./src/grammar/test-failed) folder contains grammatically incorrect test files which should result in parser errors. The parser error messages and the locations they point to are fixed in the [src/grammar/**snapshots**/grammar.spec.ts.snap](./src/grammar/__snapshots__/grammar.spec.ts.snap) Jest snapshot file. + +### Typechecker + +The Tact type-checker's implementation can be found mostly in the following files: + +- [src/types/resolveDescriptors.ts](./src/types/resolveDescriptors.ts) takes care of checking at the level of module-items, data type definitions, function signatures, etc. and it does not deal with statements (so does not traverse function bodies); +- [src/types/resolveStatements.ts](./src/types/resolveStatements.ts) checks statements and statements blocks; +- [src/types/resolveExpression.ts](./src/types/resolveExpression.ts) type-checks the Tact expressions. + +The current implementation of the typechecker is going to be significantly refactored, as per [issue #458](https://github.com/tact-lang/tact/issues/458). The corresponding pull request will have formally specified the Tact typing rules. + +Until we have the Tact type system specified, the only source of information about it would be the aforementioned Tact docs and the tests in the following locations: + +- [src/types/test](./src/types/test): positive well-formedness tests at the level of data types, contracts, traits and function signatures; +- [src/types/test-failed](./src/types/test-failed): negative well-formedness tests at the level of data types, contracts, traits and function signatures; +- [src/types/stmts](./src/types/stmts): positive type-checking tests at the level of function bodies; +- [src/types/stmts-failed](./src/types/stmts-failed): negative type-checking tests at the level of function bodies; +- [src/test/compilation-failed](./src/test/compilation-failed): negative type-checking tests that require full environment, for instance, the standard library (the other tests in `src/types` don't have access to the full environment). + +### Constant evaluator + +The constant evaluator is used as an optimizer to prevent some statically known expressions from being evaluated at run-time and increase gas consumption of the contracts. It will be later extended to perform partial evaluation of contracts and use various simplification rules such as applying some algebraic laws to further reduce gas consumption of contracts at run-time. + +The constant evaluator supports a large subset of Tact and handles, for instance, constants defined in terms of other constants, built-in and user-defined functions, logical and arithmetic operations. + +The main logic of the constant evaluator can be found in the file [src/interpreter.ts](./src/interpreter.ts). + +You can find the relevant tests in [src/test/e2e-emulated/contracts/constants.tact](./src/test/e2e-emulated/contracts/constants.tact) and the corresponding spec-file: [](./src/test/e2e-emulated/constants.spec.ts). + +The negative tests for constant evaluation are contained in the Tact files prefixed with `const-eval` in the [src/test/compilation-failed/contracts](./src/test/compilation-failed/contracts) folder. + +### Code generator + +Some general information on how Tact code maps to FunC is described in the Tact docs: . + +The code generator lives in the [src/generator](./src/generator) sub-folder with the entry point in [src/generator/writeProgram.ts](./src/generator/writeProgram.ts). + +The implementation that we have right now is being refactored to produce FunC ASTs and then pretty-print those ASTs as strings instead of producing source FunC code in one step. Here is the relevant pull request: . + +One can find the end-to-end codegen test spec files in the [src/test/e2e-emulated](./src/test/e2e-emulated/) folder. The test contracts are located in [src/test/e2e-emulated/contracts](./src/test/e2e-emulated/contracts) subfolder. Many of those spec files test various language features in relative isolation. +An important spec file that tests argument passing semantics for functions and assignment semantics for variables is here: [src/test/e2e-emulated/semantics.spec.ts](./src/test/e2e-emulated/semantics.spec.ts). + +Note: If you add an end-to-end test contract, you also need to include it into [tact.config.json](./tact.config.json) and run `yarn gen` to compile it and create TypeScript wrappers. + +`yarn gen` also re-compiles test contracts, so it's important to run it when code generation is changed. + +Some other codegen tests are as follows: + +- [src/test/exit-codes](./src/test/exit-codes): test that certain actions produce the expected exit codes; +- [src/test/codegen](./src/test/codegen/): test that these contracts compile just fine without running any dynamic tests: bug fixes for FunC code generation often add tests into this folder. + +### Pretty-printer and AST comparators + +The entry point to the Tact AST pretty-printer is [src/prettyPrinter.ts](./src/prettyPrinter.ts). It is going to be used for the Tact source code formatter once the parser keeps comments and other relevant information. + +The AST comparator is defined in [src/grammar/compare.ts](./src/grammar/compare.ts). This is useful, for instance, for static analysis tools which can re-use the Tact TypeScript API. + +The corresponding test spec files can be found in [src/test](./src/test/) folder with the test contracts in [src/test/contracts](./src/test/contracts/) folder. diff --git a/README.md b/README.md index c170c69d9..af7802c96 100644 --- a/README.md +++ b/README.md @@ -4,9 +4,10 @@ A next-gen smart contract language for TON focused on efficiency and simplicity. -- [Changelog](/CHANGELOG.md) -- [Roadmap](/ROADMAP.md) -- [Examples](/examples/) +- [Changelog](./CHANGELOG.md) +- [How to contribute to Tact development](./CONTRIBUTING.md) +- [Roadmap](./ROADMAP.md) +- [Examples](./examples/) ## Key Resources diff --git a/cspell.json b/cspell.json index f688ba7ae..b43015a3c 100644 --- a/cspell.json +++ b/cspell.json @@ -3,12 +3,14 @@ "version": "0.2", "language": "en", "words": [ + "abcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcd", "ADDRAND", "alnum", "assgn", "augmentedassign", "basechain", "BBITS", + "bitcode", "blockstore", "bounceable", "BREFS", @@ -19,6 +21,7 @@ "codegen", "CONFIGOPTPARAM", "CTOS", + "DEBUGSTR", "decompilation", "decompile", "Decompiled", @@ -27,7 +30,6 @@ "decompiling", "dentry", "Descr", - "DEBUGSTR", "disasm", "divmod", "dnsresolve", @@ -40,14 +42,14 @@ "funcfiftlib", "funcid", "funs", - "gettest", "GASCONSUMED", - "GETGASFEE", - "GETSTORAGEFEE", "GETFORWARDFEE", - "GETGASFEESIMPLE", "GETFORWARDFEESIMPLE", + "GETGASFEE", + "GETGASFEESIMPLE", "GETORIGINALFWDFEE", + "GETSTORAGEFEE", + "gettest", "HASHCU", "HASHEXT", "HASHSU", @@ -120,8 +122,8 @@ "SDEMPTY", "SDSKIPFIRST", "SEMPTY", - "SENDRAWMSG", "SENDMSG", + "SENDRAWMSG", "seqno", "SETCONTARGS", "SETINDEXVARQ", @@ -134,6 +136,7 @@ "STDICT", "stdlib", "STIX", + "stmts", "STOPTREF", "STREF", "struct", @@ -155,8 +158,7 @@ "unixfs", "workchain", "xffff", - "привет", - "abcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcdabcd" + "привет" ], "flagWords": [], "ignorePaths": [