Compiler: Start implementing the type checker #228

osa1 · 2025-09-26T19:05:10Z

Questions and TODOs:

Do we want to allow higher-kinded types now, or later? Do we have any immediate use cases?
Should we represent kinds with types? (i.e. type-in-type)
In the interpreter we implement Hash for Ty because we represent predicate sets as HashSet<Pred>, and Pred refers to Tys for the type arguments of traits. We can't implement Hash automatically in Fir yet, so we'd have to write a lot of Hash implementations by hand. Also, FunArgs is currently using a HashSet. We may have to make it a sorted Vec[(name: Id, ty: Ty)] (for named arguments).

~~But I wonder if there's a better representation of predicate sets than HashSet[Pred]? It would be simpler if we could avoid hashing here, at least initially.~~

Reminder: pred sets are not just for function contexts ([Iterator[iter, item, exn], Eq[item]] etc.); they also hold generated predicates, to be resolved later.

~~Allowing duplicates could potentially generate a lot of redundant predicates, wasting time in the resolving phase. For example, every == in a function can generate an Eq[t] predicate.~~

~~I think using a Vec[Pred] and linear search when adding may be good enough for now. TODO: Check the max. pred set size in the interpreter when running the programs in the repo.~~

Predicate sets will be vectors, at least for now. See Predicate sets contain duplicate predicates #229 for the discussion.
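A minimal Rust sketch of a vector-backed predicate set, to show that only equality (not hashing) is needed. The `Ty` and `Pred` definitions here are simplified stand-ins, and deduplicating on insert via linear search is an assumption taken from the struck-through idea above; the actual PredSet may keep duplicates (see #229).

```rust
/// Hypothetical, simplified stand-in for the real `Ty`.
#[derive(Debug, Clone, PartialEq)]
pub enum Ty {
    Con(String),
    App(String, Vec<Ty>),
}

/// Hypothetical, simplified stand-in for the real `Pred`.
#[derive(Debug, Clone, PartialEq)]
pub struct Pred {
    pub trait_name: String,
    pub args: Vec<Ty>,
}

/// A predicate set backed by a vector. Only `PartialEq` is required,
/// so no `Hash` implementations are needed for `Ty` or `Pred`.
#[derive(Debug, Default)]
pub struct PredSet {
    preds: Vec<Pred>,
}

impl PredSet {
    pub fn new() -> Self {
        PredSet { preds: Vec::new() }
    }

    /// Adds `pred` unless an equal predicate is already present
    /// (linear search). Returns `true` if the predicate was added.
    pub fn insert(&mut self, pred: Pred) -> bool {
        if self.preds.contains(&pred) {
            return false;
        }
        self.preds.push(pred);
        true
    }

    pub fn len(&self) -> usize {
        self.preds.len()
    }

    pub fn iter(&self) -> impl Iterator<Item = &Pred> {
        self.preds.iter()
    }
}
```

With this shape, adding the same `Eq[t]` predicate for every `==` in a function costs a linear scan per insert but leaves a single entry to resolve.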

In TyEnv (in the interpreter) we currently maintain two maps: one for type variables and one for type constructors. But type variables are also treated as type constructors (just with no known constructors) in most places. AFAICS we use the type variable map only in conversions, to preserve sharing when the type AST mentions the same type variable more than once.

In the compiler we use TyId for type constructors and LocalId for type variables. So the map will have a key type with the union of these.

I wonder if it would make sense to have another map (three in total):
- cons: ScopeMap[TyId, TyCon]: for mapping named types to type constructors
- vars: ScopeSet[LocalId]: type variables in scope. In the compiler, TyCons for these don't have any information anyway, so a set should be OK.
- varConversions: ScopeMap[LocalId, Ty]: to be able to convert type variables in scope without breaking sharing.

I think this will require adding one more variant to Ty, to be able to distinguish type constructors from rigid type variables. E.g.:
```
## A type constructor, e.g. `Vec`, `Option`, `U32`.
Con(
    id: TyId,
    kind: Kind,
)

## A rigid type variable.
RVar(
    id: LocalId,
)
```
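To make the three-map idea concrete, here is a rough Rust sketch. Everything in it is an assumption for illustration: `ScopeMap` is modelled as a stack of hash maps, `ScopeSet` as a `ScopeMap` with unit values, `Ty` is reduced to the two variants above (without the kind field), and sharing is modelled with `Rc`.

```rust
use std::collections::HashMap;
use std::hash::Hash;
use std::rc::Rc;

#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
pub struct TyId(pub u32);

#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
pub struct LocalId(pub u32);

/// Reduced `Ty` with only the two variants under discussion.
#[derive(Debug, Clone, PartialEq)]
pub enum Ty {
    Con(TyId),
    RVar(LocalId),
}

#[derive(Debug, Clone, PartialEq)]
pub struct TyCon {
    pub name: String,
}

/// A stack of scopes; lookups search the innermost scope first.
pub struct ScopeMap<K, V> {
    scopes: Vec<HashMap<K, V>>,
}

impl<K: Hash + Eq, V> ScopeMap<K, V> {
    pub fn new() -> Self {
        ScopeMap { scopes: vec![HashMap::new()] }
    }
    pub fn enter(&mut self) {
        self.scopes.push(HashMap::new());
    }
    /// Drops the innermost scope, always keeping the outermost one.
    pub fn exit(&mut self) {
        if self.scopes.len() > 1 {
            self.scopes.pop();
        }
    }
    pub fn insert(&mut self, key: K, value: V) {
        self.scopes.last_mut().expect("at least one scope").insert(key, value);
    }
    pub fn get(&self, key: &K) -> Option<&V> {
        self.scopes.iter().rev().find_map(|scope| scope.get(key))
    }
}

pub struct TyEnv {
    /// Named types to type constructors.
    pub cons: ScopeMap<TyId, TyCon>,
    /// Type variables in scope (a set, modelled with unit values).
    pub vars: ScopeMap<LocalId, ()>,
    /// Converted type variables, shared between all mentions.
    pub var_conversions: ScopeMap<LocalId, Rc<Ty>>,
}

impl TyEnv {
    pub fn new() -> Self {
        TyEnv { cons: ScopeMap::new(), vars: ScopeMap::new(), var_conversions: ScopeMap::new() }
    }
    pub fn enter_scope(&mut self) {
        self.cons.enter();
        self.vars.enter();
        self.var_conversions.enter();
    }
    pub fn exit_scope(&mut self) {
        self.cons.exit();
        self.vars.exit();
        self.var_conversions.exit();
    }
    /// Brings a rigid type variable into scope and records its conversion.
    pub fn bind_var(&mut self, id: LocalId) {
        self.vars.insert(id, ());
        self.var_conversions.insert(id, Rc::new(Ty::RVar(id)));
    }
    /// Every mention of the same variable converts to the same shared `Ty`.
    pub fn convert_var(&self, id: LocalId) -> Option<Rc<Ty>> {
        self.var_conversions.get(&id).cloned()
    }
}
```

Converting the same variable twice returns the same `Rc`, which is one way to read "without breaking sharing".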

osa1 · 2025-09-30T11:44:03Z

I think the Ty type is currently not quite right.

Because we can't have a type variable with a kind other than * or row, and we don't allow partial type applications (because you wouldn't be able to bind anything to partial apps), Con cannot be Vec, Option etc. It needs to be fully applied.

So the examples in the documentation are not right: Vec and Option cannot be Cons.

Because both rigid type variables and constructors are represented as Con, Cons currently can have row kinds.

If we separate rigid type variables and constructors into different Ty variants (see comment below), then Cons will only have kind *, as row types are represented as Anonymous with the isRow flag set.
Because anonymous types are not Apps, I think App doesn't need a kind field. An App is always a *.
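A small Rust sketch of what this would mean for kinds, assuming the split into separate variants. The variant shapes and field names (`con`, `args`, `fields`, `is_row`) are made up for illustration; the point is only that `Con` and `App` need no kind field because their kind is always `*`.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Kind {
    Star,
    Row,
}

#[derive(Debug, Clone, PartialEq)]
pub enum Ty {
    /// A type constructor that takes no arguments, e.g. `U32`. Always kind `*`.
    Con { id: u32 },
    /// A fully applied constructor, e.g. `Vec[U32]`. Always kind `*`.
    App { con: u32, args: Vec<Ty> },
    /// A rigid type variable. Its kind is `*` or row.
    RVar { id: u32, kind: Kind },
    /// An anonymous type; rows are anonymous types with `is_row` set.
    Anonymous { fields: Vec<(String, Ty)>, is_row: bool },
}

/// Computes the kind of a type without storing kinds on `Con` or `App`.
pub fn kind_of(ty: &Ty) -> Kind {
    match ty {
        Ty::Con { .. } | Ty::App { .. } => Kind::Star,
        Ty::RVar { kind, .. } => *kind,
        Ty::Anonymous { is_row, .. } => {
            if *is_row {
                Kind::Row
            } else {
                Kind::Star
            }
        }
    }
}
```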

osa1 · 2025-09-30T12:36:26Z

Thinking about this more: a quantified type variable (qvar) becomes:

- A rigid type variable (let's call this rvar) when the qvar is in a function signature and we're checking the function's body.
- A unification variable (let's call this uvar) when the qvar is in a scheme of a function we're calling.

A type variable is only one of these (qvar, rvar, uvar) at any given time. It cannot be more than one of these.

So we would be able to use just one Ty variant for all of these, but currently we update uvar contents as we unify them (link them to other types), so they have different fields.

We could either keep a map from unification variables to their linked types, or keep using different variants.
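For comparison, the map-based option could look roughly like this in Rust. This is only a sketch of the alternative: the names `Links`, `link`, and `normalize` are hypothetical, `Ty` is reduced to two variants, and there is no occurs check, so a cyclic link would make `normalize` loop forever.

```rust
use std::collections::HashMap;

/// Reduced `Ty`: a single `Var` variant with no mutable link field.
#[derive(Debug, Clone, PartialEq)]
pub enum Ty {
    Con(String),
    Var(u32),
}

/// Side table recording what each unification variable was linked to.
#[derive(Debug, Default)]
pub struct Links {
    links: HashMap<u32, Ty>,
}

impl Links {
    /// Records that `var` was unified with `ty`.
    pub fn link(&mut self, var: u32, ty: Ty) {
        self.links.insert(var, ty);
    }

    /// Follows links until reaching an unlinked variable or a non-variable
    /// type. Assumes the links are acyclic (no occurs check here).
    pub fn normalize(&self, ty: &Ty) -> Ty {
        let mut current = ty;
        while let Ty::Var(id) = current {
            match self.links.get(id) {
                Some(next) => current = next,
                None => break,
            }
        }
        current.clone()
    }
}
```

The trade-off is that every read of a variable goes through the table, while the variant-based approach keeps the link on the variable itself.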

If we use different variants, it might make sense to have a variant for rvars too.

If we do that, I think that also solves the problem with the TyEnvs mentioned in the PR description.
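A rough Rust sketch of the two ways a qvar gets replaced, assuming separate variants for all three kinds of variable. The names `Mode`, `open_scheme`, and the string-keyed variables are hypothetical, and `Ty` is cut down to what the example needs; this is not the PR's substQVars.

```rust
use std::collections::HashMap;

#[derive(Debug, Clone, PartialEq)]
pub enum Ty {
    Con(String),
    QVar(String),
    RVar(String),
    UVar(u32),
    Fun(Vec<Ty>, Box<Ty>),
}

#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Mode {
    /// Checking the body of the function that owns the signature.
    CheckBody,
    /// Calling a function with this scheme.
    Call,
}

/// Replaces every qvar found in `map`; other types are copied as-is.
fn subst_qvars(ty: &Ty, map: &HashMap<String, Ty>) -> Ty {
    match ty {
        Ty::QVar(name) => map.get(name).cloned().unwrap_or_else(|| ty.clone()),
        Ty::Fun(args, ret) => Ty::Fun(
            args.iter().map(|arg| subst_qvars(arg, map)).collect(),
            Box::new(subst_qvars(ret, map)),
        ),
        Ty::Con(_) | Ty::RVar(_) | Ty::UVar(_) => ty.clone(),
    }
}

/// Opens a scheme: qvars become rvars when checking the body,
/// and fresh uvars when calling the function.
pub fn open_scheme(qvars: &[&str], ty: &Ty, mode: Mode, next_uvar: &mut u32) -> Ty {
    let mut map = HashMap::new();
    for qvar in qvars {
        let replacement = match mode {
            Mode::CheckBody => Ty::RVar(qvar.to_string()),
            Mode::Call => {
                let id = *next_uvar;
                *next_uvar += 1;
                Ty::UVar(id)
            }
        };
        map.insert(qvar.to_string(), replacement);
    }
    subst_qvars(ty, &map)
}
```

Each call site gets its own fresh uvars, while the body always sees the same rvar for a given qvar.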

osa1 · 2025-10-27T09:39:43Z

TODO: Environments should keep (in addition to schemes etc.) unique top-level definition indices/ids to be able to import the same top-level thing multiple times, via different imports.

E.g. a library re-exports A/B/T (the type T) and I also import A/B/T directly.

This is also a language design question, but we may also want to allow importing different things under the same name, and only fail when we use that thing. (instead of when we import it)

If we design the data structures to allow this, we can still easily choose to disallow it later. So I think it makes sense to design for this.
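One possible shape for this, sketched in Rust. The names `DefId`, `ImportEnv`, and `LookupError` are hypothetical, and schemes and other per-definition data are left out. Importing never fails; a name only becomes an error when it is looked up and refers to more than one distinct definition.

```rust
use std::collections::{BTreeSet, HashMap};

/// Unique id of a top-level definition.
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
pub struct DefId(pub u32);

#[derive(Debug, PartialEq)]
pub enum LookupError {
    NotFound,
    /// The name refers to more than one distinct definition.
    Ambiguous(Vec<DefId>),
}

#[derive(Debug, Default)]
pub struct ImportEnv {
    names: HashMap<String, BTreeSet<DefId>>,
}

impl ImportEnv {
    /// Records an import. Importing the same definition again (e.g. via a
    /// re-export) is a no-op, and conflicting imports are not rejected here.
    pub fn import(&mut self, name: &str, def: DefId) {
        self.names.entry(name.to_string()).or_default().insert(def);
    }

    /// Resolves a name at its use site; conflicts are reported only here.
    pub fn lookup(&self, name: &str) -> Result<DefId, LookupError> {
        match self.names.get(name) {
            None => Err(LookupError::NotFound),
            Some(defs) if defs.len() == 1 => Ok(*defs.iter().next().unwrap()),
            Some(defs) => Err(LookupError::Ambiguous(defs.iter().copied().collect())),
        }
    }
}
```

Switching to the stricter rule later would mean moving the ambiguity check from `lookup` into `import`, without changing the data structure.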

osa1 added 2 commits September 26, 2025 20:04

Start implementing the type checker

69b4a13

Port Scheme and Pred types

4e5342a

osa1 mentioned this pull request Sep 27, 2025

Predicate sets contain duplicate predicates #229

Open

osa1 added 6 commits September 28, 2025 09:07

Implement PredSet, document why we don't use a HashSet for preds

f48e97e

Merge remote-tracking branch 'origin/main' into type_checking

118b99f

Start implementing instantiating type schemes

ddf0712

Merge remote-tracking branch 'origin/main' into type_checking

06b0438

Fix import

73e725c

Add LocalId type, implement substQVars

e7cdd5d

osa1 added 19 commits September 30, 2025 18:17

Merge remote-tracking branch 'origin/main' into type_checking

dd91d4f

Implement UVars and RVars as discussed

d1926e6

Update AST tc ty fields

5339f98

Merge branch 'main' into type_checking

9f63721

Merge branch 'main' into type_checking

a067c69

ToDoc impls

b94659a

Move QVar and RVar fields to their own types

5a1887c

Create a directory for type checker modules

47032df

Add placeholders for unification functions

b0b16ed

Start implementing unification

ff5e891

Merge remote-tracking branch 'origin/main' into type_checking

94d97c5

Implement deepNormalize, start implementing collectRows

bf8dbb7

Merge remote-tracking branch 'origin/main' into type_checking

2f2a262

Implement collectRows

a2a83e8

Merge remote-tracking branch 'origin/main' into type_checking

d9c995d

More unification

8dc3c4f

More unification

29ac400

Merge remote-tracking branch 'origin/main' into type_checking

583e021

Merge remote-tracking branch 'origin/main' into type_checking

fc21812

osa1 added 29 commits October 13, 2025 11:38

Merge remote-tracking branch 'origin/main' into type_checking

87bd4eb

Add missing imports

d75ce06

Simplify TyDefIdx

772b5d2

Add token values to id types

4e0d5c1

Add err loc and msg to unification errs

28f91d7

Start implementing one-way unification

270a1b7

Merge remote-tracking branch 'origin/main' into type_checking

9f6323b

Fix type errs

cdbf1b1

More unification

71dcb6b

More unification

e133e42

Merge remote-tracking branch 'origin/main' into type_checking

589e7b1

More types TyConDetails, TraitDetails, TraitMethod, TypeDetails

78b59fb

Start implementing convos

63ffd1f

Add TypeError type, wip stuff

a90bc98

More convos

be7beec

More convos

5469a40

Conversions = done?

38b87bb

Implement TraitEnv

706c4a6

Merge remote-tracking branch 'origin/main' into type_checking

83fb846

Merge remote-tracking branch 'origin/main' into type_checking

a6bc166

Update sytnax in some docs

81af533

WIP

ef28810

Merge remote-tracking branch 'origin/main' into type_checking

8a34396

Module and fun envs

f0088ae

Placeholders

e033c65

Fix import

2444f20

WIP

e018ede

Fix imports again

8f2b0fd

Code

6031c2b
