Skip to content

Literature backing the model architecture #2

@geetu040

Description

@geetu040

Hi, I really like the model . It has been trained good and is generating good results compared to the size and the slight uniqueness in architecture. I know the dataset used here is coming from the paper TinyStories, but is there also a literature backing for the model architecture or are you planing to publish a paper?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions