Skip to content

Latest commit

 

History

History
38 lines (31 loc) · 788 Bytes

intro.md

File metadata and controls

38 lines (31 loc) · 788 Bytes

Introdcution

Breaking down texts

graph TD;
    Text --> Token;
    Token --> Embedding;
    Embedding --> Vector;
    Vector --> SimilarityMeasure
    Vector --> VectorDatabase;
    Embedding --> Transformers;
Loading

Modelling

  • Attention captures context.
  • Billions of parameters.
  • Huge training data.
  • Training is expensive and requires specialized hardware.
  • optionally Instruction tuning
  • RLHF to incorporate human feedback.

Improvements by user / customer

  • Prompt 'engineering'
  • Use RAG to add data sources to context
  • Fine-tuning of model

Limitations

  • Math / logic / reasoning
  • Training data:
    • Bias
    • Cut-off date
  • Censorship by some vendors
  • Hallicunation
  • computationally expensive
  • Ethics and copyright issues