Skip to content

v0.10.0

Compare
Choose a tag to compare
@awni awni released this 11 Apr 19:53
· 492 commits to main since this release
d07e295

Highlights

  • Improvements for LLM generation
    • Reshapeless quant matmul/matvec
    • mx.async_eval
    • Async command encoding

Core

  • Slightly faster reshapeless quantized gemms
  • Option for precise softmax
  • mx.metal.start_capture and mx.metal.stop_capture for GPU debug/profile
  • mx.expm1
  • mx.std
  • mx.meshgrid
  • CPU only mx.random.multivariate_normal
  • mx.cumsum (and other scans) for bfloat
  • Async command encoder with explicit barriers / dependency management

NN

  • nn.upsample support bicubic interpolation

Misc

  • Updated MLX Extension to work with nanobind

Bugfixes

  • Fix buffer donation in softmax and fast ops
  • Bug in layer norm vjp
  • Bug initializing from lists with scalar
  • Bug in indexing
  • CPU compilation bug
  • Multi-output compilation bug
  • Fix stack overflow issues in eval and array destruction