Releases: microsoft/genaiscript
v1.92.1
Enhanced FFmpeg Flexibility & Performance
- Introduced support for custom FFmpeg options, allowing more control over video and audio processing.
- Improved caching mechanism for FFmpeg operations, reducing redundant computations and enhancing efficiency.
New Video Processing Features
- Added new methods to the FFmpeg command builder, including `videoBitrate`, `outputFps`, `size`, `autopad`, and more.
- Enabled advanced video transformations, such as aspect ratio adjustments, padding, and grayscale conversion.
Streamlined Audio Extraction
- Refined audio extraction with customizable output options for better quality control.
Developer Experience Improvements
- Simplified FFmpeg command builder API for easier and more intuitive usage.
- Improved type definitions for better IDE support and code clarity.
This release unlocks powerful media processing capabilities and optimizes workflows for developers.
v1.92.0
What's New in GenAIScript 1.92.0
- Enhanced Video Processing: Introduced a robust `FFmepgClient` for seamless video and audio operations, including frame extraction, audio track extraction, and metadata probing. Simplified API for working with video files.
- Improved Output Logging: Added support for audio logging and markdown table rendering in `OutputTrace`, making it easier to visualize results.
- Remote Environment Support: Enhanced server to display remote repository details, including URL and branch, for better traceability.
- Mermaid Diagrams: Added support for rendering Mermaid diagrams in markdown outputs, enabling richer visualizations.
- VS Code Integration: New webview support for viewing GenAIScript requests directly in VS Code.
- File Search Flexibility: `findFiles` now supports an `ignore` option with array input for more granular file matching.
- Annotation Improvements: Enhanced annotation handling with better formatting and GitHub Action command integration.
This release brings significant upgrades for video processing, logging, and developer experience.
v1.91.2
- Enhanced Data Handling: Introduced a unified `dataTryParse` function for seamless parsing of various data formats, improving flexibility and reducing redundancy.
- New Tools: Added `fs_infer_schema` and `fs_data_query` tools to infer JSON schemas and query data using GROQ syntax, empowering data analysis capabilities.
- Data Agent: Launched a new "data" agent to analyze and query file data, equipped with schema inference and querying tools.
- Safety Improvements: Integrated content safety checks for harmful and protected material in agents and tools.
- Video Agent Update: Enhanced the video agent with additional safety checks.
- Teams Integration: Added functionality to read messages from Microsoft Teams channels, complementing the existing message posting feature.
- Sample Scripts: Introduced new sample scripts demonstrating data schema inference and multi-file data analysis.
v1.91.1
Enhanced Video Capabilities
Introducing powerful video processing tools! Transcribe audio/video files with speech-to-text, extract frames, probe metadata, and even extract audio from video files. Perfect for multimedia projects!
WebSocket Improvements
WebSocket server now supports dispatching progress updates to all clients, enhancing real-time collaboration and monitoring.
Speech-to-Audio Support
Added speech synthesis capabilities, enabling text-to-speech generation with customizable voices for a variety of use cases.
Developer Utilities
New `deleteEmptyValues` utility improves object handling, and CLI enhancements include options for dispatching progress events.
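The notes do not say which values `deleteEmptyValues` treats as empty. As a rough illustration only, assuming "empty" means `undefined`, `null`, an empty string, or an empty array, a helper of this kind could be sketched as follows. The name `deleteEmptyValuesSketch` and the emptiness rules are hypothetical and are not taken from the GenAIScript source.

```typescript
// Hypothetical sketch: the real GenAIScript utility may use different rules.
function deleteEmptyValuesSketch(
    obj: Record<string, unknown>
): Record<string, unknown> {
    const result: Record<string, unknown> = {}
    for (const key of Object.keys(obj)) {
        const value = obj[key]
        // Assumed definition of "empty"; note that 0 and false are kept.
        const isEmpty =
            value === undefined ||
            value === null ||
            value === "" ||
            (Array.isArray(value) && value.length === 0)
        if (!isEmpty) result[key] = value
    }
    return result
}

const cleaned = deleteEmptyValuesSketch({
    model: "small",
    system: "",
    tools: [],
    temperature: 0,
    seed: undefined,
})
console.log(cleaned)
```

Returning a new object rather than mutating the input is a choice made for this sketch; the actual utility may delete keys in place.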
Performance and Stability
Refactored internal structures and fixed minor issues, ensuring a smoother and more reliable experience.
v1.91.0
GenAIScript 1.91.0 Highlights
New Features & Improvements
- Enhanced Embedding Configurations: Added new embedding aliases for streamlined LLM setups.
- Streaming Output: Introduced robust real-time output streaming for better user feedback.
- Trace-Driven Output: Improved trace capabilities, enabling detailed output tracking and customization.
Quality Enhancements
- Model Management: Simplified model pulling with expanded support for LMStudio and enhanced alias handling.
- Video Config & Fetch Updates: Improved video configuration documentation and fixed fetch command formatting for better usability.
- Error Handling: Enhanced error serialization for clearer debugging.
Developer Experience
- Refined APIs: Modularized chat message collapsing and moved it to a dedicated rendering module.
- Expanded Trace API: Added new output trace capabilities, including detailed logging and structured output support.
Maintenance
- Code Cleanup: Removed unused code paths, updated aliases, and optimized internal logic for better performance and readability.
v1.90.1
Enhanced Video Utilities: Added support for `WorkspaceFile` in video processing functions, enabling seamless handling of various file types. Extracting audio, frames, and metadata from videos is now more robust and flexible.
Improved Fetch Implementation: Integrated `cross-fetch` for better compatibility and enhanced proxy support. Added detailed request tracing, including support for multipart form data, to simplify debugging.
Smarter Configuration Handling: Configuration merging is now more intuitive, with support for arrays and object properties like `modelAliases`. This update improves flexibility and reduces potential errors.
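The exact merge rules are not described here. One plausible reading, shown purely as a sketch, is that array values from several configuration sources are concatenated and object values such as `modelAliases` are merged key by key, with later sources winning. The function `mergeConfigs`, the `include` key, and the model names below are hypothetical placeholders, not GenAIScript's implementation.

```typescript
type Config = Record<string, unknown>

function isPlainObject(value: unknown): value is Record<string, unknown> {
    return typeof value === "object" && value !== null && !Array.isArray(value)
}

// Hypothetical merge: arrays concatenate, plain objects merge one level deep,
// anything else is overwritten by the later source.
function mergeConfigs(base: Config, override: Config): Config {
    const merged: Config = { ...base }
    for (const key of Object.keys(override)) {
        const current = merged[key]
        const incoming = override[key]
        if (Array.isArray(current) && Array.isArray(incoming)) {
            merged[key] = [...current, ...incoming]
        } else if (isPlainObject(current) && isPlainObject(incoming)) {
            merged[key] = { ...current, ...incoming }
        } else {
            merged[key] = incoming
        }
    }
    return merged
}

const mergedConfig = mergeConfigs(
    { include: ["a.yml"], modelAliases: { large: "provider:model-a" } },
    { include: ["b.yml"], modelAliases: { small: "provider:model-b" } }
)
console.log(mergedConfig)
```

Under these assumptions both `include` entries and both aliases survive the merge, whereas a plain object spread would let the second source replace the first wholesale.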
Bug Fixes: Resolved issues with error handling, including a fix for misreported error messages. Enhanced the reliability of OpenAI API interactions and transcription workflows.
Sample Configurations: Introduced sample `genaiscript.config.yml` and `large.genai.mjs` files to demonstrate configuration and scripting capabilities.
v1.90.0
New Features and Enhancements
- Added CLI commands for video processing:
  - Extract audio from video files.
  - Extract video frames with options for count, size, and output folder.
- Integrated HuggingFace's `pipeline` API for advanced transformer capabilities.
- Introduced transcription support via OpenAI's Whisper API, enabling audio-to-text conversion with caching and SRT/VTT format generation.
- Enhanced data slicing in `defData` to support object field sampling and filtering.
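To make the SRT generation mentioned above concrete, here is a minimal sketch of turning timed transcript segments into SubRip text. It assumes segments carry `start` and `end` times in seconds plus a `text` field; the `Segment` type and both function names are illustrative and do not come from GenAIScript.

```typescript
// Hypothetical segment shape; times are in seconds.
interface Segment {
    start: number
    end: number
    text: string
}

// Format seconds as an SRT timestamp: HH:MM:SS,mmm
function toSrtTimestamp(seconds: number): string {
    const totalMs = Math.round(seconds * 1000)
    const ms = totalMs % 1000
    const totalSec = Math.floor(totalMs / 1000)
    const s = totalSec % 60
    const m = Math.floor(totalSec / 60) % 60
    const h = Math.floor(totalSec / 3600)
    const pad = (n: number, width: number) => ("000" + n).slice(-width)
    return `${pad(h, 2)}:${pad(m, 2)}:${pad(s, 2)},${pad(ms, 3)}`
}

// Each cue is: sequence number, time range, text, then a blank line.
function segmentsToSrt(segments: Segment[]): string {
    return segments
        .map(
            (seg, i) =>
                `${i + 1}\n${toSrtTimestamp(seg.start)} --> ${toSrtTimestamp(seg.end)}\n${seg.text.trim()}\n`
        )
        .join("\n")
}

console.log(
    segmentsToSrt([
        { start: 0, end: 1.25, text: "Hello" },
        { start: 1.25, end: 3, text: "world" },
    ])
)
```

WebVTT output would follow the same pattern with a `WEBVTT` header line and a `.` instead of `,` before the milliseconds.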
Performance Improvements
- Optimized hashing with streaming file support and salt integration.
- Improved concurrency handling for video frame extraction and audio transcoding using FFmpeg.
Bug Fixes and Stability
- Fixed edge cases in file handling for workspace paths.
- Resolved hashing inconsistencies for various data types like buffers and blobs.
- Enhanced error handling and logging for video processing and transcription workflows.
Video and Audio Processing
- Seamless integration of FFmpeg for video/audio tasks.
- Added caching for video probes, audio extraction, and frame generation.
Developer Experience
- Simplified runtime configuration for transcription and video utilities.
- Improved CLI usability with detailed descriptions and argument validations.
v1.89.2
What's New in GenAIScript 1.89.2
- Enhanced Hash Functionality: The `hash` function now supports additional input types like `ArrayBuffer`, `Blob`, and `null`/`undefined` values. Improved flexibility ensures seamless handling of diverse data structures. Comprehensive tests included for reliability.
- Filtered Script Listings: Unlisted scripts are now excluded from project listings, offering a cleaner and more focused view of available scripts.
- Improved Error Logging: Added error logging during model pulls to help identify and debug issues more effectively.
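The hashing change in the list above amounts to normalizing different input types to bytes before hashing. The sketch below illustrates that idea only: it uses a simple 32-bit FNV-1a hash as a stand-in (not the algorithm GenAIScript uses), and the `Hashable` type, the tag bytes, and the `hashSketch` name are all assumptions made for this example.

```typescript
type Hashable = string | ArrayBuffer | Blob | null | undefined

// Normalize the input to bytes, prefixed with a one-byte tag so that
// undefined, null, and empty content do not collide (assumed design).
async function toTaggedBytes(value: Hashable): Promise<Uint8Array> {
    let tag: number
    let body: Uint8Array
    if (value === undefined) {
        tag = 0
        body = new Uint8Array(0)
    } else if (value === null) {
        tag = 1
        body = new Uint8Array(0)
    } else if (typeof value === "string") {
        tag = 2
        body = new TextEncoder().encode(value)
    } else if (value instanceof ArrayBuffer) {
        tag = 3
        body = new Uint8Array(value)
    } else {
        // Blob: read its content asynchronously; shares the binary tag.
        tag = 3
        body = new Uint8Array(await value.arrayBuffer())
    }
    const out = new Uint8Array(body.length + 1)
    out[0] = tag
    out.set(body, 1)
    return out
}

// 32-bit FNV-1a, used here only for illustration.
function fnv1a(bytes: Uint8Array): string {
    let h = 0x811c9dc5
    for (let i = 0; i < bytes.length; i++) {
        h ^= bytes[i]
        h = Math.imul(h, 0x01000193) >>> 0
    }
    return ("00000000" + h.toString(16)).slice(-8)
}

async function hashSketch(value: Hashable): Promise<string> {
    return fnv1a(await toTaggedBytes(value))
}
```

With this design an `ArrayBuffer` and a `Blob` holding the same bytes hash identically, while `null` and `undefined` get distinct values; whether the real `hash` function makes the same choices is not stated in these notes.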
v1.89.1
Enhanced Usability
The RunButton now dynamically updates to display "Abort" when a process is running, making interactions more intuitive.
Model Updates
Introduced support for the new `ollama:phi4` model, offering advanced capabilities. Additionally, updated model aliases for consistency and clarity.
Streamlined Integrations
Removed the unused DeepSeek base URL for a cleaner configuration. Improved key consistency across integrations, including `topP`.
v1.89.0
New Features
- File Selection for Imports: You can now select specific files to include when importing, streamlining your workflow.
- Video Recording in Browser: Added support for recording browser sessions with Playwright, including video file retrieval for analysis or debugging.
- ReadableStream for Images: Enhanced image handling with support for `ReadableStream` inputs, expanding compatibility.
- Playground Setup: A new playground environment is now available, with live file watching for faster iteration.
Improvements
- Enhanced Chat Participant Logic: Introduced support for modifying chat messages dynamically during processing.
- Updated Memory Aliases: Memory alias updated from `large` to `small` for optimized configurations.
- Markdown Rendering Fixes: Improved handling of complex content arrays in rendered markdown, ensuring better formatting.
Refactors
- Simplified image processing functions by removing redundant awaits.
- Reordered StatsTabPanel for better UI flow.
Documentation
- Added instructions for running the playground in `CONTRIBUTING.md`.
This release brings powerful new tools and refinements to enhance your GenAIScript experience!