Releases: microsoft/genaiscript
v1.95.0
What's New in 1.95.0 π
- Enhanced Security π: Introduced a robust Content Security Policy (CSP) for webviews, ensuring stricter control over resources like scripts, styles, and frames.
- Improved Webview Experience π₯οΈ: Added a persistent webview panel with seamless reveal functionality, providing a smoother user interface for error handling and output display.
- Streamlined Codebase π οΈ: Replaced custom nonce generation with a secure
randomHex
utility, enhancing maintainability and reliability.
v1.94.2
π Enhanced Model Management: Improved error handling and added support for listing models across multiple providers, ensuring a smoother user experience when managing language models.
π§ Client Language Model Refactor: Transitioned the client language model to runtimeHost
for better modularity and maintainability.
π Web Compatibility: Introduced support for web-browser scenarios, including enhanced CSP handling and iframe-based views for seamless integration.
πΉ Video Blogifier Update: Enhanced video transcription and processing with improved prompt handling and dynamic topic resolution for better YouTube optimization.
π οΈ Dependency Updates: Refreshed dependencies across packages to maintain stability and compatibility.
π Bug Fixes: Addressed issues with client builds, environment configurations, and context menu visibility to enhance overall reliability.
v1.94.1
π Enhanced File Handling: Added support for resolving data URIs directly in files, enabling seamless processing of base64-encoded data.
π Improved Token Configuration: Refined token handling logic for better compatibility with bearer tokens across various providers.
π Documentation Update: Updated Azure AI Foundry documentation for clarity and consistency.
π οΈ Simplified Validation: Removed redundant version checks in Azure environment parsing for a cleaner configuration process.
v1.94.0
What's New in 1.94.0
π Enhanced Video Processing
- Introduced keyframe extraction and scene detection using FFmpeg, enabling better video analysis and integration.
- Streamlined video transcription and screenshot capabilities for improved LLM input preparation.
π οΈ Improved CLI Functionality
- Added
--out-output
flag for better output management. - Centralized AI request dispatch logic for consistent and efficient processing.
π Advanced Data Parsing
- Added support for JSONL file parsing, expanding data format compatibility.
π€ AI Enhancements
- Introduced a new
classify
prompt for text and image categorization with entropy and logprob analysis. - Improved handling of Hugging Face token prefixes for multi-token scenarios.
π¨ Developer Experience
- Enhanced trace handling and output management for better debugging and insights.
- Refactored internal logic for clarity and maintainability.
π₯ Additional Video Tools
- Added support for high-detail image scaling and scene-based frame extraction.
This release brings significant improvements to video processing, CLI tools, and AI integration, ensuring a smoother and more powerful experience for developers.
v1.93.0
What's New in GenAIScript 1.93.0
π Keyframe Extraction and Scene Detection
Introducing enhanced video processing capabilities! Extract keyframes or detect scenes with adjustable thresholds using the new --keyframes
and --scene-threshold
options in the CLI.
π£οΈ Local Whisper ASR Support
Seamlessly integrate the Whisper ASR Webservice for transcription tasks. Configure it as a model provider and enjoy local transcription support.
π Improved Notebook Execution
Notebook frontmatter now supports additional run options like models and providers, enabling more flexible and powerful script execution.
π§ Enhanced FFmpeg Integration
Optimized frame extraction with wildcard-based output handling, scene-specific frame selection, and improved caching for smoother workflows.
π Bug Fixes
- Resolved issues with parsing notebook frontmatter.
- Fixed minor inconsistencies in transcription and frame extraction logic.
v1.92.3
What's New in GenAIScript 1.92.3
π Enhanced Video and Audio Processing
- Added
probe
command to extract detailed metadata from video/audio files. - Improved
extract-audio
andextract-frames
commands with new options for transcription, frame count, size, and format.
π₯ Large Media Support
- Optimized frame extraction for large images and videos, ensuring better performance and flexibility.
π Transcription Improvements
- Revamped transcription caching and error handling for seamless speech-to-text conversion.
βοΈ FFmpeg Integration
- Introduced FFmpeg feature installation in dev environments for enhanced media processing.
π οΈ Developer Experience
- Simplified CLI workflows and added comprehensive tests for video processing tasks.
Enjoy smoother media handling and powerful new tools!
v1.92.2
What's New in 1.92.2
π Enhanced Webview Handling
Improved handling of external URIs for webview assets, ensuring seamless integration and better external resource management.
π₯ Video Processing Improvements
- Added support for caching during audio extraction with FFmpeg, optimizing repeated operations.
- Updated video transcription to leverage caching for alt-text generation.
π Cleaner Repositories
Introduced .gitignore
for large local test video files, keeping repositories tidy.
π οΈ Bug Fixes & Optimizations
- Resolved issues with optional parameters in FFmpeg operations.
- Adjusted input handling to ensure consistent directory management.
v1.92.1
β¨ Enhanced FFmpeg Flexibility & Performance
- Introduced support for custom FFmpeg options, allowing more control over video and audio processing.
- Improved caching mechanism for FFmpeg operations, reducing redundant computations and enhancing efficiency.
π₯ New Video Processing Features
- Added new methods to the FFmpeg command builder, including
videoBitrate
,outputFps
,size
,autopad
, and more. - Enabled advanced video transformations, such as aspect ratio adjustments, padding, and grayscale conversion.
π΅ Streamlined Audio Extraction
- Refined audio extraction with customizable output options for better quality control.
π Developer Experience Improvements
- Simplified FFmpeg command builder API for easier and more intuitive usage.
- Improved type definitions for better IDE support and code clarity.
This release unlocks powerful media processing capabilities and optimizes workflows for developers. π
v1.92.0
What's New in GenAIScript 1.92.0
- π Enhanced Video Processing: Introduced a robust
FFmepgClient
for seamless video and audio operations, including frame extraction, audio track extraction, and metadata probing. Simplified API for working with video files. - π¨ Improved Output Logging: Added support for audio logging and markdown table rendering in
OutputTrace
, making it easier to visualize results. - π Remote Environment Support: Enhanced server to display remote repository details, including URL and branch, for better traceability.
- β¨ Mermaid Diagrams: Added support for rendering Mermaid diagrams in markdown outputs, enabling richer visualizations.
- π οΈ VS Code Integration: New webview support for viewing GenAIScript requests directly in VS Code.
- ποΈ File Search Flexibility:
findFiles
now supports anignore
option with array input for more granular file matching. - ποΈ Annotation Improvements: Enhanced annotation handling with better formatting and GitHub Action command integration.
This release brings significant upgrades for video processing, logging, and developer experience.
v1.91.2
- π Enhanced Data Handling: Introduced a unified
dataTryParse
function for seamless parsing of various data formats, improving flexibility and reducing redundancy. - π οΈ New Tools: Added
fs_infer_schema
andfs_data_query
tools to infer JSON schemas and query data using GROQ syntax, empowering data analysis capabilities. - π Data Agent: Launched a new "data" agent to analyze and query file data, equipped with schema inference and querying tools.
- π Safety Improvements: Integrated content safety checks for harmful and protected material in agents and tools.
- π₯ Video Agent Update: Enhanced the video agent with additional safety checks.
- π Teams Integration: Added functionality to read messages from Microsoft Teams channels, complementing the existing message posting feature.
- π§ͺ Sample Scripts: Introduced new sample scripts demonstrating data schema inference and multi-file data analysis.