The BlinkID SDK is a comprehensive solution for implementing secure document scanning on iOS. It offers powerful capabilities for capturing and analyzing a wide range of identification documents. The package consists of BlinkID, which serves as the core module, and an optional BlinkIDUX package that provides a complete, ready-to-use solution with a user-friendly interface.
- Requirements
- Quick Start
- BlinkID Components
- BlinkIDUX Components
- Creating custom UX component
- Localization
- SDK Integration Troubleshooting
- SDK size
- Additional info
The SDK package contains the `BlinkID` framework, the `BlinkIDUX` package, and one or more sample apps that demonstrate their integration. The `BlinkID` framework can be deployed on iOS 15.0 or later, and the `BlinkIDUX` package can be deployed on iOS 16.0 or later. The framework and package support Swift projects.
Both `BlinkID` and `BlinkIDUX` are dynamic packages.
This project is designed with Swift 6, leveraging the latest concurrency features to ensure efficient and modern application performance.
`BlinkIDUX` is built with full support for SwiftUI, enabling seamless integration of declarative user interfaces with modern, responsive design principles.
SDK | Platform | Installation | Minimum Swift Version | Type |
---|---|---|---|---|
BlinkID | iOS 15.0+ | Swift Package Manager | 5.10 / Xcode 15.3 | Dynamic |
BlinkIDUX | iOS 16.0+ | Swift Package Manager | 5.10 / Xcode 15.3 | Dynamic |
In this section, you will initialize the SDK, create a scanning session, and process document images, resulting in extracted document data and document images.
This Quick Start guide will get you up and running with document scanning as quickly as possible. All steps described in this guide are required for the integration.
This guide closely follows the BlinkID app in the Samples folder of this repository. We highly recommend you try to run the sample app. The sample app should compile and run on your device.
The source code of the sample app can be used as a reference during the integration.
To integrate the BlinkID SDK into your iOS project, you'll need to:
- Obtain a valid license key from the Microblink dashboard
- Add the SDK framework to your project
The Swift Package Manager is a tool for automating the distribution of Swift code and is integrated into the `swift` compiler.
Once you have your Swift package set up, adding BlinkID and BlinkIDUX as dependencies is as easy as adding them to the `dependencies` value of your `Package.swift` or the Package list in Xcode.
We provide a URL to the public package repository that you can add in Xcode:
https://github.com/BlinkID/blinkid-sp
dependencies: [
    .package(url: "https://github.com/BlinkID/blinkid-ios.git", .upToNextMajor(from: "7.0.0"))
]
Normally you'll want to depend on the `BlinkIDUX` target:
.product(name: "BlinkIDUX", package: "BlinkIDUX")
`BlinkIDUX` has a binary target dependency on `BlinkID`, so use this package only if you are also using our UX.
You can see the dependency in our `Package.swift`:
.binaryTarget(
    name: "BlinkID",
    path: "Frameworks/BlinkID.xcframework"
)
If you prefer not to use Swift Package Manager, you can integrate BlinkID and BlinkIDUX into your project manually.
- Download the latest release (download the `BlinkID.xcframework.zip` file or clone this repository).
- Copy `BlinkID.xcframework` to your project folder.
- In your Xcode project, open the Project navigator. Drag the `BlinkID.xcframework` file to your project, ideally into the Frameworks group.
- Since `BlinkID.xcframework` is a dynamic framework, you also need to add it to the embedded binaries section in the General settings of your target and choose the `Embed & Sign` option.
- Open up Terminal, cd into your top-level project directory, and run the following command if your project is not initialized as a git repository:
$ git init
- Add BlinkIDUX as a git submodule by running the following command:
$ git submodule add https://github.com/BlinkID/blinkid-ios.git
To add a local Swift package as a dependency in Xcode:
1. Go to your Xcode project.
2. Select your project in the Project Navigator.
3. Go to the Package Dependencies tab under your project settings.
4. Click the ”+” button to add a new dependency.
5. In the dialog that appears, select the “Add Local…” option at the bottom left.
6. Navigate to the folder containing your local Swift package and select it.
This will add the local Swift package as a dependency to your Xcode project.
`BlinkIDUX` has a binary target dependency on `BlinkID`, so use this package only if you are also using our UX.
In each file in which you want to use the functionality of the SDK, place the import directive:
import BlinkID
- Initialize the SDK with your license key:
let settings = BlinkIDSdkSettings(
    licenseKey: "your-license-key",
    downloadResources: true
)

let sdk = try await BlinkIDSdk.createBlinkIDSdk(withSettings: settings)
- Create a capture session:
let session = await sdk.createScanningSession()
- Process images and handle results:
let result = await session.process(inputImage: capturedImage)
if result.processResult?.resultCompleteness.scanningStatus == .documentScanned {
    Task { @ProcessingActor in
        // getResult() must be called on the ProcessingActor
        let finalResult = session.getResult()
        // Handle the final scanning result
    }
}
We provide the BlinkIDUX package, which encapsulates all the necessary logic for the document scanning process, streamlining integration into your app.
In each file in which you want to use the functionality of the SDK, place the import directive:
import BlinkIDUX
- Initialize the BlinkIDAnalyzer:
- Begin by creating an instance of the BlinkIDAnalyzer after initializing the capture session:
let analyzer = await BlinkIDAnalyzer(
    sdk: sdk,
    eventStream: BlinkIDEventStream()
)
- Create a BlinkIDUXModel:
- Next, use the BlinkIDAnalyzer to initialize the BlinkIDUXModel:
let viewModel = BlinkIDUXModel(analyzer: analyzer)
- Display the BlinkIDUXView in SwiftUI:
- Add the BlinkIDUXView to your SwiftUI view hierarchy using the BlinkIDUXModel:
struct ContentView: View {
    var body: some View {
        BlinkIDUXView(viewModel: viewModel)
    }
}
- Access the Capture Result:
- The BlinkIDUXModel exposes the scanning result through the result property, which is a @Published variable. You can observe it to handle the result of the document capture and verification process inside your ViewModel:
viewModel.$result
    .sink { [weak self] scanningResultState in
        if let scanningResultState {
            if let scanningResult = scanningResultState.scanningResult {
                // Handle the scanning result
                print("Scanning completed with result: \(scanningResult)")
            }
        }
    }
    .store(in: &cancellables)
Alternatively, you can observe it directly within your SwiftUI views using SwiftUI’s @ObservedObject or @StateObject property wrappers. This allows you to automatically update your UI based on the capture result without manually handling Combine subscriptions.
struct ContentView: View {
    @StateObject var viewModel: BlinkIDUXModel

    var body: some View {
        VStack {
            BlinkIDUXView(viewModel: viewModel)
            if let result = viewModel.result, let scanningResult = result.scanningResult {
                Text("Scanning Result: \(scanningResult.description)")
            } else {
                Text("Awaiting scanning...")
            }
        }
    }
}
BlinkID is a powerful document scanning solution designed to enhance the security and accuracy of document scanning processes.
The `BlinkIDSdk` class serves as the main entry point for document scanning functionality. It manages SDK initialization, resource downloading, and session creation.
let settings = BlinkIDSdkSettings(
    licenseKey: "your-license-key",
    downloadResources: true
)

do {
    let sdk = try await BlinkIDSdk.createBlinkIDSdk(withSettings: settings)
    let session = await sdk.createScanningSession()
    // Use the session for document scanning
} catch {
    // Handle initialization errors
}
`BlinkIDSession` is a Swift class that manages document scanning operations, providing a robust interface for capturing, processing, and validating documents through image analysis and scanning.
The `BlinkIDSession` class serves as the primary controller for document scanning workflows, handling various aspects such as:
- Image processing and analysis
- Session lifecycle management
- Result generation and processing
let sdk = try await BlinkIDSdk.createBlinkIDSdk(withSettings: settings)
let session = await sdk.createScanningSession()
Creates a new capture session with specified settings and resource path configurations.
public func cancelActiveProcessing()
Immediately terminates any ongoing processing operations. This method can be called from any context and is useful for handling user cancellations or session aborts.
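For example, you might wire it to a cancel button. A minimal sketch, assuming an existing `BlinkIDSession` named `session`:

```swift
// Sketch: cancelling from a UI action; safe to call from any context.
func cancelButtonTapped() {
    session.cancelActiveProcessing()
}
```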
@ProcessingActor
public func process(inputImage: InputImage) -> FrameProcessResult
Processes an input image and provides detailed analysis results. This method:
- Analyzes the provided image according to session settings
- Returns a `FrameProcessResult` containing analysis results and completion status
- Must be executed within the ProcessingActor context
@ProcessingActor
public func getResult() -> BlinkIDScanningResult
Retrieves the final results of the capture session, including:
- All captured images
- Session metadata

This method must be called within the ProcessingActor context.
/// Processes a camera frame for document analysis.
/// - Parameter image: The camera frame to analyze
public func analyze(image: CameraFrame) async {
    guard !paused else { return }

    let inputImage = InputImage(cameraFrame: image)
    let result = await session.process(inputImage: inputImage)

    if result.processResult?.resultCompleteness.scanningStatus == .documentScanned {
        guard !scanningDone else { return }
        scanningDone = true
        Task { @ProcessingActor in
            let sessionResult = session.getResult()
            // Finish scanning
        }
    }
}
Please refer to our `BlinkIDAnalyzer` in the BlinkIDUX module for implementation details and guidance on its usage.
- Actor Isolation: Many methods must be called within the `ProcessingActor` context to ensure thread safety
- Session Management: Each session maintains its own unique identifier and state
- Resource Management: Proper initialization with valid resource paths is crucial for operation
- Cancellation Support: Operations can be cancelled at any time using `cancelActiveProcessing()`
The class implements the `Sendable` protocol and uses actor isolation (`@ProcessingActor`) to ensure thread-safe operations in concurrent environments.
- Ensure proper error handling for processing operations
- Consider implementing timeout mechanisms for long-running operations (see the sketch after this list)
- Maintain proper lifecycle management of the session
- Handle results appropriately according to your application's needs
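For the timeout point above, a minimal sketch, assuming an existing `BlinkIDSession` named `session` (the 15-second interval is an arbitrary choice for illustration):

```swift
// Cancel active processing if scanning hasn't finished within 15 seconds.
let timeoutTask = Task {
    try? await Task.sleep(nanoseconds: 15_000_000_000)
    guard !Task.isCancelled else { return }
    session.cancelActiveProcessing()
}

// Cancel the timeout once scanning completes normally:
// timeoutTask.cancel()
```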
`ProcessResult` is a Swift structure that encapsulates the complete results of a document scanning process, combining frame analysis with completion status information.
- Document detection status
- Completion status
Contains detailed analysis results for a single frame in the verification process.
An enumeration representing the status of document detection during scanning.
- `failed`: Document recognition failed
- `success`: Document recognition completed successfully
- `cameraTooFar`: Document has been detected but the camera is too far from the document
- `cameraTooClose`: Document has been detected but the camera is too close to the document
- `cameraAngleTooSteep`: Document has been detected but the camera’s angle is too steep
- `documentTooCloseToCameraEdge`: Document has been detected but the document is too close to the camera edge
- `documentPartiallyVisible`: Only part of the document is visible
An enumeration that defines the possible statuses that can occur during the scanning operation, specifically for managing the progress of scanning sides and the entire document.
- `scanningSideInProgress`: Scanning of the current document side is in progress
- `scanningBarcodeInProgress`: Barcode scanning is in progress
- `sideScanned`: The current document side has been scanned
- `documentScanned`: The entire document has been scanned
- `cancelled`: The scanning operation has been cancelled
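A sketch of how these statuses might drive user guidance; `frameResult`, `showHint(_:)`, and `finishScanning()` are hypothetical names used for illustration:

```swift
// Map ScanningStatus from a processed frame to user-facing guidance.
if let status = frameResult.processResult?.resultCompleteness.scanningStatus {
    switch status {
    case .scanningSideInProgress, .scanningBarcodeInProgress:
        showHint("Keep the document steady")
    case .sideScanned:
        showHint("Flip the document")
    case .documentScanned:
        finishScanning()
    case .cancelled:
        break
    @unknown default:
        break
    }
}
```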
A structure tracking the progress of different verification phases.
- `scanningStatus`: `ScanningStatus` - Indicates the status of the scanning process
- `vizExtracted`: `Bool` - Indicates if the VIZ fields have been extracted
- `mrzExtracted`: `Bool` - Indicates if the MRZ fields have been extracted
- `barcodeExtracted`: `Bool` - Indicates if the barcode fields have been extracted
- `documentImageExtracted`: `Bool` - Indicates if the document image has been extracted
- `faceImageExtracted`: `Bool` - Indicates if the face image has been extracted
- `signatureImageExtracted`: `Bool` - Indicates if the signature image has been extracted
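A short sketch of inspecting these flags after processing a frame (again with a hypothetical `frameResult`):

```swift
// Check which parts of the document have been extracted so far.
if let completeness = frameResult.processResult?.resultCompleteness {
    if completeness.mrzExtracted { print("MRZ fields extracted") }
    if completeness.vizExtracted { print("VIZ fields extracted") }
    if completeness.faceImageExtracted { print("Face image extracted") }
}
```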
Represents a 2D point in the coordinate system.
- `x`: `Int32` - X-coordinate
- `y`: `Int32` - Y-coordinate
Represents a four-sided polygon defined by its corner points.
- `upperLeft`: `Point`
- `upperRight`: `Point`
- `lowerRight`: `Point`
- `lowerLeft`: `Point`
Combines physical position and orientation information of a detected document.
- `location`: `Quadrilateral` - Boundary coordinates of the detected document
- `orientation`: `CardOrientation` - Orientation of the detected document
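A sketch of turning a detected `Quadrilateral` into `CGPoint`s, e.g. to draw a custom overlay; `documentLocation` is a hypothetical `DocumentLocation` value obtained from a frame result:

```swift
import CoreGraphics

// Convert the four corner Points (Int32 coordinates) to CGPoints.
let quad = documentLocation.location
let corners = [quad.upperLeft, quad.upperRight, quad.lowerRight, quad.lowerLeft]
    .map { CGPoint(x: CGFloat($0.x), y: CGFloat($0.y)) }
```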
if result.processResult?.resultCompleteness.scanningStatus == .documentScanned {
    guard !scanningDone else { return }
    scanningDone = true
    Task { @ProcessingActor in
        let sessionResult = session.getResult()
        // Finish scanning
    }
}
- Error Handling
  - Monitor `documentDetectionStatus` for potential capture issues
  - Check `processingStatus` for overall processing success
- Quality Control
  - Use `blur` and `glare` detection to ensure optimal image quality
- Progress Tracking
  - Use `ResultCompleteness` to track verification progress
  - Monitor `scanningStatus` for overall process state
  - Handle partial completions appropriately
All types conform to the `Sendable` protocol, ensuring thread-safe operations in concurrent environments.
- Always check `resultCompleteness.scanningStatus == .documentScanned` before concluding the scanning process
- Implement proper error handling for all possible `DetectionStatus` cases
- Monitor quality indicators (blur, glare, moire) for optimal capture conditions
- Implement appropriate user feedback based on `processingStatus` and `documentDetectionStatus`
`InputImage` is a Swift class that wraps either a UIImage or a camera frame for processing in the document scanning system.
// Create from UIImage
public init(uiImage: UIImage, regionOfInterest: RegionOfInterest = RegionOfInterest())
// Create from camera frame
public init(cameraFrame: CameraFrame)
A structure that defines the area of interest within an image for processing.
- `x`: `Float` - X-coordinate (normalized between 0 and 1)
- `y`: `Float` - Y-coordinate (normalized between 0 and 1)
- `width`: `Float` - Width (normalized between 0 and 1)
- `height`: `Float` - Height (normalized between 0 and 1)
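For example, a region covering the central 80% of the image might look like this (a sketch; `documentImage` is a hypothetical `UIImage`):

```swift
// Restrict processing to the middle 80% of the image,
// using normalized coordinates (0...1).
let centeredRoi = RegionOfInterest(x: 0.1, y: 0.1, width: 0.8, height: 0.8)
let inputImage = InputImage(uiImage: documentImage, regionOfInterest: centeredRoi)
```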
An enumeration representing the device orientation during frame capture.
- `portrait`: Device in normal upright position
- `portraitUpsideDown`: Device held upside down
- `landscapeRight`: Device rotated 90 degrees clockwise
- `landscapeLeft`: Device rotated 90 degrees counterclockwise
A structure representing a complete camera frame with its metadata.
- `buffer`: `MBSampleBufferWrapper` - Raw camera buffer containing image data
- `roi`: `RegionOfInterest` - Region of interest within the frame
- `orientation`: `CameraFrameVideoOrientation` - Camera orientation
- `width`: `Int` - Frame width in pixels (computed property)
- `height`: `Int` - Frame height in pixels (computed property)
public init(
    buffer: MBSampleBufferWrapper,
    roi: RegionOfInterest = RegionOfInterest(),
    orientation: CameraFrameVideoOrientation = .portrait
)
func processCameraOutput(_ sampleBuffer: CMSampleBuffer) {
    let frame = CameraFrame(
        buffer: MBSampleBufferWrapper(buffer: sampleBuffer),
        roi: RegionOfInterest(x: 0, y: 0, width: 1.0, height: 1.0),
        orientation: .portrait
    )
    let inputImage = InputImage(cameraFrame: frame)
}
// Creating from UIImage
let inputImage1 = InputImage(
    uiImage: documentImage,
    regionOfInterest: RegionOfInterest(x: 0, y: 0, width: 1.0, height: 1.0)
)

// Creating from camera frame
let inputImage2 = InputImage(cameraFrame: cameraFrame)
- Image Source Handling
  - Choose appropriate initialization method based on image source (UIImage or camera frame)
  - Consider memory management implications when working with camera frames
  - Handle orientation conversions properly
- Region of Interest
  - Use normalized coordinates (0-1) for region of interest
  - Validate region boundaries to prevent out-of-bounds issues
  - Consider UI implications when setting custom regions
- Performance Optimization
  - Minimize frame buffer copies
  - Consider frame rate and processing overhead
  - Handle memory efficiently when processing multiple frames

- `InputImage` and related types conform to the `Sendable` protocol
- `CameraFrame` is marked as `@unchecked Sendable` due to buffer handling
- Care should be taken when sharing frames across threads

- Memory Management
  - Release camera frames promptly after processing
  - Avoid unnecessary copies of large image buffers
  - Use appropriate autorelease pool when processing multiple frames
- Error Handling
  - Check frame dimensions before processing
  - Handle invalid region of interest parameters
  - Validate image orientation data
- Performance
  - Process frames in appropriate queue/thread
  - Consider frame rate requirements
  - Optimize region of interest for specific use cases
The SDK supports both downloaded and bundled resources:
- Automatic resource downloading and caching
- Bundle-based resource loading
- Resource validation and verification
The SDK supports downloading machine learning models from our CDN. Models are automatically retrieved from https://models.cdn.microblink.com/resources when enabled.
To enable model downloads, set the downloadResources property to true in your `BlinkIDSdkSettings`:
let settings = BlinkIDSdkSettings(
    licenseKey: yourLicenseKey,
    downloadResources: true // Enable model downloads
)
By default, downloaded models are stored in the `MLModels` folder. You can specify a custom storage location using the `resourceLocalFolder` property in the settings.
Model downloads occur during SDK initialization in the `createBlinkIDSdk` method:
do {
    let instance = try await BlinkIDSdk.createBlinkIDSdk(withSettings: settings)
} catch let error as ResourceDownloaderError {
    // Handle specific download errors
    if case .noInternetConnection = error {
        // Handle no internet connection
    }
    // Handle other download errors as needed
}
The operation may throw a `ResourceDownloaderError` with the following possible cases:

Error Case | Description |
---|---|
`invalidURL(String)` | The provided URL for model download is invalid |
`downloadFailed(Int)` | Download failed with specific HTTP status code |
`fileNotFound(URL)` | Resource file not found at specified location |
`hashMismatch(String)` | File hash verification failed |
`fileAccessError(Error)` | Error accessing file system |
`cacheDirNotFound` | Cache directory not found |
`fileCreationError(Error)` | Error creating file |
`noInternetConnection` | No internet connection available |
`invalidResponse` | Invalid or unexpected server response |
`resourceUnavailable` | Requested resource is not available |
The SDK provides built-in components for handling network connectivity states.
`NetworkMonitor` is a ready-to-use network connectivity monitor that uses `NWPathMonitor`:
@MainActor
public class NetworkMonitor: ObservableObject {
    @Published public var isConnected = true
    public var isOffline: Bool { !isConnected }

    public init() {
        setupMonitor()
    }
}
`NoInternetView` is a pre-built SwiftUI view for handling offline states:
public struct NoInternetView: View {
    public init(retryAction: @escaping () -> Void)
}
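A sketch of how these two components might be combined to gate the scanning UI on connectivity; `ScanningScreen` is a hypothetical placeholder for your own view:

```swift
import SwiftUI

struct ConnectivityGatedView: View {
    @StateObject private var networkMonitor = NetworkMonitor()

    var body: some View {
        if networkMonitor.isOffline {
            NoInternetView {
                // Retry action, e.g. re-attempt SDK initialization
            }
        } else {
            ScanningScreen()
        }
    }
}
```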
To use bundled models with our SDK, ensure the required model files are included in your app package and set the `downloadResources` property of `BlinkIDSdkSettings` to `false`. Specify the location of the bundled models using the `bundleURL` property of `BlinkIDSdkSettings`. If you are using the main bundle, you can retrieve its URL as follows:
let bundle = Bundle.main.bundleURL
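Putting it together, a sketch of a bundled-resources configuration. The exact `BlinkIDSdkSettings` initializer signature isn't shown in this document, so this assumes `bundleURL` is a settable property:

```swift
// Assumed: bundleURL is a mutable property on BlinkIDSdkSettings.
var settings = BlinkIDSdkSettings(
    licenseKey: "your-license-key",
    downloadResources: false
)
settings.bundleURL = Bundle.main.bundleURL

let sdk = try await BlinkIDSdk.createBlinkIDSdk(withSettings: settings)
```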
The BlinkIDUX package is source-available, allowing you to customize and adapt its functionality to suit the specific needs of your project. This flexibility ensures that you can tailor the user experience and scanning workflow to align with your app’s design and requirements.
In the next section, we will explain the main components of the BlinkIDUX package and how they work together to simplify document scanning integration.
The BlinkIDAnalyzer component provides a robust set of features to streamline and enhance the document scanning process. It includes real-time camera frame analysis for immediate feedback during scanning and asynchronous event streaming to ensure smooth, non-blocking operations. The component supports pause and resume functionality, allowing users to temporarily halt the process and continue seamlessly. With session result handling, developers can easily access and process the final scanning outcomes. Additionally, it offers cancellation support, enabling users to terminate the scanning process at any time. Designed with comprehensive UI event feedback, it delivers clear and actionable guidance to users throughout the scanning workflow.
An enumeration that represents different sides of a document during the scanning process:
public enum DocumentSide: Sendable {
    case front   // Front side of the document
    case back    // Back side of the document
    case barcode // Barcode region of the document
}
An actor that manages the stream of UI events during the document scanning process:
public actor BlinkIDEventStream: EventStream {
    public func send(_ events: [UIEvent])
    public var stream: AsyncStream<[UIEvent]>
}
The main analyzer component that processes camera frames and manages the document scanning workflow:
public actor BlinkIDAnalyzer: CameraFrameAnalyzer {
    public init(
        sdk: BlinkIDSdk,
        blinkIDSessionSettings: BlinkIDSessionSettings = BlinkIDSessionSettings(inputImageSource: .video),
        eventStream: BlinkIDEventStream
    )
}
// Create an event stream
let eventStream = BlinkIDEventStream()

// Initialize the analyzer
let analyzer = await BlinkIDAnalyzer(
    sdk: blinkIDSdk,
    blinkIDSessionSettings: BlinkIDSessionSettings(inputImageSource: .video),
    eventStream: eventStream
)

// Analyze camera frames
for await frame in await camera.sampleBuffer {
    await analyzer.analyze(
        image: CameraFrame(
            buffer: MBSampleBufferWrapper(cmSampleBuffer: frame.buffer),
            roi: roi,
            orientation: camera.orientation.toCameraFrameVideoOrientation()
        )
    )
}
The component provides real-time feedback through an event stream. Events can be observed to update the UI or trigger specific actions based on the scanning progress.
eventHandlingTask = Task {
    for await events in await analyzer.events.stream {
        if events.contains(.requestDocumentSide(side: .back)) {
            firstSideScanned()
        } else if events.contains(.requestDocumentSide(side: .barcode)) {
            self.setReticleState(.barcode, force: true)
        } else if events.contains(.wrongSide) {
            self.setReticleState(.error("Flip the document"))
        } else if events.contains(.tooClose) {
            self.setReticleState(.error("Move farther"))
        } else if events.contains(.tooFar) {
            self.setReticleState(.error("Move closer"))
        } else if events.contains(.tooCloseToEdge) {
            self.setReticleState(.error("Move the document from the edge"))
        } else if events.contains(.tilt) {
            self.setReticleState(.error("Keep document parallel with the phone"))
        } else if events.contains(.blur) {
            self.setReticleState(.error("Keep document and phone still"))
        } else if events.contains(.glare) {
            self.setReticleState(.error("Tilt or move document to remove reflection"))
        } else if events.contains(.notFullyVisible) {
            self.setReticleState(.error("Keep document fully visible"))
        } else if events.contains(.occlusion) {
            self.setReticleState(.error("Keep document fully visible"))
        }
    }
}
When integrating the BlinkIDAnalyzer component, it is essential to follow best practices to ensure a seamless and efficient document scanning experience. By adhering to these guidelines, you can enhance user satisfaction and maintain robust application performance:
- Always handle the event stream to provide real-time user feedback during the scanning process.
- Implement proper error handling to manage scan failures gracefully and guide users effectively.
- Consider implementing timeout handling for production environments to prevent indefinite scanning sessions. Check out our implementation for more details.
- Manage memory efficiently by calling cancel() when the scanner is no longer needed, freeing up resources.
- Handle the pause/resume cycle appropriately to align with app lifecycle events, ensuring a consistent user experience (see the sketch after this list).
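For the pause/resume point above, a sketch using SwiftUI's `scenePhase`; it assumes an analyzer conforming to the `CameraFrameAnalyzer` protocol described later in this document, and `ScannerContent` is a hypothetical placeholder view:

```swift
import SwiftUI

struct ScannerLifecycleView: View {
    @Environment(\.scenePhase) private var scenePhase
    let analyzer: CameraFrameAnalyzer

    var body: some View {
        ScannerContent()
            .onChange(of: scenePhase) { phase in
                Task {
                    switch phase {
                    case .active:
                        await analyzer.resume()
                    case .inactive, .background:
                        await analyzer.pause()
                    @unknown default:
                        break
                    }
                }
            }
    }
}
```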
The BlinkIDUXModel is a comprehensive view model that manages the document scanning user experience in iOS applications. This component handles camera preview, document detection, user guidance, and scanning state transitions.
BlinkIDUXModel serves as the business logic layer for the document scanning interface. It is designed as a MainActor to ensure thread-safe UI updates and implements the ObservableObject protocol for SwiftUI integration. The model manages the entire scanning workflow, from camera initialization to document capture and verification.
The model offers comprehensive camera control functionality through seamless integration with AVFoundation. It includes a fully implemented camera session, ensuring thread-safe access and synchronization with the BlinkIDAnalyzer for reliable and efficient performance.
The camera system is designed to provide robust and user-friendly functionality, including:
- Automatic session management for effortless setup and teardown.
- Torch/flashlight control to adapt to various lighting conditions.
- Frame capture and analysis synchronization for real-time document processing.
- Orientation handling to ensure correct alignment regardless of device orientation.
public class ScanningViewModel<T>: ObservableObject, ScanningViewModelProtocol {
    let camera: Camera = Camera()
}
The model provides comprehensive user feedback mechanisms. The feedback features are designed to enhance user experience and guide users effectively throughout the document scanning process. These include:
- Visual guidance through reticle state management, helping users align documents accurately.
- Error messaging and recovery suggestions, providing clear instructions to resolve issues.
- Success animations and transitions, offering a smooth and engaging user experience upon successful scans.
- Accessibility announcements, ensuring inclusivity by providing auditory feedback for users with disabilities.
- Progress indicators, keeping users informed about the scanning status in real time.
@Published var reticleState: ReticleState = .front
The model features a sophisticated animation system designed to provide dynamic and engaging user feedback during the document scanning process. Key animations include:
- Document flip animations, guiding users to scan both sides of a document when required.
- Success indicators, visually confirming successful scans.
- Ripple effects, drawing attention to areas of interest during the scanning process.
- State transitions, ensuring smooth and intuitive changes between different scanning states.
When implementing the BlinkIDUXModel, it’s crucial to follow best practices to ensure smooth and efficient operation. Key considerations include:
- Implement proper error handling to manage all scanning states gracefully and provide a robust user experience.
- Monitor memory usage during extended scanning sessions to avoid potential performance bottlenecks or crashes.
- Clean up resources promptly when the scanning process is complete to maintain optimal app performance.
- Handle orientation changes appropriately to ensure consistent user experience across different device orientations.
BlinkIDUXView is the main scanning interface component that combines camera functionality with user interaction elements. The view is designed to provide real-time feedback during the document scanning process while maintaining a clean and intuitive user interface.
The view is built using SwiftUI and follows the MVVM (Model-View-ViewModel) pattern, where:
- The view (BlinkIDUXView) handles the UI layout and user interactions
- The view model (BlinkIDUXModel) manages the business logic and state
- The camera integration handles the document capture pipeline
The view incorporates a camera feed through the CameraView component and manages the entire capture pipeline, including:
- Automatic camera session management
- Support for torch/flashlight functionality
- Real-time frame processing
- Orientation handling
The interface consists of several key components:
- Camera Feed
  - Full-screen camera preview
  - Real-time document boundary detection
  - Automatic orientation adjustment
- Reticle
  - Visual guidance for document positioning
  - Dynamic state feedback
  - Accessibility support
  - Animation capabilities
- Control Buttons
  - Cancel button for session termination
  - Torch button for lighting control
  - Help button for user guidance
- Feedback Elements
  - Success indicators
  - Visual animations
  - Accessibility announcements
The component provides comprehensive accessibility support:
- VoiceOver compatibility
- Dynamic text scaling
- Accessibility labels and hints
- Automatic announcements for state changes
- Always initialize the view with a properly configured view model
- Implement proper error handling and user feedback
- Monitor memory usage during extended scanning sessions
- Handle orientation changes appropriately
- Implement proper cleanup on view dismissal
You have the flexibility to create your own custom UX if needed. However, we strongly recommend following the implementations provided in this package as a foundation. Since the package is source-available, you can modify or extend the code directly within your project to tailor it to your specific requirements.
We also highly recommend using our built-in Camera and CameraView components, as they are fully optimized for performance and seamlessly integrated with the BlinkIDAnalyzer. However, if necessary, you can implement and use your own camera solution.
If implementing your own Camera component, be sure to wrap your CMSampleBufferRef in our `MBSampleBufferWrapper`. `MBSampleBufferWrapper` safely encapsulates a Core Media sample buffer, ensuring proper reference counting and memory management while maintaining binary compatibility across different Swift versions.
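A sketch of a custom camera feeding frames to an analyzer; `MyCameraController` and `analyzer` are hypothetical, and the wrapper initializer label follows the usage shown elsewhere in this document:

```swift
import AVFoundation

extension MyCameraController: AVCaptureVideoDataOutputSampleBufferDelegate {
    func captureOutput(_ output: AVCaptureOutput,
                       didOutput sampleBuffer: CMSampleBuffer,
                       from connection: AVCaptureConnection) {
        // Wrap the Core Media buffer so the SDK manages its lifetime safely.
        let frame = CameraFrame(
            buffer: MBSampleBufferWrapper(cmSampleBuffer: sampleBuffer),
            orientation: .portrait
        )
        Task { await analyzer.analyze(image: frame) }
    }
}
```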
To integrate the document scanning workflow effectively, you will start by creating a ViewModel to manage the scanning logic and interface with the underlying components. The ViewModel acts as the bridge between the CameraFrameAnalyzer and your UI, handling data flow, state management, and event processing.
In this section, we will guide you through the process of setting up and configuring the ViewModel, integrating it with the camera and analyzer components, and linking it to your SwiftUI view. This approach ensures a modular and maintainable architecture while leveraging the optimized components provided by the SDK.
@MainActor
public final class ViewModel: ObservableObject {
    // Use our Camera controller
    let camera: Camera = Camera()
    let analyzer: CameraFrameAnalyzer

    // Instructions text for the View
    @Published var instructionText: String = "Scan the front side"

    // Published scanning result state; use it in your ViewModel, or directly in your SwiftUI View
    @Published public var result: BlinkIDResultState?

    // State used by analyze() below to surface alerts and errors
    @Published var alertType: BlinkIDScanningAlertType?
    @Published var showScanningAlert = false
    @Published var showLicenseErrorAlert = false

    private var eventHandlingTask: Task<Void, Never>?

    public init(analyzer: CameraFrameAnalyzer) {
        self.analyzer = analyzer
        startEventHandling()
    }
The `startEventHandling` method initializes an event task, allowing you to receive and process UIEvents from the CameraFrameAnalyzer’s event stream.
private func startEventHandling() {
    eventHandlingTask = Task {
        for await events in await analyzer.events.stream {
            if events.contains(.requestDocumentSide(side: .back)) {
                instructionText = "Scan the back side"
            } else if events.contains(.requestDocumentSide(side: .barcode)) {
                instructionText = "Scan the barcode"
            } else if events.contains(.wrongSide) {
                instructionText = "Flip the document"
            } else if events.contains(.tooClose) {
                instructionText = "Move farther"
            } else if events.contains(.tooFar) {
                instructionText = "Move closer"
            } else if events.contains(.tooCloseToEdge) {
                instructionText = "Move farther"
            } else if events.contains(.tilt) {
                instructionText = "Keep document parallel with the phone"
            } else if events.contains(.blur) {
                instructionText = "Keep document and phone still"
            } else if events.contains(.glare) {
                instructionText = "Tilt or move document to remove reflection"
            } else if events.contains(.notFullyVisible) {
                instructionText = "Keep document fully visible"
            }
        }
    }
}
Implement the `analyze` and `pauseScanning` methods:
public func analyze() async {
    Task {
        let result = await analyzer.result()
        if let scanningResult = result as? ScanningResult<BlinkIDScanningResult, BlinkIDScanningAlertType> {
            switch scanningResult {
            case .completed(let scanningResult):
                finishScan()
                self.result = BlinkIDResultState(scanningResult: scanningResult)
            case .interrupted(let alertType):
                self.alertType = alertType
                showScanningAlert = true
            case .cancelled:
                showLicenseErrorAlert = true
            case .ended:
                self.result = BlinkIDResultState(scanningResult: nil)
            }
        }
    }

    for await frame in await camera.sampleBuffer {
        await analyzer.analyze(
            image: CameraFrame(
                buffer: MBSampleBufferWrapper(cmSampleBuffer: frame.buffer),
                roi: roi,
                orientation: camera.orientation.toCameraFrameVideoOrientation()
            )
        )
    }
}

func pauseScanning() {
    Task {
        await analyzer.cancel()
    }
}
The CameraFrameAnalyzer protocol defines the core interface for components that analyze camera frames during document scanning operations. This protocol is designed to provide a standardized way of processing camera input while maintaining thread safety through Swift's concurrency system.
The protocol defines essential methods and properties for analyzing camera frames in real-time, managing the analysis lifecycle, and providing feedback through an event stream.
public protocol CameraFrameAnalyzer: Sendable {
    func analyze(image: CameraFrame) async
    func cancel() async
    func pause() async
    func resume() async
    func restart() async throws
    func end() async
    func result() async -> ScanningResult
    var events: EventStream { get }
}
analyze(image: CameraFrame) async
Processes a single camera frame for analysis. This method operates asynchronously to prevent blocking the main thread during intensive image processing operations.
cancel() async
Terminates the current analysis operation immediately. This method ensures proper cleanup of resources when analysis needs to be stopped before completion.
pause() async
Temporarily suspends the analysis operation while maintaining the current state. This is useful for scenarios where analysis needs to be temporarily halted, such as when the app enters the background.
resume() async
Continues a previously paused analysis operation. This method restores the analyzer to its active state and resumes processing frames.
restart() async throws
Starts a new analysis operation. This method resets the analyzer to its initial state and resumes processing frames.
end() async
Ends the analysis operation. This is used when analysis needs to be stopped, such as when the cancel button is pressed.
result() async -> ScanningResult
Retrieves the final result of the analysis operation. This method returns a ScanningResult object containing the analysis outcome.
events: EventStream
Provides access to a stream of UI events generated during the analysis process. This stream can be used to update the user interface based on analysis progress and findings.
We strongly recommend using our `BlinkIDAnalyzer` for seamless integration. However, you can create your own custom implementation as long as it conforms to the `CameraFrameAnalyzer` protocol.
Once the ViewModel is set up and configured, the next step is to create a SwiftUI view that interacts with it. The ViewModel serves as the central point for managing the document scanning workflow, including handling events, processing results, and updating the UI state. By linking the ViewModel to your view, you can create a dynamic and responsive interface that provides real-time feedback to users. In this section, we will demonstrate how to build a SwiftUI view using the ViewModel, ensuring seamless integration with the underlying document scanning components.
Start by creating a new SwiftUI View called CaptureView.
struct CaptureView: View {
    @ObservedObject private var viewModel: ViewModel

    init(viewModel: ViewModel) {
        self.viewModel = viewModel
    }
}
We need to add CameraView to our body:
var body: some View {
    GeometryReader { geometry in
        ZStack {
            CameraView(camera: viewModel.camera)
                .ignoresSafeArea()
                .statusBarHidden()
                .task {
                    await viewModel.camera.start()
                    await viewModel.analyze()
                }
                .onDisappear {
                    viewModel.stopEventHandling()
                    Task {
                        await viewModel.camera.stop()
                    }
                }
        }
    }
}
The camera feed is displayed using CameraView, which is tightly integrated with the ViewModel’s camera property. The .task modifier ensures that the camera starts and begins analysis as soon as the view appears, and proper cleanup is handled in onDisappear to stop the camera and event handling gracefully.
We also need to add an instruction view to our ZStack that will display our ViewModel's `instructionText`:
VStack {
    Spacer()
    // Text is in the middle of the screen
    Text(viewModel.instructionText)
        .font(.system(size: 20))
        .foregroundColor(Color.white)
        .padding()
        .background(Color.gray)
        .clipShape(.capsule)
    Spacer()
}
In this section, we will demonstrate how to establish the connection between the ViewModel and the View to facilitate a seamless document scanning workflow:
let analyzer = await BlinkIDAnalyzer(
    sdk: localSdk,
    eventStream: BlinkIDEventStream()
)

let viewModel = ViewModel(analyzer: analyzer)
In your SwiftUI View, add CaptureView:
struct ContentView: View {
    var body: some View {
        CaptureView(viewModel: viewModel)
    }
}
And that's it! You have created a custom SwiftUI View and ViewModel!
Our app supports localization following Apple’s recommended approach. We provide a `Localizable.xcstrings` file that you can use or modify as needed. Localization is determined by the system settings, meaning you must define supported languages in your app’s `Info.plist` under the `Localizations` key, ensuring all required keys are included. Once configured, users can change the app’s language via Settings > [App Name] > Language. Note that in-app language switching is not supported, as we adhere to Apple’s intended localization flow.
In case of problems with using the SDK, you should do as follows:
If you are getting an "invalid licence key" error or having other licence-related problems (e.g. some feature is not enabled that should be, or there is a watermark on top of the camera), first check the console. All licence-related problems are logged to the error log, so it is easy to determine what went wrong.
When you have determined what the licence-related problem is, or if you simply do not understand the log, you should contact us at help.microblink.com. When contacting us, please make sure you provide the following information:
- exact Bundle ID of your app (from your `Info.plist` file)
- licence that is causing problems
- please stress that you are reporting a problem related to the iOS version of the BlinkID SDK
- if unsure about the problem, you should also provide an excerpt from the console containing the licence error
If you are having problems with scanning certain items, undesired behaviour on specific device(s), crashes inside the BlinkID SDK or anything else unmentioned, please do as follows:
- Contact us at help.microblink.com describing your problem and provide the following information:
- log file obtained in the previous step
- high resolution scan/photo of the item that you are trying to scan
- information about the device that you are using
- please stress that you are reporting a problem related to the iOS version of the BlinkID SDK
BlinkID is a really lightweight SDK. The compressed size is just 2.1 MB. The SDK size calculation is done by creating an App Size Report with Xcode, one with and one without the SDK. Here is the SDK App Size Report for iPhone:
Size | App + On Demand Resources size | App size |
---|---|---|
compressed | 2.4 MB | 2.4 MB |
uncompressed | 5.5 MB | 5.5 MB |
The uncompressed size is equivalent to the size of the installed app on the device, and the compressed size is the download size of your app. You can find the App Size Report here.
Complete API references can be found: