Conformer2
Unlocking accessibility: every document, every format
Conformer2 transforms documents into accessible formats, making information inclusive for all, streamlining compliance and management.
Desktop
Overview
Our expert team dove into Conformer2, discovering an AI powerhouse that effortlessly makes documents accessible to everyone, including those with disabilities. It's a game-changer, transforming texts into Braille, large print, audio, and accessible PDFs with a few clicks. What stood out was its automation feature, slashing the time needed for conversions and significantly reducing effort. The tool shines in organization, making document retrieval a breeze. Conformer2 is not just about compliance with accessibility standards; it's about enhancing efficiency and simplifying tasks for a wide audience. Its user-friendly interface invites everyone to harness its capabilities, making it a standout choice for making information universally accessible.
Use cases
- Schools and Learning: It turns educational materials like textbooks and study guides into Braille or audio, opening up a world of learning for students with disabilities.
- Government: Helps agencies make their documents, like policies and reports, accessible to all citizens, ensuring they meet set standards.
- Businesses: Enables companies to create accessible internal documents, manuals, and training materials, promoting an inclusive work environment.
- Publishing: Assists publishers in reaching a broader audience by converting books and magazines into formats accessible to those with print disabilities.
- Libraries: Offers services that let library-goers access any book or document in formats they prefer or need.
- Healthcare: Makes medical information, prescriptions, and records accessible to visually impaired patients and professionals.
- Legal Sector: Ensures legal documents, contracts, and filings are accessible, allowing everyone equal access to crucial information.
- Websites: Helps make online content accessible, ensuring websites meet accessibility standards and reach a wider audience.
- Reports: Allows NGOs and government agencies to make their reports and publications accessible to the public, including those with disabilities.
- Personal Use: Even for personal documents like resumes and letters, Conformer2 can convert them into accessible formats, making sharing easier.
Users & Stats
Website Traffic
Traffic Sources
Users by Country
FAQ
What file types are supported by the AssemblyAI API? Are there recommended formats?
The AssemblyAI API supports most common audio and video file formats. We recommend that you submit your audio in its native format without additional transcoding or file conversion. Transcoding or converting it to another format can sometimes result in a loss of quality, especially if you're converting compressed formats like .mp3. The AssemblyAI API converts all files to 8khz uncompressed audio as part of our transcription pipeline.
What are the API limits on file size or file duration?
Currently, the maximum file size that can be submitted to the /v2/transcript endpoint for transcription is 5GB, and the maximum duration is 10 hours. The maximum file size for a local file uploaded to the API via the /v2/upload endpoint is 2.2GB.
How long does transcription take?
Processing times for our asynchronous transcription API are based on the duration of the submitted audio and models enabled in the request. The majority of files sent to our API will complete in under 45 seconds, with a Real-Time-Factor (RTF) as low as .008x. Real-time transcription files receive a response within a few hundred milliseconds.
Can I get timestamps for individual words? How do timestamps work?
The response for a completed request includes start and end keys. These keys are timestamp values for when a given word, phrase, or sentence starts and ends. These values are in milliseconds and are accurate to within about 400 milliseconds.
How do the Custom Vocabulary and Custom Spelling features work?
The Custom Vocabulary feature allows you to submit a list of words or phrases to boost the likelihood that the model predicts those words. The Custom Spelling feature allows you to control how words are spelled or formatted in the transcript text, working like a find and replace feature.
How long are audio or video files submitted to the API stored?
Files submitted to the API are deleted from our servers as soon as the transcription is completed. If you upload a local file but don't transcribe it, we delete it after 24 hours.
Can completed transcripts be deleted?
Completed transcripts are stored in our database, encrypted at rest. To permanently delete the transcription from our database, you can make a DELETE request to the API.
Can I get a list of all transcripts I have created?
You can retrieve a list of all transcripts that you have created by making a GET request to the API.
Do you offer any discounts on pricing?
If you plan to send a large amount of audio or video content through our API, please reach out to support@assemblyai.com to see if you qualify for a volume discount.
How can I get more information about an error? How do I contact support?
Any time you make a request to the API, you should receive a JSON response. If you don't receive the expected output, the JSON contains an error key with a message value describing the error. You can also reach out to our support team by sending an email to support@assemblyai.com for assistance.
Pricing & discounts
Service Category | Service | Price |
---|---|---|
Speech and Transcription | Speech-to-Text | $0.37 per hour |
| Real-time Transcription | $0.47 per hour |
Audio Intelligence Models | Key Phrases | $0.01 per hour |
| Sentiment Analysis | $0.02 per hour |
| Summarization | $0.03 per hour |
| PII Audio Redaction | $0.05 per hour |
| PII Redaction | $0.08 per hour |
| Auto Chapters | $0.08 per hour |
| Entity Detection | $0.08 per hour |
| Content Moderation | $0.15 per hour |
| Topic Detection | $0.15 per hour |
LeMUR Models | LeMUR Default/Claude 2.1 (Input) | $0.015 per 1K tokens |
| LeMUR Default/Claude 2.1 (Output) | $0.043 per 1K tokens |
| LeMUR Basic (Input) | $0.002 per 1K tokens |
| LeMUR Basic (Output) | $0.005 per 1K tokens |
User Reviews
There are no reviews here yet. Be the first to leave review.
Hi, there!
Funding
AssemblyAI, a company specializing in speech AI models, recently secured $50 million in Series C funding, led by Accel. This funding brings their total raised funds to $115 million.
AssemblyAI's mission is to create AI models that enable companies to integrate voice data-based AI applications into their products and workflows. Their latest AI model, Conformer-2, is trained on a vast amount of voice data and can perform tasks like converting speech to text and identifying speakers. They are also working on a universal model trained on over 10 million hours of voice data.
AssemblyAI's API is already being used by thousands of organizations, with over 10,000 new organizations signing up every month. The funding will support their research, the development of new models, training resources, market expansion, and team growth.
The most popular AI tools for audio
Suno
Audio
Adobe Podcast AI
Audio
Musicfy AI
Audio
Auphonic AI
Audio
Guide AI
Travel
Playtext.app
Audio
Huddles.app
Audio
SpeechGen
Audio
More tools