Read aloud any webpage — even Kindle and WeRead
CastReader 2.0 is a browser extension that converts web-based text content into spoken audio using text-to-speech (TTS) technology. It is designed for users who benefit from auditory learning, require accessibility support, or wish to consume written material hands-free. The tool operates directly within the browser without requiring user accounts, subscriptions, or configuration.
It targets diverse user groups including individuals with dyslexia or ADHD, students and researchers reviewing academic materials, professionals managing information overload, language learners practicing pronunciation, and users prioritizing digital wellness or visual accessibility. CastReader supports reading across a wide range of online platforms, including those with complex rendering techniques such as Canvas-based text layers.
CastReader 2.0 functions as a lightweight browser extension that activates on demand. After installation from the Chrome Web Store, it runs locally within the browser context. When a user navigates to a supported webpage — such as an article, e-book, research paper, or AI chat interface — they click the CastReader icon to initiate playback. The extension analyzes the page’s DOM structure and, where necessary, applies OCR and font decoding to extract readable text from Canvas elements, particularly in Kindle Cloud Reader and WeRead.
Content extraction includes intelligent filtering: non-essential elements like advertisements, navigation menus, and footers are excluded from the speech stream. The system then segments the extracted text into logical paragraphs and reads them aloud using locally processed AI voices. As audio playback progresses, the corresponding paragraph is highlighted on-screen in real time, enabling precise visual tracking. No content is sent to remote servers during processing.
CastReader 2.0 enables dual-channel learning by combining auditory input with synchronized visual feedback, which has demonstrated utility for users with reading-related learning differences. For students and researchers, it facilitates passive absorption of long-form technical content — such as academic papers from arXiv or PubMed — during commutes or low-focus activities. Professionals use it to convert industry reports, newsletters, and documentation into audio for efficient consumption without screen fatigue.
Language learners benefit from exposure to native-like pronunciation across 40+ languages while reading authentic web text. Its compatibility with AI conversation interfaces (e.g., ChatGPT, Claude, Gemini) allows users to listen to extended model outputs without manual copying or third-party tools. Additionally, the extension supports digital wellness goals by reducing prolonged screen engagement and offering an accessible alternative for visually impaired users or those managing eye strain.