Latest Twitter Threads by @dr_cintas on Thread Reader App

Apr 25 • 10 tweets • 4 min read

What a crazy week in AI 🤯

- Grok’s AI Vision
- Genspark AI Slides
- Perplexity Assistant
- OpenAI gpt-image-1
- Tavus SoTA lipsync model
- Dia SoTA speech AI model
- Dreamina AI's Top AI Image
- ChatGPT Deep Research Mini

Here's EVERYTHING you need to know: 1. Grok Vision launches with multimodal capabilities, letting users point their phone cameras at objects or environments for real-time analysis.

This comes alongside multilingual audio support and real-time search capabilities.

Apr 24 • 4 tweets • 2 min read

This is such a powerful AI coding workflow.

You can now use Cursor or Windsurf for coding, and let CodeRabbit’s AI agent refactor messy code, help you debug, and find security vulnerabilities.

Here’s how:

1. Visit , click "Sign in with GitHub”, and authorize CodeRabbit coderabbit.ai

Apr 23 • 12 tweets • 4 min read

Genspark might be the best AI agent I’ve tried yet.

It can agentically conduct research, create web pages, generate videos, and even have the AI call and make reservations for you.

10 powerful use cases:

2. AI Slides (@genspark_ai just launched this)

This Super Agent conducts research and creates polished slides and icons, and users can also edit them.

To try it out, you can go here: genspark.ai

Apr 23 • 8 tweets • 3 min read

🚨BREAKING: Top AI image model just launched.

It’s called Seedream 3.0 by Dreamina AI and it ranks #1 at creating photorealistic images up to 2k resolution.

Dreamina AI can also upscale, inpaint, expand and even generate videos.

It’s now fully available. Link to try free below

This is the link to access @dreamina_ai:

First, select where it says “Image Generator” dreamina.capcut.com/ai-tool/home/?…

Apr 21 • 11 tweets • 4 min read

AI videos are getting scary good.

So I tested some of the hardest prompts for all the major leading models:

• Kling 2.0
• Sora
• Runway Gen-4
• Google Veo 2

10 side-by-side examples:

2. A lion driving an open jeep in the tanzania safari

Apr 17 • 12 tweets • 4 min read

What a crazy week in AI 🤯

- Kling 2.0 AI video
- Canva Visual Suite 2.0
- Microsoft Copilot Vision
- Grok Studio and Memories
- ChatGPT 4.1, o3, & o4-mini
- OpenAI’s new coding agent
- ByteDance Seaweed AI video
- Claude Autonomous Research

Here’s EVERYTHING you need to know: 1. Kling AI released its new model Kling 2.0.

This launch features improved prompt understanding, enhanced character motion dynamics for more natural fluid movements, and a Multi-Elements Editor for easier video editing.

Apr 14 • 6 tweets • 3 min read

🚨 BREAKING: OpenAI just launched the GPT-4.1 family of models.

New benchmarks, bigger context windows, and the first-ever nano model.

Here’s everything you need to know:

OpenAI is rolling out 3 new models via API:

- GPT‑4.1
- GPT‑4.1 mini
- GPT‑4.1 nano

Each one beats GPT-4o and GPT-4o mini across the board, especially on coding, instruction following, and long-context tasks.

Apr 11 • 10 tweets • 3 min read

What a wild week in AI 🤯

- Google AI Agents
- Meta Llama 4 models
- AI 2027 forecast report
- Amazon AI Voice model
- Gemini 2.5 Deep Research
- ChatGPT memory upgrade
- Firebase Studio rivals Cursor
- Nvidia/Stanford 1-min AI cartoons

Here’s everything you need to know: 1. Google introduces Agent2Agent (A2A) protocol for AI interoperability.

It enables AI agents from different vendors to communicate and collaborate seamlessly.

Apr 9 • 5 tweets • 2 min read

🚨 BREAKING: Google just announced Agent2Agent.

This protocol enables AI agents to communicate across platforms regardless of framework or vendor.

Here’s how it works:

A2A facilitates communication between "client" and "remote" agents through four key capabilities:

Secure Collaboration, Task Management, User Experience Negotiation, and Capability Discovery

All built popular standards like HTTP, JSON-RPC standards with enterprise auth.

Apr 4 • 10 tweets • 3 min read

What a wild week in AI 🤯

- Midjourney v7
- Runway Gen-4
- Apple AI Health Coach
- Lindy AI Agent Swamps
- Amazon Browser Agent
- LLMs passing the Turing Test
- Meta MoCha AI talking characters
- AI brain signal to speech breakthrough

Here’s everything you need to know: Midjourney just released v7, the latest upgrade to the AI art generator loved by creatives.

Its new “Draft Mode” slashes costs by 50% and speeds up generation 10x, perfect for quick sketches.

https://twitter.com/midjourney/status/1908012961840672947

Mar 27 • 14 tweets • 5 min read

What a wild week in AI 🤯

- Reve Image
- Ideogram 3.0
- Qwen new models
- ARC-AGI-2 launch
- Alibaba LHM model
- Microsoft Researcher
- Google Gemini 2.5 Pro
- Perplexity Answer Tabs
- DeepSeek’s V3 AI model
- OpenAI’s Image Generator

Here’s everything you need to know: 1. Reve has launched Reve Image 1.0.

Fresh out of stealth, it has claimed the top spot in global image model rankings and outperforming big names like Midjourney and Google’s Imagen.

It provides stunning photorealism, best-in-class prompt accuracy, and wild text rendering.

https://twitter.com/reveimage/status/1904211082870456824

Mar 26 • 11 tweets • 3 min read

OpenAI’s native Al image generation is insane.

The image and text quality are so good that it has unlocked unlimited possibilities.

10 crazy use cases:

1. Thumbnail maker

2. Create product marketing images

Mar 24 • 6 tweets • 3 min read

AI photoshoots are taking over e-comm and fashion.

You can now upload a product design, choose a model, and generate a fashion photoshoot for your idea like this.

Here's how:

1. Go to and sign up for a free account htch.ai/HHbmhOE

Mar 20 • 11 tweets • 4 min read

What a wild week in AI 🤯

- Mistral Small 3.1
- Claude Web Search
- OpenAI Audio Models
- Krea AI Video Training
- NotebookLM Mind Maps
- Hunyuan 3D Generation AI
- Stability AI New Virtual Camera
- Gemini Canvas & Audio Overview

Here’s everything you need to know: 1. Mistral AI has released Mistral Small 3.1

A 24B open-source model that outperforms Google’s Gemma 3 and OpenAI’s GPT-4o Mini in key benchmarks.

It supports multimodal inputs, handles up to 128k tokens in context, and processes 150 tokens per second for high efficiency.

https://twitter.com/mistralai/status/1901668499832918151

Mar 17 • 12 tweets • 4 min read

Google's Gemini native Al image generation is insane.

You can now generate or edit photos with just plain text and completely free.

10 crazy use cases:

1. Thumbnail optimizer

2. Professional look

Mar 15 • 6 tweets • 3 min read

You can now add Deep Research to your AI code editors.

Simply add the new Firecrawl MCP with Deep Research, and it will autonomously explore the web, and extract the latest findings for your code projects.

Here’s how:

First, you are going to need an API key from Firecrawl. You can get one to try free here: firecrawl.dev

Mar 13 • 11 tweets • 4 min read

What a wild week in AI 🤯

- Google’s Gemma 3
- Luma AI Ray2 Flash
- Reka Flash 3 Reasoning
- Hunyuan-TurboS model
- OpenAI’s Building Agents
- Gemini Native Image Editing
- Hedra Character 3 Omnimodal
- Freepik & Veo 2 Image to Video

Here’s everything you need to know: 1. Google has released Gemma 3, a family of open-source AI models built from Gemini 2.0 technology.

It comes in sizes from 1B to 27B parameters, offering a 128K-token context window and multimodal support.

https://twitter.com/googleaidevs/status/1899725682545967555

Mar 9 • 8 tweets • 3 min read

You can connect Claude to the Internet using MCPs.

I’ve been using an MCP server that connects to the Brave Search API for web searches with Claude 3.7 Sonnet.

Here’s how:

First, head over to:

And download the latest desktop version. claude.ai/download

Mar 6 • 12 tweets • 4 min read

What a wild week in AI 🤯

- Mistral OCR
- Google’s AI Mode
- Windsurf Previews
- Anthropic Console
- ChatGPT Edit in IDEs
- Microsoft Dragon Copilot
- HunyuanVideo I2V Model
- Sesame Realistic AI Voices
- Alibaba releases QwQ-32B

Here’s everything you need to know: 1. Mistral AI has launched Mistral OCR, a new API designed for document understanding.

This tool extracts text from images and PDFs with high accuracy, making it ideal for use with RAG systems.

It’s priced at 1000 pages per dollar and is already available.

https://twitter.com/sophiamyang/status/1897713370029068381

Mar 3 • 5 tweets • 2 min read

You can now clone any website just by writing a prompt.

Simply add the new Firecrawl MCP server to your favorite AI coding tool for improved web data extraction, and let Claude code it for you.

Here’s how:

First, you are going to need an API key from Firecrawl. You can get one to try free here: firecrawl.dev

Mar 1 • 5 tweets • 2 min read

Real-time AI voice agents for businesses are here.

I created this AI assistant in minutes with no code and used my cloned voice to help handle customers at a fictitious store.

Plus, there’s now a marketplace with ready-made templates where you can sell yours.

First, head over to:

Create an AI voice agent by adding its name, a voice, and instructions of how it should behave via the “Prompt” section. synthflow.ai

Share this page!

Enter URL or ID to Unroll