- Add English and Spanish sections - Improve project description and features - Add professional formatting
90 lines
3.1 KiB
Markdown
90 lines
3.1 KiB
Markdown
# 🎵 CBC Fácil - Document Processing Platform
|
|
|
|
## 📝 Description / Descripción
|
|
|
|
**English:**
|
|
An intelligent document processing platform that uses AI (Gemini/Claude) to analyze, summarize, and convert various document formats (PDF, audio, text) into structured knowledge bases. Integrates with Notion for knowledge management.
|
|
|
|
**Español:**
|
|
Una plataforma inteligente de procesamiento de documentos que utiliza IA (Gemini/Claude) para analizar, resumir y convertir varios formatos de documentos (PDF, audio, texto) en bases de conocimiento estructuradas. Se integra con Notion para gestión del conocimiento.
|
|
|
|
## 🚀 Features / Características
|
|
|
|
**English:**
|
|
- Multi-format document processing (PDF, MP3, WAV, TXT)
|
|
- AI-powered content analysis and summarization
|
|
- Automatic LaTeX generation from processed content
|
|
- Notion integration for knowledge base creation
|
|
- GPU acceleration support (ROCm/AMD)
|
|
- Parallel AI provider processing (Gemini + Claude)
|
|
- Telegram notifications for processing status
|
|
- WebDAV file management
|
|
|
|
**Español:**
|
|
- Procesamiento de documentos multi-formato (PDF, MP3, WAV, TXT)
|
|
- Análisis y resumen de contenido con IA
|
|
- Generación automática de LaTeX desde contenido procesado
|
|
- Integración con Notion para creación de bases de conocimiento
|
|
- Soporte de aceleración GPU (ROCm/AMD)
|
|
- Procesamiento paralelo de proveedores de IA (Gemini + Claude)
|
|
- Notificaciones Telegram para estado de procesamiento
|
|
- Gestión de archivos WebDAV
|
|
|
|
## 🛠️ Tech Stack / Stack Tecnológico
|
|
|
|
- **Language:** Python 3.11+
|
|
- **AI Providers:** Google Gemini, Anthropic Claude
|
|
- **Document Processing:** PyPDF2, SpeechRecognition, pydub
|
|
- **LaTeX Generation:** Custom templates
|
|
- **Storage:** Notion API, WebDAV
|
|
- **Notifications:** Telegram Bot API
|
|
- **GPU:** ROCm/HIP (AMD GPU support)
|
|
|
|
## 📋 Architecture / Arquitectura
|
|
|
|
```
|
|
cbcren2026/
|
|
├── api/ # REST API routes
|
|
├── core/ # Core business logic
|
|
├── document/ # Document generation (LaTeX)
|
|
├── processors/ # File processors (PDF, audio, text)
|
|
├── services/ # External service integrations
|
|
│ ├── ai/ # AI providers (Claude, Gemini)
|
|
│ └── notion/ # Notion integration
|
|
├── storage/ # Data persistence
|
|
└── config/ # Configuration management
|
|
```
|
|
|
|
## 🔧 Setup / Configuración
|
|
|
|
**English:**
|
|
1. Clone the repository
|
|
2. Install dependencies: `pip install -r requirements.txt`
|
|
3. Create `.env` file with API keys (Gemini, Claude, Notion, Telegram)
|
|
4. Run: `python main.py`
|
|
|
|
**Español:**
|
|
1. Clona el repositorio
|
|
2. Instala dependencias: `pip install -r requirements.txt`
|
|
3. Crea archivo `.env` con claves API (Gemini, Claude, Notion, Telegram)
|
|
4. Ejecuta: `python main.py`
|
|
|
|
## 🔌 API Endpoints
|
|
|
|
- `POST /api/process` - Process new documents
|
|
- `GET /api/status` - Check processing status
|
|
- `GET /api/health` - Health check
|
|
|
|
## 👨💻 Author / Autor
|
|
|
|
**Renato CBC**
|
|
- GitHub: [renato97](https://gitea.cbcren.online/renato97)
|
|
|
|
## 📄 License / Licencia
|
|
|
|
Private project / Proyecto privado
|
|
|
|
---
|
|
|
|
**Tags / Etiquetas:** `python` `ai` `document-processing` `gemini` `claude` `notion` `latex` `automation`
|