captylize

Easily extendable API for prototyping with ML models to analyze and generate content from images.

Introduction

Note: This project is under active development. Expect breaking changes. Use at your own risk.

Captylize is a simple API designed to facilitate easy prototyping of Hugging Face models and other image analysis models. It provides a straightforward interface for analyzing images with the goal of:

Analyzing images (Using classification for e.g. age, emotion, nsfw)
Generating captions for images (for tagging datasets or building prompts)
Detecting objects and faces in images (coming soon™)

Features

Image captioning using VIT-GPT2 and Florence-2 models
Age estimation
Emotion detection
NSFW content detection
Support for both image URL and file upload inputs
Easy-to-use REST API endpoints

For information about the models used, see the MODELS.md file.

Todo

Installation

It is recommended to use a virtual environment to install the project dependencies.

First create and activate a virtual environment with python 3.11 or later.

Then install PyTorch 2 or later (project is developed using PyTorch 2.4.1)

Use the instructions on the PyTorch website to install the CPU or GPU version.

Then install the project dependencies:

With pip:

pip install -r requirements.txt

With poetry:

poetry install

Usage

To run the API locally (in development mode)

Execute the run_dev.sh script.

Or the command:

uvicorn captylize.main:app --reload

Basic usage examples:

Image Captioning:

POST /api/v1/generations/captions/vit
POST /api/v1/generations/captions/florence-2

Age Estimation:
```
POST /api/v1/analyses/ages
```
Emotion Detection:
```
POST /api/v1/analyses/emotions
```
NSFW Detection:
```
POST /api/v1/analyses/nsfw
```
Object Detection (coming soon™):
```
POST /api/v1/detections/objects
```
Face Detection (coming soon™):
```
POST /api/v1/detections/faces
```

For detailed API documentation, run the server and visit /docs or /redoc.

License

This project and code within is licensed under the MIT License. Models referenced in this project are licensed under their respective licenses.

Name		Name	Last commit message	Last commit date
Latest commit History 118 Commits
captylize		captylize
.gitignore		.gitignore
LICENSE		LICENSE
MODELS.md		MODELS.md
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
run_dev.sh		run_dev.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

captylize

Introduction

Features

Todo

Installation

Usage

License

About

Releases

Languages

License

KianBay/captylize

Folders and files

Latest commit

History

Repository files navigation

captylize

Introduction

Features

Todo

Installation

Usage

License

About

Resources

License

Stars

Watchers

Forks

Releases

Languages