Skip to content
View joanrod's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report joanrod

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
joanrod/README.md

Hi there 👋

Im an AI Researcher working on Multimodal Models and Vector Graphics. Visit my home page at joanrod.github.io

Followers Stars

Pinned Loading

  1. star-vector star-vector Public

    StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…

    Python 4.5k 254

  2. ocr-vqgan ocr-vqgan Public

    OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from V…

    Python 84 2

  3. figure-diffusion figure-diffusion Public

    Generating figures from research papers, using textual captions from the paper.

    Python 43 4

  4. paper2figure-dataset paper2figure-dataset Public

    Pipeline to create Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer…

    Python 10 1

  5. IntentGPT IntentGPT Public

    IntentGPT: Few-Shot Intent Discovery with Large Language Models

    Python 4