Language-Oriented Semantic Latent Representation for Image Transmission

Language-Oriented Semantic Latent Representation for Image Transmission

Giordano Cicchetti, Eleonora Grassucci, Jihong Park,Jinho Choi Sergio Barbarossa,Danilo Comminiello

This is the official implementation of the paper: Language-Oriented Semantic Latent Representation for Image Transmission

News

June 17, 2024 Code Released

Requirements

Create a dedicated conda environment:

conda env -n SemanticI2I python=3.9
conda activate SemanticI2I

You can clone the repository by typing:


git clone https://github.com/ispamm/Img2Img-SC.git
cd Img2Img-SC

You can update an existing latent diffusion environment by running

conda install pytorch==1.12.1 torchvision==0.13.1 -c pytorch
pip install transformers==4.19.2 diffusers invisible-watermark
pip install -e .

After that you can install the remaining required packages by running:


pip install -r requirements.txt

Download pretraining models

Download pretrained checkpoints and copy them into the "/checkpoints" folder.

Img2Img

The scripts are located in the "/scripts" folder.

/scripts/semantic_i2i.py refers to the proposed I2I framework that uses latent embedding and image caption. /scripts/semantic_t2i.py refers to the I2I framework that uses only image caption.

For testing the img2img framework, change the model and configuration paths inside script files and then use:

python /scripts/semantic_i2i.py

#Or

python /scripts/semantic_t2i.py

and adapt the checkpoint and config paths accordingly.

Results

License

The code in this repository is released under the MIT License.

Acknowledgment

Most of the code contained in this repository is based on Stable diffusion repository https://github.com/Stability-AI/stablediffusion

BibTeX

@misc{cicchetti2024languageoriented,
      title={Language-Oriented Semantic Latent Representation for Image Transmission}, 
      author={Giordano Cicchetti and Eleonora Grassucci and Jihong Park and Jinho Choi and Sergio Barbarossa and Danilo Comminiello},
      year={2024},
      eprint={2405.09976},
      archivePrefix={arXiv},
      primaryClass={id='cs.CV' full_name='Computer Vision and Pattern Recognition' is_active=True alt_name=None in_archive='cs' is_general=False description='Covers image processing, computer vision, pattern recognition, and scene understanding. Roughly includes material in ACM Subject Classes I.2.10, I.4, and I.5.'}
}

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
configs/stable-diffusion		configs/stable-diffusion
ldm		ldm
samples_paper		samples_paper
scripts		scripts
stable_diffusion.egg-info		stable_diffusion.egg-info
.gitignore		.gitignore
LICENSE		LICENSE
LICENSE-MODEL		LICENSE-MODEL
README.md		README.md
environment.yaml		environment.yaml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

Language-Oriented Semantic Latent Representation for Image Transmission

News

Requirements

Download pretraining models

Img2Img

Results

License

Acknowledgment

BibTeX

About

Licenses found

Releases

Packages

Languages

License

Licenses found

ispamm/Img2Img-SC

Folders and files

Latest commit

History

Repository files navigation

Language-Oriented Semantic Latent Representation for Image Transmission

News

Requirements

Download pretraining models

Img2Img

Results

License

Acknowledgment

BibTeX

About

Resources

License

Licenses found

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages