# quynhanh12345/ImageCaptioning

## Encoder-Decoder Architecture

I used transfer learning to generate image features: each image is passed through a pre-trained Inception-v3 model (the encoder). Caption sequences are then generated by feeding the text through a word-embedding layer and passing the resulting vectors into a long short-term memory (LSTM) layer (the decoder).

## Data

Flickr8k dataset: https://www.kaggle.com/hsankesara/flickr-image-dataset
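The decoder mechanics above (token → embedding → LSTM state update, with the image features seeding the state) can be sketched in plain NumPy. This is a minimal illustration, not the repository's actual Keras code: the dimensions, the feature-projection initialisation, and the token ids are all hypothetical placeholders.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes; the README does not specify the real model's dimensions.
vocab_size, embed_dim, hidden_dim = 50, 8, 16
feat_dim = 2048  # pooled Inception-v3 feature size

# Word-embedding table: token id -> dense vector.
E = rng.normal(0, 0.1, (vocab_size, embed_dim))

# LSTM weights for all four gates (input, forget, cell, output),
# stacked along the output axis for brevity.
W = rng.normal(0, 0.1, (embed_dim, 4 * hidden_dim))
U = rng.normal(0, 0.1, (hidden_dim, 4 * hidden_dim))
b = np.zeros(4 * hidden_dim)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c):
    """One LSTM step: gate activations from embedded token x and state (h, c)."""
    z = x @ W + h @ U + b
    i, f, g, o = np.split(z, 4)
    i, f, o = sigmoid(i), sigmoid(f), sigmoid(o)
    c_new = f * c + i * np.tanh(g)       # updated cell state
    h_new = o * np.tanh(c_new)           # updated hidden state
    return h_new, c_new

# The encoder's image features would initialise the decoder state; here
# random stand-in "features" are projected down to hidden_dim.
img_feat = rng.normal(0, 1, feat_dim)
W_init = rng.normal(0, 0.01, (feat_dim, hidden_dim))
h = np.tanh(img_feat @ W_init)
c = np.zeros(hidden_dim)

# Feed a caption prefix token by token through embedding + LSTM.
caption_ids = [1, 7, 23, 4]  # hypothetical token ids
for t in caption_ids:
    h, c = lstm_step(E[t], h, c)

print(h.shape)  # final hidden state, one vector per LSTM unit
```

In the full model, each hidden state `h` would additionally pass through a dense softmax layer over the vocabulary to predict the next caption word; training then minimises cross-entropy against the ground-truth Flickr8k captions.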