.t7 caption files #7

Open
shenkev opened this issue Oct 26, 2017 · 2 comments


shenkev commented Oct 26, 2017

Hi Scot,

Quick question about the bird dataset you're using.

I downloaded the bird dataset as per your instructions:

#####How to train a char-CNN-RNN model:
1. Download the birds and flowers data.

Inside the cvpr2016_cub/text_c10 directory, there are .t7 files, e.g. 200.Common_Yellowthroat.t7

Upon opening them, I found that each one is a 60x201x10 tensor of integers. My guess is that 60 is the number of images per species and 10 is the number of captions per image. What is the 201 dimension? Is it the vocabulary size of the captions? And what do the integers themselves represent? The values range from 0 to roughly 70, and many of them are 0.
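To make the guessed layout concrete, here is a minimal sketch of how one caption would be pulled out of such a tensor if the axes really are (image, position, caption). That axis order is only the guess above, not something confirmed by the repository, and the nested list is a synthetic stand-in for a loaded .t7 file; `make_dummy_tensor` and `get_caption` are hypothetical helper names.

```python
# ASSUMPTION: axes are (image, position, caption), as guessed in the question.
N_IMAGES, SEQ_LEN, N_CAPTIONS = 60, 201, 10


def make_dummy_tensor():
    """Build nested lists of zeros shaped [N_IMAGES][SEQ_LEN][N_CAPTIONS]."""
    return [[[0] * N_CAPTIONS for _ in range(SEQ_LEN)] for _ in range(N_IMAGES)]


def get_caption(tensor, image_idx, caption_idx):
    """Return the length-SEQ_LEN integer sequence for one caption of one image."""
    return [row[caption_idx] for row in tensor[image_idx]]


tensor = make_dummy_tensor()
tensor[3][0][7] = 20  # pretend the first symbol of image 3, caption 7 has code 20
caption = get_caption(tensor, image_idx=3, caption_idx=7)
print(len(caption), caption[:5])
```

With this layout, each (image, caption) pair yields a single sequence of 201 integers, which is the unit the question is asking about.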


GaryLMS commented Oct 28, 2017

I think 201 is the sentence length: a sentence shorter than 201 is padded with zeros, and a longer one is cut off at 201.
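The padding and cutting described here can be sketched in a few lines. This is only an illustration of the idea, not the preprocessing code that produced the files; `pad_or_truncate` is a hypothetical helper, and the default length of 201 is taken from the tensor shape in the question.

```python
def pad_or_truncate(indices, length=201, pad_value=0):
    """Return a list of exactly `length` items: cut long input, zero-pad short input."""
    clipped = indices[:length]
    return clipped + [pad_value] * (length - len(clipped))


short = pad_or_truncate([5, 3, 9], length=5)
long_ = pad_or_truncate([1, 2, 3, 4, 5, 6, 7], length=5)
print(short)  # [5, 3, 9, 0, 0]
print(long_)  # [1, 2, 3, 4, 5]
```

This would also explain why so many of the values in the files are 0: most captions are shorter than the fixed length.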


jayelm commented Feb 6, 2019

Since there are only ~70 possible values, the actual integers here seem to be character indices. Not sure what the precise mapping is. For word-level encodings see the word_c10 directory (see #8).
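If the integers are character indices, decoding would look roughly like the sketch below. The alphabet here is made up for illustration, since the real mapping is not given in this thread, and treating 0 as padding with 1-based indices is also an assumption. `ALPHABET`, `encode`, and `decode` are hypothetical names.

```python
import string

# ASSUMPTION: a hypothetical alphabet; the dataset's actual character order is unknown here.
ALPHABET = string.ascii_lowercase + string.digits + " .,'-"


def encode(text):
    """Map each known character to a 1-based index; unknown characters are dropped."""
    return [ALPHABET.index(ch) + 1 for ch in text.lower() if ch in ALPHABET]


def decode(indices):
    """Map 1-based indices back to characters, skipping 0 (assumed to be padding)."""
    return "".join(ALPHABET[i - 1] for i in indices if 1 <= i <= len(ALPHABET))


padded = encode("a small bird") + [0] * 5
print(padded)
print(decode(padded))
```

To recover real captions, `ALPHABET` would need to be replaced with whatever character order the dataset's preprocessing actually used.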
