This module creates captions from images in a folder based on the folder names and the name of the image file.
>>> from captionizer import caption_from_path
In the following examples we assume that:
- Token is 'randoguy'
- Class for the token is 'person'
- The base of our test data is
/sd/data
>>> class_token = 'person'
>>> token = 'randoguy'
>>> base_path = '/sd/data'
In the simplest case - if a file is in the root of the folder - it will behave the same way as it does today. So:
>>> print(caption_from_path('/sd/data/img-001.jpg', base_path, class_token, token))
randoguy person
Next up - if you have an image file in /sd/data/joepenna/img-001.jpg
your token, if
passed, will be ignored and joepenna is going to be used to create the caption.
>>> print(caption_from_path('/sd/data/joepenna/img-001.jpg', base_path, class_token, token))
joepenna person
One deeper: if you have an image file in /sd/data/joepenna/person/img-001.jpg
things get
more interesting: the first part is used as a token the second as a class, so you get:
>>> print(caption_from_path('/sd/data/joepenna/man/img-001.jpg', base_path, class_token, token))
joepenna man
This allows you to train the same subject under different classes (man, person, biped)
Finally: you can customize the caption by adding an annotation to the filename, so:
/sd/data/joepenna/person/img-001@a_picture_of_S.jpg
would look like:
>>> print(caption_from_path('/sd/data/joepenna/person/img-001@a_picture_of_S.jpg', base_path, class_token, token))
a picture of joepenna
You can see that 'S' was automatically converted to the subject token. I can hear you scream: 'But what if I
want to have an "S" in my caption?' - well: tough luck! And wait until I tell you that I also eating all your
Cs! Yes: if you have a file in /sd/data/joepenna/dude/img-001@S_the_C_hanging_out_by_the_pool.jpg
then
your caption becomes:
>>> filename = '/sd/data/joepenna/dude/img-001@S_the_C_hanging_out_by_the_pool.jpg'
>>> print(caption_from_path(filename, base_path, class_token, token))
joepenna the dude hanging out by the pool
But wait! There is more!
If you don't want to caption all of your images -- who has the time anyhow, right? -- you can have
something along these lines: /sd/data/joepenna/dude/a_picture_of_S_being_a_C/img-001@as_a_C_can_be.jpg
to get:
>>> filename = '/sd/data/joepenna/dude/a_picture_of_S_being_a_C/img-001@as_a_C_can_be.jpg'
>>> print(caption_from_path(filename, base_path, class_token, token))
a picture of joepenna being a dude as a dude can be
All the examples show and underscore - _
- in the filename instead of a space. You can use
regular spaces if your operative system supports it. So:
>>> filename = '/sd/data/joepenna/dude/a picture of S being a C/img-001@as a C can be.jpg'
>>> print(caption_from_path(filename, base_path, class_token, token))
a picture of joepenna being a dude as a dude can be
If you used one of the tokens as part of the path to the file, the tokens will not be added to the caption, so
if you have /sd/data/joepenna/dude/a_photo_of_S_milling_about/img-001.jpg
, the result will be:
>>> filename = '/sd/data/joepenna/dude/a_photo_of_S_milling_about/img-001.jpg'
>>> print(caption_from_path(filename, base_path, class_token, token))
a photo of joepenna milling about
Finally - I promise - you can use captions directly in the old fashioned let's dump all of our
images in the same bucket method, so /sd/data/img-001@S_being_a_handsome_C.jpg
would spit out:
>>> filename = '/sd/data/img-001@S_being_a_handsome_C.jpg'
>>> print(caption_from_path(filename, base_path, class_token, token))
randoguy being a handsome person
Now that we have a more complex structure you can see that listing the files in a folder is no longer enough, we need to be able to recursively find them given a root data folder. Lucky us:
from captionizer import find_images
print(find_images('/sd/data'))
If you want to see what all of this sums up to, you can run:
captionizer <folder with data>
and it will spit out a list of the files it found and the captions generated for them.
You what? Really? You don't trust me?? OK:
python3 captionizer/run_tests.py
If you have your own set of tokens to replace - or if you want better tokens than the ones I picked -
you can use the generic_captions_from_path
method. the method allows to setup your own mappings.
This is an example of usage:
>>> from captionizer import generic_captions_from_path
>>> from typing import OrderedDict
>>> filename = '/sd/data/img-001@X_is_better_than_S.jpg'
>>> tokens = OrderedDict([('X', 'Mine'), ('S', 'Yours')])
>>> print(generic_captions_from_path(filename, base_path, tokens))
Mine is better than Yours