Importing function for mass categorization
Posted: Tue Jul 07, 2015 6:01 am
Long story short, an objective of mine would be much easier if I could automate the assignment of categories to an image and apply this to thousands of images. I have not found such a feature in the options, but I may have missed it; if I have, please inform me. Let me describe what I would like to do... (first, I would like to note how niche and specialized this request is, but surely other people out there have walked similar roads)
The way I imagine this feature, I declare that I wish to import an image or a folder containing many images, which brings me to a wizard or a panel. I find and select from a list all delimiters of interest (common examples would be ' ', '_', '+', etc.). There could be a list of common delimiters, or the user could input one. XnView then scans the folder/images and processes their filenames. Each image is assigned the categories obtained by splitting its filename on the selected delimiters. Example:
I declare the delimiter to be '_'
XnView scans an image called 20150703_books_library_plants_table.png
XnView could then automatically assign the categories 20150703, books, library, plants, table.
This feature would enable importing collections of photos that are already 'tagged' or categorized via filename (which, to my dismay, is something outside of my control that I have to deal with).
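To make the splitting step concrete, here is a minimal Python sketch. It is not an XnView API, just an illustration of the idea; the function name `categories_from_filename` and its parameters are made up for this example.

```python
import re
from pathlib import Path


def categories_from_filename(filename, delimiters=("_",)):
    """Split a filename (without its extension) on any of the given delimiters.

    Hypothetical helper for illustration only; XnView exposes no such function.
    """
    stem = Path(filename).stem  # drop the directory part and the extension
    if not delimiters:
        return [stem]  # nothing to split on, so the whole name is one category
    # Build a regex alternation of the escaped delimiters, e.g. "_|\+|\ "
    pattern = "|".join(re.escape(d) for d in delimiters)
    # Discard empty pieces produced by repeated or leading/trailing delimiters
    return [part for part in re.split(pattern, stem) if part]


print(categories_from_filename("20150703_books_library_plants_table.png"))
# → ['20150703', 'books', 'library', 'plants', 'table']
```

Several delimiters can be passed at once, e.g. `delimiters=("_", "+", " ")`.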
If things seem to make sense so far, hear me out. I would also give this import feature a blacklist and a whitelist. A user can tick a checkbox to use a whitelist, a blacklist or neither. Expanding on the example, if I know several pictures have 'books' in their filename and I want to get rid of that category in the automation process, I can simply add it to the blacklist. Images will not receive that category, i.e. the image ends up with the categories 20150703, library, plants, table. The whitelist works similarly, except that whitelisted strings are the only categories created.
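The blacklist/whitelist idea could be sketched like this in Python. The function name and parameters are hypothetical, and although I only picture one list being active at a time, this sketch simply applies whichever ones are given.

```python
def filter_categories(categories, blacklist=None, whitelist=None):
    """Keep or drop candidate categories according to optional lists.

    Hypothetical helper for illustration only.
    - whitelist: if given, only these strings may become categories.
    - blacklist: if given, these strings never become categories.
    """
    if whitelist is not None:
        categories = [c for c in categories if c in whitelist]
    if blacklist is not None:
        categories = [c for c in categories if c not in blacklist]
    return categories


candidates = ["20150703", "books", "library", "plants", "table"]
print(filter_categories(candidates, blacklist={"books"}))
# → ['20150703', 'library', 'plants', 'table']
print(filter_categories(candidates, whitelist={"books", "plants"}))
# → ['books', 'plants']
```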
Another potentially useful expansion would be to exclude all strings of a certain length. Why is this useful? Well, suppose the filenames contain dates or ID codes (e.g. hashes that are no longer relevant); these could be filtered out. Add the value '8' to this exclusion list and voila, all dates of the form yyyymmdd or ddmmyyyy will not appear in the categories. Along the same lines, one could extend this to strings longer or shorter than some length n...
One could also easily imagine an export feature which does the opposite, to move/rename collections of images for use outside of the XnView database... though I would rather do everything in XnView, so I don't need this.
Please let me know if my idea is really dumb, already implemented and somehow I missed it, or if something about it is unclear. Also, feel free to relabel anything in an actual implementation, as I will admit that I haven't pondered this for very long.
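As a rough Python sketch of the length filter (again, the names `exclude_by_length`, `excluded_lengths`, `min_length` and `max_length` are invented for this example and are not XnView options):

```python
def exclude_by_length(categories, excluded_lengths=(), min_length=None, max_length=None):
    """Drop candidate categories based on their string length.

    Hypothetical helper for illustration only.
    - excluded_lengths: exact lengths to drop (e.g. {8} for yyyymmdd dates).
    - min_length / max_length: optional bounds; strings outside them are dropped.
    """
    kept = []
    for c in categories:
        n = len(c)
        if n in excluded_lengths:
            continue
        if min_length is not None and n < min_length:
            continue
        if max_length is not None and n > max_length:
            continue
        kept.append(c)
    return kept


candidates = ["20150703", "books", "library", "plants", "table"]
print(exclude_by_length(candidates, excluded_lengths={8}))
# → ['books', 'library', 'plants', 'table']
```

Note that an exact-length rule would also drop any ordinary 8-letter word, so it would probably need to be combined with a whitelist in practice.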
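The reverse direction is simple enough to sketch in Python as well. The helper below is hypothetical and only builds the new filename from a list of categories; it does not touch any files or the XnView database.

```python
def filename_from_categories(categories, extension, delimiter="_"):
    """Join categories into a filename such as '20150703_books.png'.

    Hypothetical helper for illustration only.
    """
    # Accept the extension with or without a leading dot
    return delimiter.join(categories) + "." + extension.lstrip(".")


print(filename_from_categories(["20150703", "library", "plants"], "png"))
# → 20150703_library_plants.png
```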
Thanks for reading