Topic Tag: Flickr

home Forums Topic Tag: Flickr

 Self-Guiding Multimodal LSTM – when we do not have a perfect training dataset for image captioning

     

In this paper, a self-guiding multimodal LSTM (sg-LSTM) image captioning model is proposed to handle uncontrolled imbalanced real-world image-sentence dataset. We collect FlickrNYC dataset from Flickr as our testbed with 306,165 images and the original text descriptions uploaded by the users are ut…


 An Update to Open Images – Now with Bounding-Boxes

   

Posted by Vittorio Ferrari, Research Scientist, Machine Perception Last year we introduced Open Images, a collaborative release of ~9 million images annotated with labels spanning over 6000 object categories, designed to be a useful dataset for machine learning research. The initial release feature…