TY - GEN
T1 - Improving object detection in paintings based on time contexts
AU - Marinescu, Maria Cristina
AU - Reshetnikov, Artem
AU - Lopez, Joaquim More
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/11
Y1 - 2020/11
N2 - This paper proposes a novel approach to object detection for the Cultural Heritage domain, which relies on combining Deep Learning and semantic metadata about candidate objects extracted from existing sources such as Wikidata, dictionaries, or Google NGram. Working with cultural heritage presents challenges not present in every-day images. In computer vision, object detection models are usually trained with datasets whose classes are not imaginary concepts, and have neither symbolic nor time-specific dimensions. Apart from this conceptual problem, the paintings are limited in number and represent the same concept in potentially very different styles. Finally, the metadata associated with the images is often poor or inexistent, which makes it hard to properly train a model. Our approach can improve the precision of object detection by placing the classes detected by a neural network model in time, based on the dates of their first known use. By taking into account the time of inception of objects such as the TV, cell phone, or scissors, and the appearance of some objects in the geographical space that corresponds to a painting (e.g. bananas or broccoli in 15th century Europe), we can correct and refine the detected objects based on their chronologic probability.
AB - This paper proposes a novel approach to object detection for the Cultural Heritage domain, which relies on combining Deep Learning and semantic metadata about candidate objects extracted from existing sources such as Wikidata, dictionaries, or Google NGram. Working with cultural heritage presents challenges not present in every-day images. In computer vision, object detection models are usually trained with datasets whose classes are not imaginary concepts, and have neither symbolic nor time-specific dimensions. Apart from this conceptual problem, the paintings are limited in number and represent the same concept in potentially very different styles. Finally, the metadata associated with the images is often poor or inexistent, which makes it hard to properly train a model. Our approach can improve the precision of object detection by placing the classes detected by a neural network model in time, based on the dates of their first known use. By taking into account the time of inception of objects such as the TV, cell phone, or scissors, and the appearance of some objects in the geographical space that corresponds to a painting (e.g. bananas or broccoli in 15th century Europe), we can correct and refine the detected objects based on their chronologic probability.
KW - Computer Vision
KW - Cultural Heritage
KW - Deep Learning
KW - Object Detection
UR - http://www.scopus.com/inward/record.url?scp=85101312401&partnerID=8YFLogxK
UR - https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=pure_univeritat_ramon_llull&SrcAuth=WosAPI&KeyUT=WOS:000657112800126&DestLinkType=FullRecord&DestApp=WOS_CPL
U2 - 10.1109/ICDMW51313.2020.00133
DO - 10.1109/ICDMW51313.2020.00133
M3 - Conference contribution
AN - SCOPUS:85101312401
T3 - IEEE International Conference on Data Mining Workshops, ICDMW
SP - 926
EP - 932
BT - Proceedings - 20th IEEE International Conference on Data Mining Workshops, ICDMW 2020
A2 - Di Fatta, Giuseppe
A2 - Sheng, Victor
A2 - Cuzzocrea, Alfredo
A2 - Zaniolo, Carlo
A2 - Wu, Xindong
PB - IEEE Computer Society
T2 - 20th IEEE International Conference on Data Mining Workshops, ICDMW 2020
Y2 - 17 November 2020 through 20 November 2020
ER -