<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.7//EN" "https://dtd.nlm.nih.gov/ncbi/pubmed/in/PubMed.dtd">
<ArticleSet>
<Article>
<Journal>
				<PublisherName>Shahid Beheshti University</PublisherName>
				<JournalTitle>Journal of Innovations in Computer Science and Engineering (JICSE)</JournalTitle>
				<Issn>2981-2135</Issn>
				<Volume>1</Volume>
				<Issue>1</Issue>
				<PubDate PubStatus="epublish">
					<Year>2023</Year>
					<Month>06</Month>
					<Day>01</Day>
				</PubDate>
			</Journal>
<ArticleTitle>A Weighted Multi-Criteria Decision Making Approach for Image Captioning</ArticleTitle>
<VernacularTitle></VernacularTitle>
			<FirstPage>38</FirstPage>
			<LastPage>51</LastPage>
			<ELocationID EIdType="pii">103526</ELocationID>
			
<ELocationID EIdType="doi">10.48308/jicse.2023.103526</ELocationID>
			
			<Language>EN</Language>
<AuthorList>
<Author>
					<FirstName>Hassan</FirstName>
					<LastName>Maleki Golandouz</LastName>
<Affiliation>Faculty of Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran</Affiliation>

</Author>
<Author>
					<FirstName>Mohsen</FirstName>
					<LastName>Ebrahimi Moghaddam</LastName>
<Affiliation>Faculty of Computer Science and Engineering, Shahid Beheshti University G.C, Tehran, Iran,</Affiliation>

</Author>
<Author>
					<FirstName>Mehrnoush</FirstName>
					<LastName>Shamsfard</LastName>
<Affiliation>Faculty of Computer Science and Engineering, Shahid Beheshti University G.C, Tehran, Iran</Affiliation>

</Author>
</AuthorList>
				<PublicationType>Journal Article</PublicationType>
			<History>
				<PubDate PubStatus="received">
					<Year>2022</Year>
					<Month>08</Month>
					<Day>23</Day>
				</PubDate>
			</History>
		<Abstract>Image captioning aims at automatically generating description of an image in natural language. This is a challenging problem in the field of artificial intelligence that has recently received significant attention in the computer vision and natural language processing. Among the existing approaches, visual retrieval based methods have been shown to be highly effective. These approaches search for similar images, then build a caption for the query image based on the captions of the retrieved images. In this study, we present a method for visual retrieval based image captioning, in which we use a multi criteria decision making algorithm to effectively combine several criteria with proportional impact weights to retrieve the most relevant caption for the query image. The main idea of the proposed approach is to design a mechanism to retrieve more semantically relevant captions with the query image and then selecting the most appropriate caption by imitation of the human act based on a weighted multi-criteria decision making algorithm. Experiments conducted on MS COCO benchmark dataset have shown that proposed method provides much more effective results compared to the state-of-the-art models.</Abstract>
		<ObjectList>
			<Object Type="keyword">
			<Param Name="value">Image Captioning</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Machine Vision</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Natural Language Processing</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Multi-criteria Decision Making</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Transfer-based Approaches</Param>
			</Object>
		</ObjectList>
<ArchiveCopySource DocType="pdf">https://jicse.sbu.ac.ir/article_103526_2948fd856c64a977215a2b00a2ca48ab.pdf</ArchiveCopySource>
</Article>
</ArticleSet>
