Intelligent bar chart plagiarism detection in documents.
Internal identifier: 000011 (PubMed/Curation); previous: 000010; next: 000012
Authors: Mohammed Mumtaz Al-Dabbagh [Iraq]; Naomie Salim [Malaysia]; Amjad Rehman [Saudi Arabia]; Mohammed Hazim Alkawaz [Iraq]; Tanzila Saba [Saudi Arabia]; Mznah Al-Rodhaan [Saudi Arabia]; Abdullah Al-Dhelaan [Saudi Arabia]
Source:
- TheScientificWorldJournal [ 1537-744X ] ; 2014.
English descriptors
- KwdEn :
- MESH :
Abstract
This paper presents a novel features mining approach from documents that could not be mined via optical character recognition (OCR). By identifying the intimate relationship between the text and graphical components, the proposed technique pulls out the Start, End, and Exact values for each bar. Furthermore, the word 2-gram and Euclidean distance methods are used to accurately detect and determine plagiarism in bar charts.
DOI: 10.1155/2014/612787
PubMed: 25309952
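The abstract names two comparison measures, word 2-grams and Euclidean distance, without giving details. The following is a minimal sketch of how such measures are commonly computed, not the authors' implementation. It assumes that chart text (titles, axis labels) is compared by the Jaccard overlap of word 2-gram sets and that the extracted bar values are compared as equal-length numeric vectors. The function names and the choice of Jaccard overlap are illustrative assumptions.

```python
import math


def word_bigrams(text):
    """Return the set of adjacent word pairs (word 2-grams) in a text."""
    words = text.lower().split()
    return set(zip(words, words[1:]))


def bigram_similarity(text_a, text_b):
    """Jaccard overlap of word 2-gram sets; 1.0 means identical sets.

    Jaccard is an assumed overlap measure; the abstract does not name one.
    """
    a, b = word_bigrams(text_a), word_bigrams(text_b)
    if not a and not b:
        return 1.0  # two texts with no 2-grams are treated as identical
    return len(a & b) / len(a | b)


def euclidean_distance(values_a, values_b):
    """Euclidean distance between two equal-length lists of bar values."""
    if len(values_a) != len(values_b):
        raise ValueError("bar value lists must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(values_a, values_b)))
```

Under these assumptions, a high 2-gram similarity between chart labels together with a small distance between bar values would flag a chart pair for closer inspection. Any decision threshold would have to be chosen empirically.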
Links toward previous steps (curation, corpus...)
- to stream PubMed, to step Corpus: To go to this record in the Curation step: 000011
Links to Exploration step
pubmed:25309952
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Intelligent bar chart plagiarism detection in documents.</title>
<author><name sortKey="Al Dabbagh, Mohammed Mumtaz" sort="Al Dabbagh, Mohammed Mumtaz" uniqKey="Al Dabbagh M" first="Mohammed Mumtaz" last="Al-Dabbagh">Mohammed Mumtaz Al-Dabbagh</name>
<affiliation wicri:level="1"><nlm:affiliation>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul, Iraq.</nlm:affiliation>
<country xml:lang="fr">Iraq</country>
<wicri:regionArea>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Salim, Naomie" sort="Salim, Naomie" uniqKey="Salim N" first="Naomie" last="Salim">Naomie Salim</name>
<affiliation wicri:level="1"><nlm:affiliation>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia.</nlm:affiliation>
<country xml:lang="fr">Malaisie</country>
<wicri:regionArea>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Rehman, Amjad" sort="Rehman, Amjad" uniqKey="Rehman A" first="Amjad" last="Rehman">Amjad Rehman</name>
<affiliation wicri:level="1"><nlm:affiliation>MIS Department, CBA, Salman Bin Abdulaziz University, Alkharj, Saudi Arabia.</nlm:affiliation>
<country xml:lang="fr">Arabie saoudite</country>
<wicri:regionArea>MIS Department, CBA, Salman Bin Abdulaziz University, Alkharj</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Alkawaz, Mohammed Hazim" sort="Alkawaz, Mohammed Hazim" uniqKey="Alkawaz M" first="Mohammed Hazim" last="Alkawaz">Mohammed Hazim Alkawaz</name>
<affiliation wicri:level="1"><nlm:affiliation>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul, Iraq.</nlm:affiliation>
<country xml:lang="fr">Iraq</country>
<wicri:regionArea>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Saba, Tanzila" sort="Saba, Tanzila" uniqKey="Saba T" first="Tanzila" last="Saba">Tanzila Saba</name>
<affiliation wicri:level="1"><nlm:affiliation>College of Computer and Information Sciences (CCIS), Prince Sultan University, Riyadh, Saudi Arabia.</nlm:affiliation>
<country xml:lang="fr">Arabie saoudite</country>
<wicri:regionArea>College of Computer and Information Sciences (CCIS), Prince Sultan University, Riyadh</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Al Rodhaan, Mznah" sort="Al Rodhaan, Mznah" uniqKey="Al Rodhaan M" first="Mznah" last="Al-Rodhaan">Mznah Al-Rodhaan</name>
<affiliation wicri:level="1"><nlm:affiliation>Computer Science Department, College of Computer &amp; Information Sciences, King Saud University, Riyadh, Saudi Arabia.</nlm:affiliation>
<country xml:lang="fr">Arabie saoudite</country>
<wicri:regionArea>Computer Science Department, College of Computer &amp; Information Sciences, King Saud University, Riyadh</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Al Dhelaan, Abdullah" sort="Al Dhelaan, Abdullah" uniqKey="Al Dhelaan A" first="Abdullah" last="Al-Dhelaan">Abdullah Al-Dhelaan</name>
<affiliation wicri:level="1"><nlm:affiliation>Computer Science Department, College of Computer &amp; Information Sciences, King Saud University, Riyadh, Saudi Arabia.</nlm:affiliation>
<country xml:lang="fr">Arabie saoudite</country>
<wicri:regionArea>Computer Science Department, College of Computer &amp; Information Sciences, King Saud University, Riyadh</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2014">2014</date>
<idno type="doi">10.1155/2014/612787</idno>
<idno type="RBID">pubmed:25309952</idno>
<idno type="pmid">25309952</idno>
<idno type="wicri:Area/PubMed/Corpus">000011</idno>
<idno type="wicri:Area/PubMed/Curation">000011</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Intelligent bar chart plagiarism detection in documents.</title>
<author><name sortKey="Al Dabbagh, Mohammed Mumtaz" sort="Al Dabbagh, Mohammed Mumtaz" uniqKey="Al Dabbagh M" first="Mohammed Mumtaz" last="Al-Dabbagh">Mohammed Mumtaz Al-Dabbagh</name>
<affiliation wicri:level="1"><nlm:affiliation>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul, Iraq.</nlm:affiliation>
<country xml:lang="fr">Iraq</country>
<wicri:regionArea>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Salim, Naomie" sort="Salim, Naomie" uniqKey="Salim N" first="Naomie" last="Salim">Naomie Salim</name>
<affiliation wicri:level="1"><nlm:affiliation>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia.</nlm:affiliation>
<country xml:lang="fr">Malaisie</country>
<wicri:regionArea>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Rehman, Amjad" sort="Rehman, Amjad" uniqKey="Rehman A" first="Amjad" last="Rehman">Amjad Rehman</name>
<affiliation wicri:level="1"><nlm:affiliation>MIS Department, CBA, Salman Bin Abdulaziz University, Alkharj, Saudi Arabia.</nlm:affiliation>
<country xml:lang="fr">Arabie saoudite</country>
<wicri:regionArea>MIS Department, CBA, Salman Bin Abdulaziz University, Alkharj</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Alkawaz, Mohammed Hazim" sort="Alkawaz, Mohammed Hazim" uniqKey="Alkawaz M" first="Mohammed Hazim" last="Alkawaz">Mohammed Hazim Alkawaz</name>
<affiliation wicri:level="1"><nlm:affiliation>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul, Iraq.</nlm:affiliation>
<country xml:lang="fr">Iraq</country>
<wicri:regionArea>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Saba, Tanzila" sort="Saba, Tanzila" uniqKey="Saba T" first="Tanzila" last="Saba">Tanzila Saba</name>
<affiliation wicri:level="1"><nlm:affiliation>College of Computer and Information Sciences (CCIS), Prince Sultan University, Riyadh, Saudi Arabia.</nlm:affiliation>
<country xml:lang="fr">Arabie saoudite</country>
<wicri:regionArea>College of Computer and Information Sciences (CCIS), Prince Sultan University, Riyadh</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Al Rodhaan, Mznah" sort="Al Rodhaan, Mznah" uniqKey="Al Rodhaan M" first="Mznah" last="Al-Rodhaan">Mznah Al-Rodhaan</name>
<affiliation wicri:level="1"><nlm:affiliation>Computer Science Department, College of Computer &amp; Information Sciences, King Saud University, Riyadh, Saudi Arabia.</nlm:affiliation>
<country xml:lang="fr">Arabie saoudite</country>
<wicri:regionArea>Computer Science Department, College of Computer &amp; Information Sciences, King Saud University, Riyadh</wicri:regionArea>
</affiliation>
</author>
<author><name sortKey="Al Dhelaan, Abdullah" sort="Al Dhelaan, Abdullah" uniqKey="Al Dhelaan A" first="Abdullah" last="Al-Dhelaan">Abdullah Al-Dhelaan</name>
<affiliation wicri:level="1"><nlm:affiliation>Computer Science Department, College of Computer &amp; Information Sciences, King Saud University, Riyadh, Saudi Arabia.</nlm:affiliation>
<country xml:lang="fr">Arabie saoudite</country>
<wicri:regionArea>Computer Science Department, College of Computer &amp; Information Sciences, King Saud University, Riyadh</wicri:regionArea>
</affiliation>
</author>
</analytic>
<series><title level="j">TheScientificWorldJournal</title>
<idno type="eISSN">1537-744X</idno>
<imprint><date when="2014" type="published">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Algorithms</term>
<term>Artificial Intelligence</term>
<term>Computer Graphics</term>
<term>Data Mining (methods)</term>
<term>Humans</term>
<term>Pattern Recognition, Automated (methods)</term>
<term>Plagiarism</term>
<term>Semantics</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en"><term>Data Mining</term>
<term>Pattern Recognition, Automated</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Algorithms</term>
<term>Artificial Intelligence</term>
<term>Computer Graphics</term>
<term>Humans</term>
<term>Plagiarism</term>
<term>Semantics</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">This paper presents a novel features mining approach from documents that could not be mined via optical character recognition (OCR). By identifying the intimate relationship between the text and graphical components, the proposed technique pulls out the Start, End, and Exact values for each bar. Furthermore, the word 2-gram and Euclidean distance methods are used to accurately detect and determine plagiarism in bar charts.</div>
</front>
</TEI>
<pubmed><MedlineCitation Owner="NLM" Status="MEDLINE"><PMID Version="1">25309952</PMID>
<DateCreated><Year>2014</Year>
<Month>10</Month>
<Day>13</Day>
</DateCreated>
<DateCompleted><Year>2015</Year>
<Month>05</Month>
<Day>26</Day>
</DateCompleted>
<DateRevised><Year>2015</Year>
<Month>10</Month>
<Day>29</Day>
</DateRevised>
<Article PubModel="Print-Electronic"><Journal><ISSN IssnType="Electronic">1537-744X</ISSN>
<JournalIssue CitedMedium="Internet"><Volume>2014</Volume>
<PubDate><Year>2014</Year>
</PubDate>
</JournalIssue>
<Title>TheScientificWorldJournal</Title>
<ISOAbbreviation>ScientificWorldJournal</ISOAbbreviation>
</Journal>
<ArticleTitle>Intelligent bar chart plagiarism detection in documents.</ArticleTitle>
<Pagination><MedlinePgn>612787</MedlinePgn>
</Pagination>
<ELocationID EIdType="doi" ValidYN="Y">10.1155/2014/612787</ELocationID>
<Abstract><AbstractText>This paper presents a novel features mining approach from documents that could not be mined via optical character recognition (OCR). By identifying the intimate relationship between the text and graphical components, the proposed technique pulls out the Start, End, and Exact values for each bar. Furthermore, the word 2-gram and Euclidean distance methods are used to accurately detect and determine plagiarism in bar charts.</AbstractText>
</Abstract>
<AuthorList CompleteYN="Y"><Author ValidYN="Y"><LastName>Al-Dabbagh</LastName>
<ForeName>Mohammed Mumtaz</ForeName>
<Initials>MM</Initials>
<AffiliationInfo><Affiliation>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul, Iraq.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Salim</LastName>
<ForeName>Naomie</ForeName>
<Initials>N</Initials>
<AffiliationInfo><Affiliation>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Rehman</LastName>
<ForeName>Amjad</ForeName>
<Initials>A</Initials>
<Identifier Source="ORCID">0000-0002-3817-2655</Identifier>
<AffiliationInfo><Affiliation>MIS Department, CBA, Salman Bin Abdulaziz University, Alkharj, Saudi Arabia.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Alkawaz</LastName>
<ForeName>Mohammed Hazim</ForeName>
<Initials>MH</Initials>
<AffiliationInfo><Affiliation>Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul, Iraq.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Saba</LastName>
<ForeName>Tanzila</ForeName>
<Initials>T</Initials>
<AffiliationInfo><Affiliation>College of Computer and Information Sciences (CCIS), Prince Sultan University, Riyadh, Saudi Arabia.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Al-Rodhaan</LastName>
<ForeName>Mznah</ForeName>
<Initials>M</Initials>
<AffiliationInfo><Affiliation>Computer Science Department, College of Computer &amp; Information Sciences, King Saud University, Riyadh, Saudi Arabia.</Affiliation>
</AffiliationInfo>
</Author>
<Author ValidYN="Y"><LastName>Al-Dhelaan</LastName>
<ForeName>Abdullah</ForeName>
<Initials>A</Initials>
<AffiliationInfo><Affiliation>Computer Science Department, College of Computer &amp; Information Sciences, King Saud University, Riyadh, Saudi Arabia.</Affiliation>
</AffiliationInfo>
</Author>
</AuthorList>
<Language>eng</Language>
<PublicationTypeList><PublicationType UI="D016428">Journal Article</PublicationType>
<PublicationType UI="D013485">Research Support, Non-U.S. Gov't</PublicationType>
</PublicationTypeList>
<ArticleDate DateType="Electronic"><Year>2014</Year>
<Month>09</Month>
<Day>17</Day>
</ArticleDate>
</Article>
<MedlineJournalInfo><Country>United States</Country>
<MedlineTA>ScientificWorldJournal</MedlineTA>
<NlmUniqueID>101131163</NlmUniqueID>
<ISSNLinking>1537-744X</ISSNLinking>
</MedlineJournalInfo>
<CitationSubset>IM</CitationSubset>
<CommentsCorrectionsList><CommentsCorrections RefType="Cites"><RefSource>Radiology. 1999 Nov;213(2):317-20</RefSource>
<PMID Version="1">10551208</PMID>
</CommentsCorrections>
</CommentsCorrectionsList>
<MeshHeadingList><MeshHeading><DescriptorName MajorTopicYN="Y" UI="D000465">Algorithms</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="Y" UI="D001185">Artificial Intelligence</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="N" UI="D003196">Computer Graphics</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="N" UI="D057225">Data Mining</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000379">methods</QualifierName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="N" UI="D006801">Humans</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="N" UI="D010363">Pattern Recognition, Automated</DescriptorName>
<QualifierName MajorTopicYN="Y" UI="Q000379">methods</QualifierName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="Y" UI="D015714">Plagiarism</DescriptorName>
</MeshHeading>
<MeshHeading><DescriptorName MajorTopicYN="N" UI="D012660">Semantics</DescriptorName>
</MeshHeading>
</MeshHeadingList>
<OtherID Source="NLM">PMC4182899</OtherID>
</MedlineCitation>
<PubmedData><History><PubMedPubDate PubStatus="received"><Year>2014</Year>
<Month>3</Month>
<Day>30</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="revised"><Year>2014</Year>
<Month>6</Month>
<Day>21</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="accepted"><Year>2014</Year>
<Month>7</Month>
<Day>7</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="epublish"><Year>2014</Year>
<Month>9</Month>
<Day>17</Day>
</PubMedPubDate>
<PubMedPubDate PubStatus="entrez"><Year>2014</Year>
<Month>10</Month>
<Day>14</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="pubmed"><Year>2014</Year>
<Month>10</Month>
<Day>14</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
<PubMedPubDate PubStatus="medline"><Year>2015</Year>
<Month>5</Month>
<Day>27</Day>
<Hour>6</Hour>
<Minute>0</Minute>
</PubMedPubDate>
</History>
<PublicationStatus>ppublish</PublicationStatus>
<ArticleIdList><ArticleId IdType="doi">10.1155/2014/612787</ArticleId>
<ArticleId IdType="pubmed">25309952</ArticleId>
<ArticleId IdType="pmc">PMC4182899</ArticleId>
</ArticleIdList>
</PubmedData>
</pubmed>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/PubMed/Curation
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000011 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/PubMed/Curation/biblio.hfd -nk 000011 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= PubMed |étape= Curation |type= RBID |clé= pubmed:25309952 |texte= Intelligent bar chart plagiarism detection in documents. }}
To generate wiki pages
HfdIndexSelect -h $EXPLOR_AREA/Data/PubMed/Curation/RBID.i -Sk "pubmed:25309952" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/PubMed/Curation/biblio.hfd \
       | NlmPubMed2Wicri -a OcrV1
This area was generated with Dilib version V0.6.32.