<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wicri-demo.istex.fr/Wicri/Sic/en/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Jacques+Ducloy</id>
	<title>Wicri ICS - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wicri-demo.istex.fr/Wicri/Sic/en/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Jacques+Ducloy"/>
	<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php/Special:Contributions/Jacques_Ducloy"/>
	<updated>2026-04-19T18:31:33Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.31.10</generator>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Template:Small&amp;diff=1800</id>
		<title>Template:Small</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Template:Small&amp;diff=1800"/>
		<updated>2024-03-31T07:49:36Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: Created page with &amp;quot;&amp;lt;span style=&amp;quot;font-size:85%;&amp;quot;&amp;gt;{{{1}}}&amp;lt;/span&amp;gt;&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;span style=&amp;quot;font-size:85%;&amp;quot;&amp;gt;{{{1}}}&amp;lt;/span&amp;gt;&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1799</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1799"/>
		<updated>2024-03-31T07:49:21Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
&lt;br /&gt;
One of INIST’s departments is specialised in building thematic portal or handling statistical&lt;br /&gt;
studies dealing with research policies (Figure 2). This entity is more and more implied in&lt;br /&gt;
defining institutional indicators, bringing INIST, like CCSD, to improve the quality of&lt;br /&gt;
metadata related to relationships between authors and affiliations.&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
We have just introduced a set of operators which seems to offer a complete set of library&lt;br /&gt;
oriented services. But, for historical reasons, they had been created quite independently. Thus&lt;br /&gt;
the reality could be “less than perfect”. This paper is written by several people, coming from&lt;br /&gt;
these organisations, who have realized that interoperability was an important issue, and who&lt;br /&gt;
are working on exchange formats, generally based on qualified Dublin Core. Is this approach&lt;br /&gt;
sufficient?&lt;br /&gt;
&lt;br /&gt;
We will now present several case-studies in which we go beyond the basic bibliographic&lt;br /&gt;
needs (deposit and retrieval) in order to introduce some research policy oriented needs.&lt;br /&gt;
&lt;br /&gt;
In the first case we will analyse the handling of a PhD thesis from the start until its&lt;br /&gt;
accessibility via OAI-PMH. The “previous designed” workflow shown in Figure 3 looks&lt;br /&gt;
simple: a thesis is managed by Cyberthèses, and then sent to STAR, and at last to CCSD to be&lt;br /&gt;
integrated within articles flow. We will consider a situation in which two initial organisations,&lt;br /&gt;
university and research institution (EPST, Etablissements Publics d’Enseignement et de&lt;br /&gt;
Recherche) are concerned.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Then we will study two cases of using metadata for doing institutional surveys or creating&lt;br /&gt;
thematic portals. We will suppose that metadata could have been indexed or upgraded by a&lt;br /&gt;
documentary centre such as INIST&lt;br /&gt;
&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
As mentioned before, French research institutions encourage researchers to deposit their work&lt;br /&gt;
in an open-access repository for greater visibility and added value of their scientific&lt;br /&gt;
production. These repositories include various types of documents or data: some published&lt;br /&gt;
articles but also expertises, reports, courses, lecture notes, conference papers, thesis, software&lt;br /&gt;
documentation or primary data (demographical for instance).&lt;br /&gt;
&lt;br /&gt;
This repository is a significant and valuable source of information for the evaluation of&lt;br /&gt;
researchers and their research units based on their scientific production. For instance,&lt;br /&gt;
researchers have to provide information about their scientific production to the evaluation&lt;br /&gt;
committee every four years. Why do the researchers or their units have to provide this kind of&lt;br /&gt;
information already available in the repository?&lt;br /&gt;
&lt;br /&gt;
INRA, like any other public research institution, could be willing to transfer data from&lt;br /&gt;
document repository to the application managing the evaluation files of the researchers and&lt;br /&gt;
research units. But, even if the metadata provided from the repository is useful, the evaluation&lt;br /&gt;
process requires new and more complex metadata.&lt;br /&gt;
&lt;br /&gt;
To simplify the study, let’s look at the most fundamental exportable data which are required&lt;br /&gt;
for the “evaluation” application.&lt;br /&gt;
 &lt;br /&gt;
;People: the researcher who has completed his/her PhD thesis in a research institution has to be identified by the evaluation committee as a researcher, a former PhD student and author of a thesis available in the repository. The researcher may have changed his/her name. Identifying the various statuses or names of a person in order to establish correspondences requires enriching the metadata related to persons.&lt;br /&gt;
;Structures: the research unit where the PhD student worked may be different from the unit he is working for later on as a researcher. Both structures are entitled to claim the search results presented in the PhD thesis, the first one as research work financial support and the second one as researcher’s affiliation. The “evaluation” application needs to identify the structure the way it was mentioned in the PhD theses with its equivalent in the institution structural network&lt;br /&gt;
;Partners: the variety of research institutions in France urges to set up a list of scientific partners and to describe the various collaborations. In this way, the PhD student’s enrolment university is mentioned in the thesis. The list of partners and/or the collaboration type will have to be completed.&lt;br /&gt;
&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
INIST experience shows that detecting relationships between research communities appears&lt;br /&gt;
to be a key point for research policy [3]. About PhD theses, the computation of affiliations of&lt;br /&gt;
jury’s members could set up several kinds of interesting indicators, for instance dealing with&lt;br /&gt;
“hidden” research communities.&lt;br /&gt;
&lt;br /&gt;
On a technical point of view, the problem is to extract from metadata several homogeneous&lt;br /&gt;
items, dealing with people or affiliation. It could be easy in a standardised world; but in&lt;br /&gt;
reality, a given institution could appear in a quite large number of different lexical forms.&lt;br /&gt;
In this purpose, authority files and terminological tables play an important part in the&lt;br /&gt;
normalisation of the bibliographic data before being handled in the computational process.&lt;br /&gt;
&lt;br /&gt;
In a first step, the authority files can be used to establish the correspondence between well&lt;br /&gt;
defined items, for example, the names of countries. The technique generally used to establish&lt;br /&gt;
the equivalent terms and normalise the data fields containing data which differ in terms of&lt;br /&gt;
typography (upper case or lower case, etc.) or flexion (plural, singular), is to find a&lt;br /&gt;
convergence to a simpler form, similar to a key with which the given form is associated.&lt;br /&gt;
&lt;br /&gt;
For the majority of indicators, the analytical unit (the object of the study) is a geographical or&lt;br /&gt;
institutional entity. Publications are assigned to these entities on the basis of an analysis of the&lt;br /&gt;
addresses of the authors. Variations in the way the names of countries are written are&lt;br /&gt;
numerically limited. Relating publications to institutions is a much more difficult task which&lt;br /&gt;
cannot be achieved by a simple analysis of the addresses of the authors appearing in the&lt;br /&gt;
publications. Very often, a wide variety of different lexical forms for a given entry is found.&lt;br /&gt;
&lt;br /&gt;
This presupposes the existence of geographical (postcodes, towns, regions, countries) and&lt;br /&gt;
institutional (code for the institution, classification of the organisations by sector, etc.)&lt;br /&gt;
authority files.&lt;br /&gt;
&lt;br /&gt;
As far as we look for a merely statistical indicator, with a medium quality, this kind of post&lt;br /&gt;
process is sufficient. But if we need precise computations, some very complex situations&lt;br /&gt;
could appear.&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Figure 4 illustrates a realistic situation in Nancy geographical area, as it was two years ago.&lt;br /&gt;
This example would just illustrate the complexity of what we could meet in practice. A single&lt;br /&gt;
list of affiliation names is not sufficient and some more sophisticated tools are requested.&lt;br /&gt;
At the upper level, we have two government funded research institutions (CNRS and INRIA)&lt;br /&gt;
and one university (UHP) 36 .&lt;br /&gt;
&lt;br /&gt;
At the medium level (research unit), Loria is a joint unit from UHP, Inria, CNRS and two&lt;br /&gt;
other universities. INRIA Lorraine is the name of the INRIA component in Nancy.&lt;br /&gt;
Some years before, CRIN was the acronym for a joint unit between CNRS and Universities&lt;br /&gt;
but without INRIA. CRIN does not exist at the present time, but a lot of papers or theses are&lt;br /&gt;
indexed by CRIN and must be handled if we need an “historical study”.&lt;br /&gt;
&lt;br /&gt;
At the bottom level (project team), most of the teams, like Cortex, are part of all upper&lt;br /&gt;
organisations. But things could be more complicated! For instance, YT (for Young Team) is&lt;br /&gt;
only recognised by universities and not by INRIA (and thus by Inria Lorraine). Orpailleur is&lt;br /&gt;
getting recognised by INRIA. Oméga is a joint team between Inria Loraine and INRIA Sophia&lt;br /&gt;
but not with Loria.&lt;br /&gt;
&lt;br /&gt;
A consistent metadata schema, such as LEAF[7] one, could offer a solid base which must be&lt;br /&gt;
completed by a strong study of affiliation links. An ARTIST working group intends to work&lt;br /&gt;
on this kind of relationship, by using for instance several links which could be issued from&lt;br /&gt;
something like a “taxonomy of affiliations”.&lt;br /&gt;
&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
This last case study deals with a more thematic aspect of a research policy survey. We have&lt;br /&gt;
chosen to speak about biodiversity which is becoming a strategic issue. For instance, the&lt;br /&gt;
European Commission launched BiodiverSA37 which aims at “setting up efficient trans-&lt;br /&gt;
national co-operation in the field of biodiversity research funding”.&lt;br /&gt;
&lt;br /&gt;
In this section we will study a topic which is not really on the agenda of this project but which&lt;br /&gt;
is considered as very close to its targets: how are distributed the activities of public research&lt;br /&gt;
laboratories with regard to the main axis of biodiverSA members? This information is&lt;br /&gt;
supposed to be contained in the publications and more specifically in the PhD theses.&lt;br /&gt;
&lt;br /&gt;
In order to illustrate the complexity of the problem, here are the figures on R&amp;amp;D biodiversity&lt;br /&gt;
funding in Europe:&lt;br /&gt;
*more than one hundred funding agencies[{{Article body/Link bibl|2}}].;&lt;br /&gt;
*several programs by agency, so several hundreds of programs;&lt;br /&gt;
*several project by program, so several thousands of projects;&lt;br /&gt;
*several results, such as theses, reports or articles by project, so ten thousand publications!&lt;br /&gt;
&lt;br /&gt;
BiodiverSA intends to create an inventory of all existing biodiversity research funding&lt;br /&gt;
programs which will be implemented in a “metadatabase” (on a CERIF38 basis). Vocabulary&lt;br /&gt;
aspects will play a fundamental role. More specifically classification (or taxonomy) tools&lt;br /&gt;
must be used with some computational constraints in order to produce a set of indicators.&lt;br /&gt;
&lt;br /&gt;
The BiodivERsA classification scheme is still being designed and it could be composed of&lt;br /&gt;
three parts.&lt;br /&gt;
&lt;br /&gt;
* A general scientific component based on ASRC (Australian Standard Research Classification). ASRC is tightly related to the Organisation for Economic Cooperation and Development (OECD) Proposed Standard Practice for Surveys of Research and Experimental Development.&lt;br /&gt;
* A specific component dedicated to biodiversity, built with a combination of several existing classifications;&lt;br /&gt;
* A complementary indexation based on keywords extracted from the “CBD Controlled Vocabulary”39 .&lt;br /&gt;
&lt;br /&gt;
In this context, how could we handle the main topic of this study and, for example, how build&lt;br /&gt;
an indicator based on PhD theses?&lt;br /&gt;
A first problem is to feed a CERIF compliant database which could be used by BiodivERsA&lt;br /&gt;
with something close to qualified Dublin Core. But the most important issue is the mapping of&lt;br /&gt;
the resources in the classification system. We could imagine that a few research laboratories&lt;br /&gt;
will use this classification system in order to be visible by funding agencies. In this case, the&lt;br /&gt;
indexer needs also to be cautious with the future computational usage of its elements. (This is&lt;br /&gt;
not the same that archiving or making browsing easier).&lt;br /&gt;
&lt;br /&gt;
But a very large amount of theses related to some particular aspect of biodiversity will not use&lt;br /&gt;
this schema and several terminological adaptations will be requested. They could be quite&lt;br /&gt;
easy if the theses are indexed with a well known vocabulary (MeSH for instance).&lt;br /&gt;
&lt;br /&gt;
In the other cases, a document content linguistic analysis should be done. Once again, several&lt;br /&gt;
vocabulary oriented resources are needed, and this last sample would illustrate the need of&lt;br /&gt;
complementary terminological repositories.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt; {{Small|http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html}} &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&amp;lt;br/&amp;gt;&amp;lt; doi:10.1045/july2005-lynch &amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=11. &lt;br /&gt;
  |author=M. Patel, &lt;br /&gt;
  |text=Fourth Open Archives Forum Workshop In Practice, Good Practice: The Future&lt;br /&gt;
of Open Archives, Ariadne Issue 37, Oct 2003, &amp;lt;br/&amp;gt;&amp;lt; http://www.ariadne.ac.uk/issue37/oa-forum-ws-rpt/#6&amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=12. &lt;br /&gt;
  |author=M. Smith, M. Bass, G. McClellan, R. Tansley, M. Barton, M. Branschofsky, D. Stuve,&lt;br /&gt;
and J. H. Walker, &lt;br /&gt;
  |text=DSpace: An Open Source Dynamic Digital Repository, D-Lib Magazine, 9&lt;br /&gt;
(1), 2003.&lt;br /&gt;
&amp;lt;doi:10.1045/january2003-smith&amp;gt;.&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=13. &lt;br /&gt;
  |author=H. Suleman, A. Atkins, M. Gonçalves, R. France and E. Fox. &lt;br /&gt;
  |text=Networked Digital Library&lt;br /&gt;
of Theses and Dissertations, Bridging the Gaps for Global Access - Part 1: Mission and&lt;br /&gt;
Progress. In D-lib Magazine, September 2001&lt;br /&gt;
&amp;lt; doi:10.1045/september2001-suleman-pt1&amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1798</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1798"/>
		<updated>2024-03-31T07:45:13Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
&lt;br /&gt;
One of INIST’s departments is specialised in building thematic portal or handling statistical&lt;br /&gt;
studies dealing with research policies (Figure 2). This entity is more and more implied in&lt;br /&gt;
defining institutional indicators, bringing INIST, like CCSD, to improve the quality of&lt;br /&gt;
metadata related to relationships between authors and affiliations.&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
We have just introduced a set of operators which seems to offer a complete set of library&lt;br /&gt;
oriented services. But, for historical reasons, they had been created quite independently. Thus&lt;br /&gt;
the reality could be “less than perfect”. This paper is written by several people, coming from&lt;br /&gt;
these organisations, who have realized that interoperability was an important issue, and who&lt;br /&gt;
are working on exchange formats, generally based on qualified Dublin Core. Is this approach&lt;br /&gt;
sufficient?&lt;br /&gt;
&lt;br /&gt;
We will now present several case-studies in which we go beyond the basic bibliographic&lt;br /&gt;
needs (deposit and retrieval) in order to introduce some research policy oriented needs.&lt;br /&gt;
&lt;br /&gt;
In the first case we will analyse the handling of a PhD thesis from the start until its&lt;br /&gt;
accessibility via OAI-PMH. The “previous designed” workflow shown in Figure 3 looks&lt;br /&gt;
simple: a thesis is managed by Cyberthèses, and then sent to STAR, and at last to CCSD to be&lt;br /&gt;
integrated within articles flow. We will consider a situation in which two initial organisations,&lt;br /&gt;
university and research institution (EPST, Etablissements Publics d’Enseignement et de&lt;br /&gt;
Recherche) are concerned.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Then we will study two cases of using metadata for doing institutional surveys or creating&lt;br /&gt;
thematic portals. We will suppose that metadata could have been indexed or upgraded by a&lt;br /&gt;
documentary centre such as INIST&lt;br /&gt;
&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
As mentioned before, French research institutions encourage researchers to deposit their work&lt;br /&gt;
in an open-access repository for greater visibility and added value of their scientific&lt;br /&gt;
production. These repositories include various types of documents or data: some published&lt;br /&gt;
articles but also expertises, reports, courses, lecture notes, conference papers, thesis, software&lt;br /&gt;
documentation or primary data (demographical for instance).&lt;br /&gt;
&lt;br /&gt;
This repository is a significant and valuable source of information for the evaluation of&lt;br /&gt;
researchers and their research units based on their scientific production. For instance,&lt;br /&gt;
researchers have to provide information about their scientific production to the evaluation&lt;br /&gt;
committee every four years. Why do the researchers or their units have to provide this kind of&lt;br /&gt;
information already available in the repository?&lt;br /&gt;
&lt;br /&gt;
INRA, like any other public research institution, could be willing to transfer data from&lt;br /&gt;
document repository to the application managing the evaluation files of the researchers and&lt;br /&gt;
research units. But, even if the metadata provided from the repository is useful, the evaluation&lt;br /&gt;
process requires new and more complex metadata.&lt;br /&gt;
&lt;br /&gt;
To simplify the study, let’s look at the most fundamental exportable data which are required&lt;br /&gt;
for the “evaluation” application.&lt;br /&gt;
 &lt;br /&gt;
;People: the researcher who has completed his/her PhD thesis in a research institution has to be identified by the evaluation committee as a researcher, a former PhD student and author of a thesis available in the repository. The researcher may have changed his/her name. Identifying the various statuses or names of a person in order to establish correspondences requires enriching the metadata related to persons.&lt;br /&gt;
;Structures: the research unit where the PhD student worked may be different from the unit he is working for later on as a researcher. Both structures are entitled to claim the search results presented in the PhD thesis, the first one as research work financial support and the second one as researcher’s affiliation. The “evaluation” application needs to identify the structure the way it was mentioned in the PhD theses with its equivalent in the institution structural network&lt;br /&gt;
;Partners: the variety of research institutions in France urges to set up a list of scientific partners and to describe the various collaborations. In this way, the PhD student’s enrolment university is mentioned in the thesis. The list of partners and/or the collaboration type will have to be completed.&lt;br /&gt;
&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
INIST experience shows that detecting relationships between research communities appears&lt;br /&gt;
to be a key point for research policy [3]. About PhD theses, the computation of affiliations of&lt;br /&gt;
jury’s members could set up several kinds of interesting indicators, for instance dealing with&lt;br /&gt;
“hidden” research communities.&lt;br /&gt;
&lt;br /&gt;
On a technical point of view, the problem is to extract from metadata several homogeneous&lt;br /&gt;
items, dealing with people or affiliation. It could be easy in a standardised world; but in&lt;br /&gt;
reality, a given institution could appear in a quite large number of different lexical forms.&lt;br /&gt;
In this purpose, authority files and terminological tables play an important part in the&lt;br /&gt;
normalisation of the bibliographic data before being handled in the computational process.&lt;br /&gt;
&lt;br /&gt;
In a first step, the authority files can be used to establish the correspondence between well&lt;br /&gt;
defined items, for example, the names of countries. The technique generally used to establish&lt;br /&gt;
the equivalent terms and normalise the data fields containing data which differ in terms of&lt;br /&gt;
typography (upper case or lower case, etc.) or flexion (plural, singular), is to find a&lt;br /&gt;
convergence to a simpler form, similar to a key with which the given form is associated.&lt;br /&gt;
&lt;br /&gt;
For the majority of indicators, the analytical unit (the object of the study) is a geographical or&lt;br /&gt;
institutional entity. Publications are assigned to these entities on the basis of an analysis of the&lt;br /&gt;
addresses of the authors. Variations in the way the names of countries are written are&lt;br /&gt;
numerically limited. Relating publications to institutions is a much more difficult task which&lt;br /&gt;
cannot be achieved by a simple analysis of the addresses of the authors appearing in the&lt;br /&gt;
publications. Very often, a wide variety of different lexical forms for a given entry is found.&lt;br /&gt;
&lt;br /&gt;
This presupposes the existence of geographical (postcodes, towns, regions, countries) and&lt;br /&gt;
institutional (code for the institution, classification of the organisations by sector, etc.)&lt;br /&gt;
authority files.&lt;br /&gt;
&lt;br /&gt;
As far as we look for a merely statistical indicator, with a medium quality, this kind of post&lt;br /&gt;
process is sufficient. But if we need precise computations, some very complex situations&lt;br /&gt;
could appear.&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Figure 4 illustrates a realistic situation in Nancy geographical area, as it was two years ago.&lt;br /&gt;
This example would just illustrate the complexity of what we could meet in practice. A single&lt;br /&gt;
list of affiliation names is not sufficient and some more sophisticated tools are requested.&lt;br /&gt;
At the upper level, we have two government funded research institutions (CNRS and INRIA)&lt;br /&gt;
and one university (UHP) 36 .&lt;br /&gt;
&lt;br /&gt;
At the medium level (research unit), Loria is a joint unit from UHP, Inria, CNRS and two&lt;br /&gt;
other universities. INRIA Lorraine is the name of the INRIA component in Nancy.&lt;br /&gt;
Some years before, CRIN was the acronym for a joint unit between CNRS and Universities&lt;br /&gt;
but without INRIA. CRIN does not exist at the present time, but a lot of papers or theses are&lt;br /&gt;
indexed by CRIN and must be handled if we need an “historical study”.&lt;br /&gt;
&lt;br /&gt;
At the bottom level (project team), most of the teams, like Cortex, are part of all upper&lt;br /&gt;
organisations. But things could be more complicated! For instance, YT (for Young Team) is&lt;br /&gt;
only recognised by universities and not by INRIA (and thus by Inria Lorraine). Orpailleur is&lt;br /&gt;
getting recognised by INRIA. Oméga is a joint team between Inria Loraine and INRIA Sophia&lt;br /&gt;
but not with Loria.&lt;br /&gt;
&lt;br /&gt;
A consistent metadata schema, such as LEAF[7] one, could offer a solid base which must be&lt;br /&gt;
completed by a strong study of affiliation links. An ARTIST working group intends to work&lt;br /&gt;
on this kind of relationship, by using for instance several links which could be issued from&lt;br /&gt;
something like a “taxonomy of affiliations”.&lt;br /&gt;
&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
This last case study deals with a more thematic aspect of a research policy survey. We have&lt;br /&gt;
chosen to speak about biodiversity which is becoming a strategic issue. For instance, the&lt;br /&gt;
European Commission launched BiodiverSA37 which aims at “setting up efficient trans-&lt;br /&gt;
national co-operation in the field of biodiversity research funding”.&lt;br /&gt;
&lt;br /&gt;
In this section we will study a topic which is not really on the agenda of this project but which&lt;br /&gt;
is considered as very close to its targets: how are distributed the activities of public research&lt;br /&gt;
laboratories with regard to the main axis of biodiverSA members? This information is&lt;br /&gt;
supposed to be contained in the publications and more specifically in the PhD theses.&lt;br /&gt;
&lt;br /&gt;
In order to illustrate the complexity of the problem, here are the figures on R&amp;amp;D biodiversity&lt;br /&gt;
funding in Europe:&lt;br /&gt;
*more than one hundred funding agencies[{{Article body/Link bibl|2}}].;&lt;br /&gt;
*several programs by agency, so several hundreds of programs;&lt;br /&gt;
*several project by program, so several thousands of projects;&lt;br /&gt;
*several results, such as theses, reports or articles by project, so ten thousand publications!&lt;br /&gt;
&lt;br /&gt;
BiodiverSA intends to create an inventory of all existing biodiversity research funding&lt;br /&gt;
programs which will be implemented in a “metadatabase” (on a CERIF38 basis). Vocabulary&lt;br /&gt;
aspects will play a fundamental role. More specifically classification (or taxonomy) tools&lt;br /&gt;
must be used with some computational constraints in order to produce a set of indicators.&lt;br /&gt;
&lt;br /&gt;
The BiodivERsA classification scheme is still being designed and it could be composed of&lt;br /&gt;
three parts.&lt;br /&gt;
&lt;br /&gt;
* A general scientific component based on ASRC (Australian Standard Research Classification). ASRC is tightly related to the Organisation for Economic Cooperation and Development (OECD) Proposed Standard Practice for Surveys of Research and Experimental Development.&lt;br /&gt;
* A specific component dedicated to biodiversity, built with a combination of several existing classifications;&lt;br /&gt;
* A complementary indexation based on keywords extracted from the “CBD Controlled Vocabulary”39 .&lt;br /&gt;
&lt;br /&gt;
In this context, how could we handle the main topic of this study and, for example, how build&lt;br /&gt;
an indicator based on PhD theses?&lt;br /&gt;
A first problem is to feed a CERIF compliant database which could be used by BiodivERsA&lt;br /&gt;
with something close to qualified Dublin Core. But the most important issue is the mapping of&lt;br /&gt;
the resources in the classification system. We could imagine that a few research laboratories&lt;br /&gt;
will use this classification system in order to be visible by funding agencies. In this case, the&lt;br /&gt;
indexer needs also to be cautious with the future computational usage of its elements. (This is&lt;br /&gt;
not the same that archiving or making browsing easier).&lt;br /&gt;
&lt;br /&gt;
But a very large amount of theses related to some particular aspect of biodiversity will not use&lt;br /&gt;
this schema and several terminological adaptations will be requested. They could be quite&lt;br /&gt;
easy if the theses are indexed with a well known vocabulary (MeSH for instance).&lt;br /&gt;
&lt;br /&gt;
In the other cases, a document content linguistic analysis should be done. Once again, several&lt;br /&gt;
vocabulary oriented resources are needed, and this last sample would illustrate the need of&lt;br /&gt;
complementary terminological repositories.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt; http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&amp;lt;br/&amp;gt;&amp;lt; doi:10.1045/july2005-lynch &amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=11. &lt;br /&gt;
  |author=M. Patel, &lt;br /&gt;
  |text=Fourth Open Archives Forum Workshop In Practice, Good Practice: The Future&lt;br /&gt;
of Open Archives, Ariadne Issue 37, Oct 2003, &amp;lt;br/&amp;gt;&amp;lt; http://www.ariadne.ac.uk/issue37/oa-forum-ws-rpt/#6&amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=12. &lt;br /&gt;
  |author=M. Smith, M. Bass, G. McClellan, R. Tansley, M. Barton, M. Branschofsky, D. Stuve,&lt;br /&gt;
and J. H. Walker, &lt;br /&gt;
  |text=DSpace: An Open Source Dynamic Digital Repository, D-Lib Magazine, 9&lt;br /&gt;
(1), 2003.&lt;br /&gt;
&amp;lt;doi:10.1045/january2003-smith&amp;gt;.&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=13. &lt;br /&gt;
  |author=H. Suleman, A. Atkins, M. Gonçalves, R. France and E. Fox. &lt;br /&gt;
  |text=Networked Digital Library&lt;br /&gt;
of Theses and Dissertations, Bridging the Gaps for Global Access - Part 1: Mission and&lt;br /&gt;
Progress. In D-lib Magazine, September 2001&lt;br /&gt;
&amp;lt; doi:10.1045/september2001-suleman-pt1&amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=DC_2010_Pittsburgh&amp;diff=1797</id>
		<title>DC 2010 Pittsburgh</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=DC_2010_Pittsburgh&amp;diff=1797"/>
		<updated>2024-03-30T23:01:23Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Program */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Image:DC2010Banner.jpg|center]]&lt;br /&gt;
{{Semantic Infobox Event&lt;br /&gt;
 | Title=DC 2010&lt;br /&gt;
 | Series=DC Conference&lt;br /&gt;
 | Start date=Oct 20 2010 &lt;br /&gt;
 | Country=United States&lt;br /&gt;
 | City=Pittsburgh&lt;br /&gt;
 | Homepage=www.asis.org/Conferences/DC2010/&lt;br /&gt;
 | Homepage label=www.asis.org/ ... /DC2010&lt;br /&gt;
 | Paper deadline=Apr 9 2010&lt;br /&gt;
 }}&lt;br /&gt;
&amp;lt;big&amp;gt;&amp;lt;center&amp;gt;'''Making Metadata Work Harder:&amp;lt;br/&amp;gt;''Celebrating 15 Years of Dublin Core '''''&amp;lt;/center&amp;gt;&amp;lt;/big&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Conference Theme==&lt;br /&gt;
&lt;br /&gt;
DC-2010 marks the 15th anniversary of Dublin Core and the 10th year of the Annual Conference. To celebrate and reflect on the past and future, conference participants will engage in investigations in both research and application entailing advances that  make metadata work harder in ways beyond the originally identified need for better resource discovery. The  DCMI Abstract Model, the refined concept of  Application Profiles and the initial  Description Set Profile specification, represent an inflection point in the trajectory of metadata design and deployment in the service of human information needs. But all is not perfect and we have much to learn from sharing experience. DC-2010, will take stock of progress, look to the future and celebrate the broad scope of research and applied work in  making metadata work harder.&lt;br /&gt;
&lt;br /&gt;
Beyond the conference theme, papers, reports, and poster submissions are welcome on a wide range of metadata topics, such as:&lt;br /&gt;
&lt;br /&gt;
*Metadata principles, guidelines, and best practices&lt;br /&gt;
*Metadata quality, normalization, improvement and mapping&lt;br /&gt;
*Conceptual models and frameworks (e.g., RDF, DCAM, OAIS)&lt;br /&gt;
*Application profiles&lt;br /&gt;
*Metadata interoperability across domains, languages, time, structures, and scales.&lt;br /&gt;
*Cross-domain metadata uses (e.g., recordkeeping, preservation, curation, institutional repositories, publishing)&lt;br /&gt;
*Domain metadata (e.g., for corporations, cultural memory institutions, education, government, and scientific fields)&lt;br /&gt;
*Bibliographic standards (e.g., RDA, FRBR, subject headings) as Semantic Web vocabularies&lt;br /&gt;
*Accessibility metadata&lt;br /&gt;
*Metadata for scientific data, e-Science and grid applications&lt;br /&gt;
*Social tagging and user participation in building metadata&lt;br /&gt;
*Knowledge Organization Systems (e.g., ontologies, taxonomies, authority files, folksonomies, and thesauri) and Simple Knowledge Organization Systems (SKOS)&lt;br /&gt;
* Ontology design and development&lt;br /&gt;
* Integration of metadata and ontologies&lt;br /&gt;
* Metadata generation (methods, tools, and practices)&lt;br /&gt;
* Search engines and metadata&lt;br /&gt;
* Semantic Web metadata and applications&lt;br /&gt;
* Vocabulary registries and registry services&lt;br /&gt;
&lt;br /&gt;
==Submissions==&lt;br /&gt;
&lt;br /&gt;
Authors wishing to submit papers, reports, or poster proposals may do so through the DCMI Peer Review System at http://dcpapers.dublincore.org/index.php/pubs/. Author registration and instructions for the submission process appear under the “Information for Authors” link. Author Guidelines for full papers, project reports and posters/demonstrations are available through the DCMI Peer Review System. All submissions to the DC-2010 Conference Proceedings will be peer-reviewed by the International Program Committee. All submissions must be in English. Accepted submissions will be published in the official electronic Conference Proceedings. Unless previously arranged, accepted papers, project reports and posters must be presented in Pittsburgh by at least one of their authors. &lt;br /&gt;
&lt;br /&gt;
All submitting authors must provide basic information regarding current professional positions and affiliations as a condition of acceptance and publication.&lt;br /&gt;
&lt;br /&gt;
===Full papers (8-10 pages)===&lt;br /&gt;
&lt;br /&gt;
Full papers either describe innovative work in detail or provide critical, well-referenced overviews of key developments or good practice in the areas outlined above. Full papers will be assessed using the following criteria:&lt;br /&gt;
&lt;br /&gt;
* Originality of the approach to implementation &lt;br /&gt;
* Quality of the contribution to the implementation community &lt;br /&gt;
* Significance of the results presented &lt;br /&gt;
* Clarity of presentation &lt;br /&gt;
&lt;br /&gt;
===Project reports (4-5 pages)===&lt;br /&gt;
&lt;br /&gt;
Project reports describe a specific model, application, or activity in a concise, prescribed format. Project reports will be assessed using the following criteria:&lt;br /&gt;
&lt;br /&gt;
* Conciseness and completeness of technical description &lt;br /&gt;
* Usability of the technical description by other potential implementers &lt;br /&gt;
* Clarity of presentation &lt;br /&gt;
&lt;br /&gt;
===Posters &amp;amp; demonstrations (1-2 pages)===&lt;br /&gt;
&lt;br /&gt;
Posters are for the presentation of projects or research under development or late-breaking results. Poster proposals should consist of a one-two page extended abstract. Posters will be assessed using the following criteria:&lt;br /&gt;
&lt;br /&gt;
* Concise statement of research or project goals and milestones&lt;br /&gt;
* Significance of the research or project&lt;br /&gt;
* Framing of key barriers and future research&lt;br /&gt;
* Statement of results and accomplishments &lt;br /&gt;
* Clarity of presentation&lt;br /&gt;
&lt;br /&gt;
Accepted posters will be published in the Conference Proceedings and displayed at the conference. Unless otherwise arranged, accepted posters must be presented in Pittsburgh by at least one of their authors. However, with prior arrangement, posters may be included in the proceedings and presented by means of video ranging from 4-10 minutes in length and uploaded to YouTube with the link supplied at the time the poster is submitted. Asynchronous mechanisms for participant/author communications will be provided.&lt;br /&gt;
&lt;br /&gt;
==Committees==&lt;br /&gt;
&lt;br /&gt;
===Organization===&lt;br /&gt;
&lt;br /&gt;
* [[Dublin Core Metadata Initiative]] ;&lt;br /&gt;
* with [[American Society for Information Science and Technology]] (ASIS&amp;amp;T).&lt;br /&gt;
&lt;br /&gt;
===Comittees===&lt;br /&gt;
&lt;br /&gt;
; Conference Committee Chair:&lt;br /&gt;
:*[[Stuart A. Sutton]], [[A pour affiliation de président de comité de programme::Université de Washington]], [[USA]].&lt;br /&gt;
&lt;br /&gt;
;Conference Committee Co-chairs ,:&lt;br /&gt;
:* [[Diane Hillmann]], [[A pour affiliation de président de comité de programme::Information Institute of Syracuse ]], [[USA]] ;&lt;br /&gt;
:* [[Mike Lauruhn]], [[A pour affiliation de président de comité de programme::Taxonomy Strategies]], [[USA]]&lt;br /&gt;
;Pour les workshops :&lt;br /&gt;
:* [[Liddy Nevile]], [[Université de La Trobe|La Trobe University]], [[Australie]]&lt;br /&gt;
;Pour les tutoriaux :&lt;br /&gt;
:* [[Marcia Lei Zeng]], [[Université d'État de Kent|Kent State University]], [[États-Unis]]&lt;br /&gt;
&lt;br /&gt;
;Program Committee:&lt;br /&gt;
*[[Has PC member::Leif Andresen]], [[Danish Agency for Libraries and Media]], [[Denmark]]&lt;br /&gt;
*[[Has PC member::Ann Apps]], [[BSc MBCS CITP]], Retired, [[United Kingdom]]&lt;br /&gt;
*[[Has PC member::Thomas Baker]], [[DCMI Directorate]], [[USA]]&lt;br /&gt;
*[[Has PC member::Ana Alice Baptista]], [[Universidade do Minho]], [[Portugal]]&lt;br /&gt;
*[[Has PC member::Michael Robert Bolam]], [[University of Pittsburgh]], [[USA]]&lt;br /&gt;
*[[Has PC member::Joseph A. Busch]], [[Project Performance Corporation]], [[USA]]&lt;br /&gt;
*[[Has PC member::Douglas Campbell]], [[National Library of New Zealand]], [[New Zealand]]&lt;br /&gt;
*[[Has PC member::Chao-chen Chen]], [[National Taiwan Normal University]], [[Taiwan]]&lt;br /&gt;
*[[Has PC member::Eric Rogers Childress]], [[OCLC Research]], [[USA]]&lt;br /&gt;
*[[Has PC member::Michael D. Crandall]], [[University of Washington]], [[USA]]&lt;br /&gt;
*[[Has PC member::Jacques Ducloy]], [[DRRT Lorraine]], [[France]]&lt;br /&gt;
*[[Has PC member::Schubert Shou Boon Foo]], [[Nanyang Technological University]], [[Singapore]]&lt;br /&gt;
*[[Has PC member::Jane Greenberg]], [[University of North Carolina]], Chapel Hill, [[USA]]&lt;br /&gt;
*[[Has PC member::Corey A. Harper]], [[New York University]], [[USA]]&lt;br /&gt;
*[[Has PC member::Bernhard Haslhofer]], [[University of Vienna]], Austria&lt;br /&gt;
*[[Has PC member::Carol Hert]], [[Schema Logic]], Inc. [[USA]]&lt;br /&gt;
*[[Has PC member::Gail M. Hodge]], [[Information International Assoc.]], [[USA]]&lt;br /&gt;
*[[Has PC member::Pete Johnston]], [[Eduserv Foundation]], United Kingdom&lt;br /&gt;
*[[Has PC member::Masahide Kanzaki]], [[Zenon Ltd. Partners]], Japan&lt;br /&gt;
*[[Has PC member::Christopher S.G. Khoo]], [[Nanyang Technological University]], Singapore&lt;br /&gt;
*[[Has PC member::Traugott Koch]], [[Max Planck Digital Library]], Germany&lt;br /&gt;
*[[Has PC member::John Kunze]], [[California Digital Library]], [[USA]]&lt;br /&gt;
*[[Has PC member::Faye R. Leibowitz]], [[University of Pittsburgh]], [[USA]]&lt;br /&gt;
*[[Has PC member::Yushiana Mansor]], [[International Islamic University Malaysia]], Malaysia&lt;br /&gt;
*[[Has PC member::Filiberto Felipe Martinez-Arellano]], [[Universidad Nacional Autónoma de México]], Mexico&lt;br /&gt;
*[[Has PC member::Michael Robert]], [[Middleton Queensland University of Technology]], Australia&lt;br /&gt;
*[[Has PC member::Alistair Miles]], [[University of Oxford]], United Kingdom&lt;br /&gt;
*[[Has PC member::Akira Miyazawa]], [[National Institute of Informatics]], Japan&lt;br /&gt;
*[[Has PC member::William E. Moen]], [[University of North Texas]], [[USA]]&lt;br /&gt;
*[[Has PC member::Jin-Cheon Na Nanyang]], [[Technological University]], Singapore&lt;br /&gt;
*[[Has PC member::Mitsuharu Nagamori]], [[University of Tsukuba]], Japan&lt;br /&gt;
*[[Has PC member::Cristina Pattuelli]], [[Pratt Institute]], [[USA]]&lt;br /&gt;
*[[Has PC member::Jon Phipps]], [[JES &amp;amp; Co.]], [[USA]]&lt;br /&gt;
*[[Has PC member::Jian Qin]], [[Syracuse University]], [[USA]]&lt;br /&gt;
*[[Has PC member::Sandy K. Roe]], [[Cataloging &amp;amp; Classification Quarterly]], [[USA]]&lt;br /&gt;
*[[Has PC member::Stefanie Ruehle]], [[[[Göttingen State and University Library|SUB Goettingen]]]], Germany&lt;br /&gt;
*[[Has PC member::Bernhard Schandl]], [[University of Vienna]], Austria&lt;br /&gt;
*[[Has PC member::Aida Slavic]], [[UDC Consortium]], United Kingdom&lt;br /&gt;
*[[Has PC member::Shigeo Sugimoto]], [[University of Tsukuba]], Japan&lt;br /&gt;
*[[Has PC member::Stuart A. Sutton]], [[University of Washington]], Retired, [[USA]]&lt;br /&gt;
*[[Has PC member::Ahmed Taha]], [[UAE University]], United Arab Emirates&lt;br /&gt;
*[[Has PC member::Joseph T. Tennis]], [[University of Washington]], [[USA]]&lt;br /&gt;
*[[Has PC member::Seth van Hooland]], [[Université Libre de Bruxelles]], Belgium&lt;br /&gt;
*[[Has PC member::Sherry L. Vellucci]], [[Rutgers University]], [[USA]]&lt;br /&gt;
*[[Has PC member::Cheryl D. Walters]], [[Utah State University]], [[USA]]&lt;br /&gt;
*[[Has PC member::Mei-Ling Wang]], [[National Chengchi University]], Taiwan&lt;br /&gt;
*[[Has PC member::Hollie C. White]], [[University of North Carolina at Chapel Hill]], [[USA]]&lt;br /&gt;
*[[Has PC member::Andrew C. Wilson]], [[Australian National Data Service]], Australia&lt;br /&gt;
*[[Has PC member::Marcia Lei Zeng]], [[Kent State University]], [[USA]]&lt;br /&gt;
&lt;br /&gt;
==Program==&lt;br /&gt;
&lt;br /&gt;
;invited speakers:&lt;br /&gt;
* [[Has invited speaker::Stu Weibel]]:  A Metadata Trajectory: What 15 years of Dublin Core tells us about our future.&lt;br /&gt;
* [[Has invited speaker::Mike Bergman]]: Bridging the Gaps: Adaptive Approaches to Data Interoperability.&lt;br /&gt;
&lt;br /&gt;
;Session Principles and Innovations:&lt;br /&gt;
* Emma Tonkin, Andrew Hewson:  Building blocks of metadata: What can we learn from Lego™?&lt;br /&gt;
* Sherry Koshman: Visualizing Metadata for Environmental Datasets&lt;br /&gt;
&lt;br /&gt;
;Session Libraries and the FRBR Model :&lt;br /&gt;
*Maja Zumer, [[Marcia Lei Zeng]], Athena Salaba: FRBR: A Generalized Approach to Dublin Core Application Profiles&lt;br /&gt;
*Jenn Riley: Enhancing Interoperability of FRBR-Based Metadata&lt;br /&gt;
*Jennifer B. Bowen: Moving Library Metadata Toward Linked Data: Opportunities Provided by the eXtensible Catalog&lt;br /&gt;
&lt;br /&gt;
;Session Communities and Metadata :&lt;br /&gt;
* [[Marie-Claude Côté]], [[Margaret Devey]], [[Lynne McAvoy]], [[Leigh Bain]] :  Celebrating 10 Years of Government of Canada Metadata Standards&lt;br /&gt;
* Paul Asman, San Cannon, Christine Sommo: Extending RSS to Meet Central Bank Needs&lt;br /&gt;
* Jian Qin, Miao Chen, Xiaozhong Liu, Andrea Kathleen Wiggins: Linking Entities in Scientific Metadata&lt;br /&gt;
&lt;br /&gt;
;Session DC in Practice:&lt;br /&gt;
*Carol Jean Godby:  From records to streams: Merging library and publisher metadata&lt;br /&gt;
*Steven J. Miller: The One-To-One Principle: Challenges in Current Practice&lt;br /&gt;
*Hannah Tarver: Better Guidelines, Better Functionality: How Metadata Supports the Cycle of System Improvement at UNT&lt;br /&gt;
&lt;br /&gt;
;Session (reports) Metadata at Work:&lt;br /&gt;
Yunyun Shen, Long Xiao, Ying Feng: [https://doi.org/10.23106/dcmi.952109981 Building Metadata Application Framework for Chinese Digital Library: A Case Study of National Digital Library of China]&lt;br /&gt;
*Sarah Buchanan : Use of Community Metadata: Public Policy Research in Policy Archive&lt;br /&gt;
* Myung-Ja K. Han, Sheila Bair, Jason Lee: Creating Metadata Best Practices for CONTENTdm Users&lt;br /&gt;
&lt;br /&gt;
;Session (reports)  New Directions in Metadata:&lt;br /&gt;
*Stephanie Ogeneski Christensen, Douglas Donald Dunlop:  The Case for Implementing Core Descriptive Embedded Metadata at the Smithsonian&lt;br /&gt;
*James E. Andrews, Denise Shereff, Timothy Patrick, Rachel Richesson:  The Question about Questions: Is DC a Good Choice to Address the Challenges of Representation of Clinical Research Questions and Value Sets?&lt;br /&gt;
*Jacques Ducloy, Thierry Daunois, Muriel Foulonneau, Alice Hermann, Jean-Charles Lamirel, Stéphane Sire, Jean-Pierre Thomesse, Christine Vanoirbeek: [[Metadata for WICRI, a Network of Semantic Wikis for Communities in Research and Innovation]].&lt;br /&gt;
&lt;br /&gt;
;Tutorials:&lt;br /&gt;
* Jane Greenberg, Jon Phipps : Dublin Core: History, Key Concepts, and Evolving Context&lt;br /&gt;
* Karen Coyle, Ron Daniel : A SAFARI from the Dublin Core to the Semantic Web&lt;br /&gt;
&lt;br /&gt;
;Working groups:&lt;br /&gt;
*Libraries Task group, [http://dublincore.org/librarieswiki/ Wiki], [http://www.jiscmail.ac.uk/lists/DC-LIBRARIES.html Mailing list]&lt;br /&gt;
*Preservation Community, [http://dublincore.org/groups/preservation/ Web], [http://www.jiscmail.ac.uk/lists/dc-preservation.html Mailing list]&lt;br /&gt;
*Registry Community and Task Group, [http://dublincore.org/groups/registry/ Web], [http://wiki.metadataregistry.org/DCMI_Registry_Community#DCMI_Registry_Community_Projects Wiki], [http://www.jiscmail.ac.uk/lists/DC-REGISTRY.html Mailing list]&lt;br /&gt;
*Education Community and Task Group, [http://dublincore.org/groups/education/ Web], [http://dublincore.org/educationwiki/ Wiki], [http://www.jiscmail.ac.uk/lists/DC-EDUCATION.html Mailing list]&lt;br /&gt;
*DCMI/NKOS Task Group, [http://dublincore.org/groups/nkos/ Web], [http://www.metadataetc.org/wiki/dcmi-nkos/doku.php Wiki], [http://www.jiscmail.ac.uk/lists/DC-NKOS.html Mailing list]&lt;br /&gt;
*User Documentation and Glossary Task Group, [http://sites.google.com/site/dublincoreglossaryupdate/ Wiki], [http://www.jiscmail.ac.uk/lists/DC-GLOSSARY.html Mailing list]&lt;br /&gt;
*Libraries Community, [http://dublincore.org/groups/libraries/ Web], [http://www.jiscmail.ac.uk/lists/DC-LIBRARIES.html Mailing list]&lt;br /&gt;
*Science and Metadata Community, [http://dublincore.org/groups/sam/ Web], [http://www.jiscmail.ac.uk/lists/DC-SCIENCE.html Mailing list]&lt;br /&gt;
*Knowledge Management Community, [http://dublincore.org/groups/km/ Web], [http://www.jiscmail.ac.uk/lists/DC-KM.html Mailing list]&lt;br /&gt;
*DCMI/RDA Task Group , [http://dublincore.org/dcmirdataskgroup/ Wiki], [http://www.jiscmail.ac.uk/lists/DC-RDA.html Mailing list]&lt;br /&gt;
*Tools Community , [http://dublincore.org/groups/tools/ Web], [http://www.jiscmail.ac.uk/lists/DC-TOOLS.html Mailing list]&lt;br /&gt;
*Social Tagging Community, [http://dublincore.org/groups/social-tagging/ Web], [http://dublincore.org/taggingwiki/ Wiki], [http://www.jiscmail.ac.uk/lists/DC-SOCIAL-TAGGING.html Mailing list]&lt;br /&gt;
*Localization and Internationalization Community and Translation Task Group, [http://dublincore.org/groups/languages/ Web], [http://eiah.org/dcwiki/index.php/Translation_Task_Force Wiki], [http://www.jiscmail.ac.uk/lists/DC-INTERNATIONAL.html Mailing list]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Metadata]]&lt;br /&gt;
[[fr:DC 2010 Pittsburgh]]&lt;br /&gt;
__SHOWFACTBOX__&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1796</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1796"/>
		<updated>2024-03-30T17:38:50Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
&lt;br /&gt;
One of INIST’s departments is specialised in building thematic portal or handling statistical&lt;br /&gt;
studies dealing with research policies (Figure 2). This entity is more and more implied in&lt;br /&gt;
defining institutional indicators, bringing INIST, like CCSD, to improve the quality of&lt;br /&gt;
metadata related to relationships between authors and affiliations.&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
We have just introduced a set of operators which seems to offer a complete set of library&lt;br /&gt;
oriented services. But, for historical reasons, they had been created quite independently. Thus&lt;br /&gt;
the reality could be “less than perfect”. This paper is written by several people, coming from&lt;br /&gt;
these organisations, who have realized that interoperability was an important issue, and who&lt;br /&gt;
are working on exchange formats, generally based on qualified Dublin Core. Is this approach&lt;br /&gt;
sufficient?&lt;br /&gt;
&lt;br /&gt;
We will now present several case-studies in which we go beyond the basic bibliographic&lt;br /&gt;
needs (deposit and retrieval) in order to introduce some research policy oriented needs.&lt;br /&gt;
&lt;br /&gt;
In the first case we will analyse the handling of a PhD thesis from the start until its&lt;br /&gt;
accessibility via OAI-PMH. The “previous designed” workflow shown in Figure 3 looks&lt;br /&gt;
simple: a thesis is managed by Cyberthèses, and then sent to STAR, and at last to CCSD to be&lt;br /&gt;
integrated within articles flow. We will consider a situation in which two initial organisations,&lt;br /&gt;
university and research institution (EPST, Etablissements Publics d’Enseignement et de&lt;br /&gt;
Recherche) are concerned.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Then we will study two cases of using metadata for doing institutional surveys or creating&lt;br /&gt;
thematic portals. We will suppose that metadata could have been indexed or upgraded by a&lt;br /&gt;
documentary centre such as INIST&lt;br /&gt;
&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
As mentioned before, French research institutions encourage researchers to deposit their work&lt;br /&gt;
in an open-access repository for greater visibility and added value of their scientific&lt;br /&gt;
production. These repositories include various types of documents or data: some published&lt;br /&gt;
articles but also expertises, reports, courses, lecture notes, conference papers, thesis, software&lt;br /&gt;
documentation or primary data (demographical for instance).&lt;br /&gt;
&lt;br /&gt;
This repository is a significant and valuable source of information for the evaluation of&lt;br /&gt;
researchers and their research units based on their scientific production. For instance,&lt;br /&gt;
researchers have to provide information about their scientific production to the evaluation&lt;br /&gt;
committee every four years. Why do the researchers or their units have to provide this kind of&lt;br /&gt;
information already available in the repository?&lt;br /&gt;
&lt;br /&gt;
INRA, like any other public research institution, could be willing to transfer data from&lt;br /&gt;
document repository to the application managing the evaluation files of the researchers and&lt;br /&gt;
research units. But, even if the metadata provided from the repository is useful, the evaluation&lt;br /&gt;
process requires new and more complex metadata.&lt;br /&gt;
&lt;br /&gt;
To simplify the study, let’s look at the most fundamental exportable data which are required&lt;br /&gt;
for the “evaluation” application.&lt;br /&gt;
 &lt;br /&gt;
;People: the researcher who has completed his/her PhD thesis in a research institution has to be identified by the evaluation committee as a researcher, a former PhD student and author of a thesis available in the repository. The researcher may have changed his/her name. Identifying the various statuses or names of a person in order to establish correspondences requires enriching the metadata related to persons.&lt;br /&gt;
;Structures: the research unit where the PhD student worked may be different from the unit he is working for later on as a researcher. Both structures are entitled to claim the search results presented in the PhD thesis, the first one as research work financial support and the second one as researcher’s affiliation. The “evaluation” application needs to identify the structure the way it was mentioned in the PhD theses with its equivalent in the institution structural network&lt;br /&gt;
;Partners: the variety of research institutions in France urges to set up a list of scientific partners and to describe the various collaborations. In this way, the PhD student’s enrolment university is mentioned in the thesis. The list of partners and/or the collaboration type will have to be completed.&lt;br /&gt;
&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
INIST experience shows that detecting relationships between research communities appears&lt;br /&gt;
to be a key point for research policy [3]. About PhD theses, the computation of affiliations of&lt;br /&gt;
jury’s members could set up several kinds of interesting indicators, for instance dealing with&lt;br /&gt;
“hidden” research communities.&lt;br /&gt;
&lt;br /&gt;
On a technical point of view, the problem is to extract from metadata several homogeneous&lt;br /&gt;
items, dealing with people or affiliation. It could be easy in a standardised world; but in&lt;br /&gt;
reality, a given institution could appear in a quite large number of different lexical forms.&lt;br /&gt;
In this purpose, authority files and terminological tables play an important part in the&lt;br /&gt;
normalisation of the bibliographic data before being handled in the computational process.&lt;br /&gt;
&lt;br /&gt;
In a first step, the authority files can be used to establish the correspondence between well&lt;br /&gt;
defined items, for example, the names of countries. The technique generally used to establish&lt;br /&gt;
the equivalent terms and normalise the data fields containing data which differ in terms of&lt;br /&gt;
typography (upper case or lower case, etc.) or flexion (plural, singular), is to find a&lt;br /&gt;
convergence to a simpler form, similar to a key with which the given form is associated.&lt;br /&gt;
&lt;br /&gt;
For the majority of indicators, the analytical unit (the object of the study) is a geographical or&lt;br /&gt;
institutional entity. Publications are assigned to these entities on the basis of an analysis of the&lt;br /&gt;
addresses of the authors. Variations in the way the names of countries are written are&lt;br /&gt;
numerically limited. Relating publications to institutions is a much more difficult task which&lt;br /&gt;
cannot be achieved by a simple analysis of the addresses of the authors appearing in the&lt;br /&gt;
publications. Very often, a wide variety of different lexical forms for a given entry is found.&lt;br /&gt;
&lt;br /&gt;
This presupposes the existence of geographical (postcodes, towns, regions, countries) and&lt;br /&gt;
institutional (code for the institution, classification of the organisations by sector, etc.)&lt;br /&gt;
authority files.&lt;br /&gt;
&lt;br /&gt;
As far as we look for a merely statistical indicator, with a medium quality, this kind of post&lt;br /&gt;
process is sufficient. But if we need precise computations, some very complex situations&lt;br /&gt;
could appear.&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Figure 4 illustrates a realistic situation in Nancy geographical area, as it was two years ago.&lt;br /&gt;
This example would just illustrate the complexity of what we could meet in practice. A single&lt;br /&gt;
list of affiliation names is not sufficient and some more sophisticated tools are requested.&lt;br /&gt;
At the upper level, we have two government funded research institutions (CNRS and INRIA)&lt;br /&gt;
and one university (UHP) 36 .&lt;br /&gt;
&lt;br /&gt;
At the medium level (research unit), Loria is a joint unit from UHP, Inria, CNRS and two&lt;br /&gt;
other universities. INRIA Lorraine is the name of the INRIA component in Nancy.&lt;br /&gt;
Some years before, CRIN was the acronym for a joint unit between CNRS and Universities&lt;br /&gt;
but without INRIA. CRIN does not exist at the present time, but a lot of papers or theses are&lt;br /&gt;
indexed by CRIN and must be handled if we need an “historical study”.&lt;br /&gt;
&lt;br /&gt;
At the bottom level (project team), most of the teams, like Cortex, are part of all upper&lt;br /&gt;
organisations. But things could be more complicated! For instance, YT (for Young Team) is&lt;br /&gt;
only recognised by universities and not by INRIA (and thus by Inria Lorraine). Orpailleur is&lt;br /&gt;
getting recognised by INRIA. Oméga is a joint team between Inria Loraine and INRIA Sophia&lt;br /&gt;
but not with Loria.&lt;br /&gt;
&lt;br /&gt;
A consistent metadata schema, such as LEAF[7] one, could offer a solid base which must be&lt;br /&gt;
completed by a strong study of affiliation links. An ARTIST working group intends to work&lt;br /&gt;
on this kind of relationship, by using for instance several links which could be issued from&lt;br /&gt;
something like a “taxonomy of affiliations”.&lt;br /&gt;
&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
This last case study deals with a more thematic aspect of a research policy survey. We have&lt;br /&gt;
chosen to speak about biodiversity which is becoming a strategic issue. For instance, the&lt;br /&gt;
European Commission launched BiodiverSA37 which aims at “setting up efficient trans-&lt;br /&gt;
national co-operation in the field of biodiversity research funding”.&lt;br /&gt;
&lt;br /&gt;
In this section we will study a topic which is not really on the agenda of this project but which&lt;br /&gt;
is considered as very close to its targets: how are distributed the activities of public research&lt;br /&gt;
laboratories with regard to the main axis of biodiverSA members? This information is&lt;br /&gt;
supposed to be contained in the publications and more specifically in the PhD theses.&lt;br /&gt;
&lt;br /&gt;
In order to illustrate the complexity of the problem, here are the figures on R&amp;amp;D biodiversity&lt;br /&gt;
funding in Europe:&lt;br /&gt;
*more than one hundred funding agencies[{{Article body/Link bibl|2}}].;&lt;br /&gt;
*several programs by agency, so several hundreds of programs;&lt;br /&gt;
*several project by program, so several thousands of projects;&lt;br /&gt;
*several results, such as theses, reports or articles by project, so ten thousand publications!&lt;br /&gt;
&lt;br /&gt;
BiodiverSA intends to create an inventory of all existing biodiversity research funding&lt;br /&gt;
programs which will be implemented in a “metadatabase” (on a CERIF38 basis). Vocabulary&lt;br /&gt;
aspects will play a fundamental role. More specifically classification (or taxonomy) tools&lt;br /&gt;
must be used with some computational constraints in order to produce a set of indicators.&lt;br /&gt;
&lt;br /&gt;
The BiodivERsA classification scheme is still being designed and it could be composed of&lt;br /&gt;
three parts.&lt;br /&gt;
&lt;br /&gt;
* A general scientific component based on ASRC (Australian Standard Research Classification). ASRC is tightly related to the Organisation for Economic Cooperation and Development (OECD) Proposed Standard Practice for Surveys of Research and Experimental Development.&lt;br /&gt;
* A specific component dedicated to biodiversity, built with a combination of several existing classifications;&lt;br /&gt;
* A complementary indexation based on keywords extracted from the “CBD Controlled Vocabulary”39 .&lt;br /&gt;
&lt;br /&gt;
In this context, how could we handle the main topic of this study and, for example, how build&lt;br /&gt;
an indicator based on PhD theses?&lt;br /&gt;
A first problem is to feed a CERIF compliant database which could be used by BiodivERsA&lt;br /&gt;
with something close to qualified Dublin Core. But the most important issue is the mapping of&lt;br /&gt;
the resources in the classification system. We could imagine that a few research laboratories&lt;br /&gt;
will use this classification system in order to be visible by funding agencies. In this case, the&lt;br /&gt;
indexer needs also to be cautious with the future computational usage of its elements. (This is&lt;br /&gt;
not the same that archiving or making browsing easier).&lt;br /&gt;
&lt;br /&gt;
But a very large amount of theses related to some particular aspect of biodiversity will not use&lt;br /&gt;
this schema and several terminological adaptations will be requested. They could be quite&lt;br /&gt;
easy if the theses are indexed with a well known vocabulary (MeSH for instance).&lt;br /&gt;
&lt;br /&gt;
In the other cases, a document content linguistic analysis should be done. Once again, several&lt;br /&gt;
vocabulary oriented resources are needed, and this last sample would illustrate the need of&lt;br /&gt;
complementary terminological repositories.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&amp;lt;br/&amp;gt;&amp;lt; doi:10.1045/july2005-lynch &amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=11. &lt;br /&gt;
  |author=M. Patel, &lt;br /&gt;
  |text=Fourth Open Archives Forum Workshop In Practice, Good Practice: The Future&lt;br /&gt;
of Open Archives, Ariadne Issue 37, Oct 2003, &amp;lt;br/&amp;gt;&amp;lt; http://www.ariadne.ac.uk/issue37/oa-forum-ws-rpt/#6&amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=12. &lt;br /&gt;
  |author=M. Smith, M. Bass, G. McClellan, R. Tansley, M. Barton, M. Branschofsky, D. Stuve,&lt;br /&gt;
and J. H. Walker, &lt;br /&gt;
  |text=DSpace: An Open Source Dynamic Digital Repository, D-Lib Magazine, 9&lt;br /&gt;
(1), 2003.&lt;br /&gt;
&amp;lt;doi:10.1045/january2003-smith&amp;gt;.&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=13. &lt;br /&gt;
  |author=H. Suleman, A. Atkins, M. Gonçalves, R. France and E. Fox. &lt;br /&gt;
  |text=Networked Digital Library&lt;br /&gt;
of Theses and Dissertations, Bridging the Gaps for Global Access - Part 1: Mission and&lt;br /&gt;
Progress. In D-lib Magazine, September 2001&lt;br /&gt;
&amp;lt; doi:10.1045/september2001-suleman-pt1&amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1795</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1795"/>
		<updated>2024-03-30T17:36:00Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
&lt;br /&gt;
One of INIST’s departments is specialised in building thematic portal or handling statistical&lt;br /&gt;
studies dealing with research policies (Figure 2). This entity is more and more implied in&lt;br /&gt;
defining institutional indicators, bringing INIST, like CCSD, to improve the quality of&lt;br /&gt;
metadata related to relationships between authors and affiliations.&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
We have just introduced a set of operators which seems to offer a complete set of library&lt;br /&gt;
oriented services. But, for historical reasons, they had been created quite independently. Thus&lt;br /&gt;
the reality could be “less than perfect”. This paper is written by several people, coming from&lt;br /&gt;
these organisations, who have realized that interoperability was an important issue, and who&lt;br /&gt;
are working on exchange formats, generally based on qualified Dublin Core. Is this approach&lt;br /&gt;
sufficient?&lt;br /&gt;
&lt;br /&gt;
We will now present several case-studies in which we go beyond the basic bibliographic&lt;br /&gt;
needs (deposit and retrieval) in order to introduce some research policy oriented needs.&lt;br /&gt;
&lt;br /&gt;
In the first case we will analyse the handling of a PhD thesis from the start until its&lt;br /&gt;
accessibility via OAI-PMH. The “previous designed” workflow shown in Figure 3 looks&lt;br /&gt;
simple: a thesis is managed by Cyberthèses, and then sent to STAR, and at last to CCSD to be&lt;br /&gt;
integrated within articles flow. We will consider a situation in which two initial organisations,&lt;br /&gt;
university and research institution (EPST, Etablissements Publics d’Enseignement et de&lt;br /&gt;
Recherche) are concerned.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Then we will study two cases of using metadata for doing institutional surveys or creating&lt;br /&gt;
thematic portals. We will suppose that metadata could have been indexed or upgraded by a&lt;br /&gt;
documentary centre such as INIST&lt;br /&gt;
&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
As mentioned before, French research institutions encourage researchers to deposit their work&lt;br /&gt;
in an open-access repository for greater visibility and added value of their scientific&lt;br /&gt;
production. These repositories include various types of documents or data: some published&lt;br /&gt;
articles but also expertises, reports, courses, lecture notes, conference papers, thesis, software&lt;br /&gt;
documentation or primary data (demographical for instance).&lt;br /&gt;
&lt;br /&gt;
This repository is a significant and valuable source of information for the evaluation of&lt;br /&gt;
researchers and their research units based on their scientific production. For instance,&lt;br /&gt;
researchers have to provide information about their scientific production to the evaluation&lt;br /&gt;
committee every four years. Why do the researchers or their units have to provide this kind of&lt;br /&gt;
information already available in the repository?&lt;br /&gt;
&lt;br /&gt;
INRA, like any other public research institution, could be willing to transfer data from&lt;br /&gt;
document repository to the application managing the evaluation files of the researchers and&lt;br /&gt;
research units. But, even if the metadata provided from the repository is useful, the evaluation&lt;br /&gt;
process requires new and more complex metadata.&lt;br /&gt;
&lt;br /&gt;
To simplify the study, let’s look at the most fundamental exportable data which are required&lt;br /&gt;
for the “evaluation” application.&lt;br /&gt;
 &lt;br /&gt;
;People: the researcher who has completed his/her PhD thesis in a research institution has to be identified by the evaluation committee as a researcher, a former PhD student and author of a thesis available in the repository. The researcher may have changed his/her name. Identifying the various statuses or names of a person in order to establish correspondences requires enriching the metadata related to persons.&lt;br /&gt;
;Structures: the research unit where the PhD student worked may be different from the unit he is working for later on as a researcher. Both structures are entitled to claim the search results presented in the PhD thesis, the first one as research work financial support and the second one as researcher’s affiliation. The “evaluation” application needs to identify the structure the way it was mentioned in the PhD theses with its equivalent in the institution structural network&lt;br /&gt;
;Partners: the variety of research institutions in France urges to set up a list of scientific partners and to describe the various collaborations. In this way, the PhD student’s enrolment university is mentioned in the thesis. The list of partners and/or the collaboration type will have to be completed.&lt;br /&gt;
&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
INIST experience shows that detecting relationships between research communities appears&lt;br /&gt;
to be a key point for research policy [3]. About PhD theses, the computation of affiliations of&lt;br /&gt;
jury’s members could set up several kinds of interesting indicators, for instance dealing with&lt;br /&gt;
“hidden” research communities.&lt;br /&gt;
&lt;br /&gt;
On a technical point of view, the problem is to extract from metadata several homogeneous&lt;br /&gt;
items, dealing with people or affiliation. It could be easy in a standardised world; but in&lt;br /&gt;
reality, a given institution could appear in a quite large number of different lexical forms.&lt;br /&gt;
In this purpose, authority files and terminological tables play an important part in the&lt;br /&gt;
normalisation of the bibliographic data before being handled in the computational process.&lt;br /&gt;
&lt;br /&gt;
In a first step, the authority files can be used to establish the correspondence between well&lt;br /&gt;
defined items, for example, the names of countries. The technique generally used to establish&lt;br /&gt;
the equivalent terms and normalise the data fields containing data which differ in terms of&lt;br /&gt;
typography (upper case or lower case, etc.) or flexion (plural, singular), is to find a&lt;br /&gt;
convergence to a simpler form, similar to a key with which the given form is associated.&lt;br /&gt;
&lt;br /&gt;
For the majority of indicators, the analytical unit (the object of the study) is a geographical or&lt;br /&gt;
institutional entity. Publications are assigned to these entities on the basis of an analysis of the&lt;br /&gt;
addresses of the authors. Variations in the way the names of countries are written are&lt;br /&gt;
numerically limited. Relating publications to institutions is a much more difficult task which&lt;br /&gt;
cannot be achieved by a simple analysis of the addresses of the authors appearing in the&lt;br /&gt;
publications. Very often, a wide variety of different lexical forms for a given entry is found.&lt;br /&gt;
&lt;br /&gt;
This presupposes the existence of geographical (postcodes, towns, regions, countries) and&lt;br /&gt;
institutional (code for the institution, classification of the organisations by sector, etc.)&lt;br /&gt;
authority files.&lt;br /&gt;
&lt;br /&gt;
As far as we look for a merely statistical indicator, with a medium quality, this kind of post&lt;br /&gt;
process is sufficient. But if we need precise computations, some very complex situations&lt;br /&gt;
could appear.&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Figure 4 illustrates a realistic situation in Nancy geographical area, as it was two years ago.&lt;br /&gt;
This example would just illustrate the complexity of what we could meet in practice. A single&lt;br /&gt;
list of affiliation names is not sufficient and some more sophisticated tools are requested.&lt;br /&gt;
At the upper level, we have two government funded research institutions (CNRS and INRIA)&lt;br /&gt;
and one university (UHP) 36 .&lt;br /&gt;
&lt;br /&gt;
At the medium level (research unit), Loria is a joint unit from UHP, Inria, CNRS and two&lt;br /&gt;
other universities. INRIA Lorraine is the name of the INRIA component in Nancy.&lt;br /&gt;
Some years before, CRIN was the acronym for a joint unit between CNRS and Universities&lt;br /&gt;
but without INRIA. CRIN does not exist at the present time, but a lot of papers or theses are&lt;br /&gt;
indexed by CRIN and must be handled if we need an “historical study”.&lt;br /&gt;
&lt;br /&gt;
At the bottom level (project team), most of the teams, like Cortex, are part of all upper&lt;br /&gt;
organisations. But things could be more complicated! For instance, YT (for Young Team) is&lt;br /&gt;
only recognised by universities and not by INRIA (and thus by Inria Lorraine). Orpailleur is&lt;br /&gt;
getting recognised by INRIA. Oméga is a joint team between Inria Loraine and INRIA Sophia&lt;br /&gt;
but not with Loria.&lt;br /&gt;
&lt;br /&gt;
A consistent metadata schema, such as LEAF[7] one, could offer a solid base which must be&lt;br /&gt;
completed by a strong study of affiliation links. An ARTIST working group intends to work&lt;br /&gt;
on this kind of relationship, by using for instance several links which could be issued from&lt;br /&gt;
something like a “taxonomy of affiliations”.&lt;br /&gt;
&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
This last case study deals with a more thematic aspect of a research policy survey. We have&lt;br /&gt;
chosen to speak about biodiversity which is becoming a strategic issue. For instance, the&lt;br /&gt;
European Commission launched BiodiverSA37 which aims at “setting up efficient trans-&lt;br /&gt;
national co-operation in the field of biodiversity research funding”.&lt;br /&gt;
&lt;br /&gt;
In this section we will study a topic which is not really on the agenda of this project but which&lt;br /&gt;
is considered as very close to its targets: how are distributed the activities of public research&lt;br /&gt;
laboratories with regard to the main axis of biodiverSA members? This information is&lt;br /&gt;
supposed to be contained in the publications and more specifically in the PhD theses.&lt;br /&gt;
&lt;br /&gt;
In order to illustrate the complexity of the problem, here are the figures on R&amp;amp;D biodiversity&lt;br /&gt;
funding in Europe:&lt;br /&gt;
*more than one hundred funding agencies[{{Article body/Link bibl|2}}].;&lt;br /&gt;
*several programs by agency, so several hundreds of programs;&lt;br /&gt;
*several project by program, so several thousands of projects;&lt;br /&gt;
*several results, such as theses, reports or articles by project, so ten thousand publications!&lt;br /&gt;
&lt;br /&gt;
BiodiverSA intends to create an inventory of all existing biodiversity research funding&lt;br /&gt;
programs which will be implemented in a “metadatabase” (on a CERIF38 basis). Vocabulary&lt;br /&gt;
aspects will play a fundamental role. More specifically classification (or taxonomy) tools&lt;br /&gt;
must be used with some computational constraints in order to produce a set of indicators.&lt;br /&gt;
&lt;br /&gt;
The BiodivERsA classification scheme is still being designed and it could be composed of&lt;br /&gt;
three parts.&lt;br /&gt;
&lt;br /&gt;
* A general scientific component based on ASRC (Australian Standard Research Classification). ASRC is tightly related to the Organisation for Economic Cooperation and Development (OECD) Proposed Standard Practice for Surveys of Research and Experimental Development.&lt;br /&gt;
* A specific component dedicated to biodiversity, built with a combination of several existing classifications;&lt;br /&gt;
* A complementary indexation based on keywords extracted from the “CBD Controlled Vocabulary”39 .&lt;br /&gt;
&lt;br /&gt;
In this context, how could we handle the main topic of this study and, for example, how build&lt;br /&gt;
an indicator based on PhD theses?&lt;br /&gt;
A first problem is to feed a CERIF compliant database which could be used by BiodivERsA&lt;br /&gt;
with something close to qualified Dublin Core. But the most important issue is the mapping of&lt;br /&gt;
the resources in the classification system. We could imagine that a few research laboratories&lt;br /&gt;
will use this classification system in order to be visible by funding agencies. In this case, the&lt;br /&gt;
indexer needs also to be cautious with the future computational usage of its elements. (This is&lt;br /&gt;
not the same that archiving or making browsing easier).&lt;br /&gt;
&lt;br /&gt;
But a very large amount of theses related to some particular aspect of biodiversity will not use&lt;br /&gt;
this schema and several terminological adaptations will be requested. They could be quite&lt;br /&gt;
easy if the theses are indexed with a well known vocabulary (MeSH for instance).&lt;br /&gt;
&lt;br /&gt;
In the other cases, a document content linguistic analysis should be done. Once again, several&lt;br /&gt;
vocabulary oriented resources are needed, and this last sample would illustrate the need of&lt;br /&gt;
complementary terminological repositories.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&amp;lt;br/&amp;gt;&amp;lt; doi:10.1045/july2005-lynch &amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=11. &lt;br /&gt;
  |author=M. Patel, &lt;br /&gt;
  |text=Fourth Open Archives Forum Workshop In Practice, Good Practice: The Future&lt;br /&gt;
of Open Archives, Ariadne Issue 37, Oct 2003, &amp;lt;br/&amp;gt;&amp;lt; http://www.ariadne.ac.uk/issue37/oa-forum-ws-rpt/#6&amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
12. M. Smith, M. Bass, G. McClellan, R. Tansley, M. Barton, M. Branschofsky, D. Stuve,&lt;br /&gt;
and J. H. Walker, DSpace: An Open Source Dynamic Digital Repository, D-Lib Magazine, 9&lt;br /&gt;
(1), 2003.&lt;br /&gt;
&amp;lt;doi:10.1045/january2003-smith&amp;gt;.&lt;br /&gt;
13. H. Suleman, A. Atkins, M. Gonçalves, R. France and E. Fox. Networked Digital Library&lt;br /&gt;
of Theses and Dissertations, Bridging the Gaps for Global Access - Part 1: Mission and&lt;br /&gt;
Progress. In D-lib Magazine, September 2001&lt;br /&gt;
&amp;lt; doi:10.1045/september2001-suleman-pt1&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1794</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1794"/>
		<updated>2024-03-30T17:30:12Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Thematic survey about biodiversity */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
&lt;br /&gt;
One of INIST’s departments is specialised in building thematic portal or handling statistical&lt;br /&gt;
studies dealing with research policies (Figure 2). This entity is more and more implied in&lt;br /&gt;
defining institutional indicators, bringing INIST, like CCSD, to improve the quality of&lt;br /&gt;
metadata related to relationships between authors and affiliations.&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
We have just introduced a set of operators which seems to offer a complete set of library&lt;br /&gt;
oriented services. But, for historical reasons, they had been created quite independently. Thus&lt;br /&gt;
the reality could be “less than perfect”. This paper is written by several people, coming from&lt;br /&gt;
these organisations, who have realized that interoperability was an important issue, and who&lt;br /&gt;
are working on exchange formats, generally based on qualified Dublin Core. Is this approach&lt;br /&gt;
sufficient?&lt;br /&gt;
&lt;br /&gt;
We will now present several case-studies in which we go beyond the basic bibliographic&lt;br /&gt;
needs (deposit and retrieval) in order to introduce some research policy oriented needs.&lt;br /&gt;
&lt;br /&gt;
In the first case we will analyse the handling of a PhD thesis from the start until its&lt;br /&gt;
accessibility via OAI-PMH. The “previous designed” workflow shown in Figure 3 looks&lt;br /&gt;
simple: a thesis is managed by Cyberthèses, and then sent to STAR, and at last to CCSD to be&lt;br /&gt;
integrated within articles flow. We will consider a situation in which two initial organisations,&lt;br /&gt;
university and research institution (EPST, Etablissements Publics d’Enseignement et de&lt;br /&gt;
Recherche) are concerned.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Then we will study two cases of using metadata for doing institutional surveys or creating&lt;br /&gt;
thematic portals. We will suppose that metadata could have been indexed or upgraded by a&lt;br /&gt;
documentary centre such as INIST&lt;br /&gt;
&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
As mentioned before, French research institutions encourage researchers to deposit their work&lt;br /&gt;
in an open-access repository for greater visibility and added value of their scientific&lt;br /&gt;
production. These repositories include various types of documents or data: some published&lt;br /&gt;
articles but also expertises, reports, courses, lecture notes, conference papers, thesis, software&lt;br /&gt;
documentation or primary data (demographical for instance).&lt;br /&gt;
&lt;br /&gt;
This repository is a significant and valuable source of information for the evaluation of&lt;br /&gt;
researchers and their research units based on their scientific production. For instance,&lt;br /&gt;
researchers have to provide information about their scientific production to the evaluation&lt;br /&gt;
committee every four years. Why do the researchers or their units have to provide this kind of&lt;br /&gt;
information already available in the repository?&lt;br /&gt;
&lt;br /&gt;
INRA, like any other public research institution, could be willing to transfer data from&lt;br /&gt;
document repository to the application managing the evaluation files of the researchers and&lt;br /&gt;
research units. But, even if the metadata provided from the repository is useful, the evaluation&lt;br /&gt;
process requires new and more complex metadata.&lt;br /&gt;
&lt;br /&gt;
To simplify the study, let’s look at the most fundamental exportable data which are required&lt;br /&gt;
for the “evaluation” application.&lt;br /&gt;
 &lt;br /&gt;
;People: the researcher who has completed his/her PhD thesis in a research institution has to be identified by the evaluation committee as a researcher, a former PhD student and author of a thesis available in the repository. The researcher may have changed his/her name. Identifying the various statuses or names of a person in order to establish correspondences requires enriching the metadata related to persons.&lt;br /&gt;
;Structures: the research unit where the PhD student worked may be different from the unit he is working for later on as a researcher. Both structures are entitled to claim the search results presented in the PhD thesis, the first one as research work financial support and the second one as researcher’s affiliation. The “evaluation” application needs to identify the structure the way it was mentioned in the PhD theses with its equivalent in the institution structural network&lt;br /&gt;
;Partners: the variety of research institutions in France urges to set up a list of scientific partners and to describe the various collaborations. In this way, the PhD student’s enrolment university is mentioned in the thesis. The list of partners and/or the collaboration type will have to be completed.&lt;br /&gt;
&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
INIST experience shows that detecting relationships between research communities appears&lt;br /&gt;
to be a key point for research policy [3]. About PhD theses, the computation of affiliations of&lt;br /&gt;
jury’s members could set up several kinds of interesting indicators, for instance dealing with&lt;br /&gt;
“hidden” research communities.&lt;br /&gt;
&lt;br /&gt;
On a technical point of view, the problem is to extract from metadata several homogeneous&lt;br /&gt;
items, dealing with people or affiliation. It could be easy in a standardised world; but in&lt;br /&gt;
reality, a given institution could appear in a quite large number of different lexical forms.&lt;br /&gt;
In this purpose, authority files and terminological tables play an important part in the&lt;br /&gt;
normalisation of the bibliographic data before being handled in the computational process.&lt;br /&gt;
&lt;br /&gt;
In a first step, the authority files can be used to establish the correspondence between well&lt;br /&gt;
defined items, for example, the names of countries. The technique generally used to establish&lt;br /&gt;
the equivalent terms and normalise the data fields containing data which differ in terms of&lt;br /&gt;
typography (upper case or lower case, etc.) or flexion (plural, singular), is to find a&lt;br /&gt;
convergence to a simpler form, similar to a key with which the given form is associated.&lt;br /&gt;
&lt;br /&gt;
For the majority of indicators, the analytical unit (the object of the study) is a geographical or&lt;br /&gt;
institutional entity. Publications are assigned to these entities on the basis of an analysis of the&lt;br /&gt;
addresses of the authors. Variations in the way the names of countries are written are&lt;br /&gt;
numerically limited. Relating publications to institutions is a much more difficult task which&lt;br /&gt;
cannot be achieved by a simple analysis of the addresses of the authors appearing in the&lt;br /&gt;
publications. Very often, a wide variety of different lexical forms for a given entry is found.&lt;br /&gt;
&lt;br /&gt;
This presupposes the existence of geographical (postcodes, towns, regions, countries) and&lt;br /&gt;
institutional (code for the institution, classification of the organisations by sector, etc.)&lt;br /&gt;
authority files.&lt;br /&gt;
&lt;br /&gt;
As far as we look for a merely statistical indicator, with a medium quality, this kind of post&lt;br /&gt;
process is sufficient. But if we need precise computations, some very complex situations&lt;br /&gt;
could appear.&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Figure 4 illustrates a realistic situation in Nancy geographical area, as it was two years ago.&lt;br /&gt;
This example would just illustrate the complexity of what we could meet in practice. A single&lt;br /&gt;
list of affiliation names is not sufficient and some more sophisticated tools are requested.&lt;br /&gt;
At the upper level, we have two government funded research institutions (CNRS and INRIA)&lt;br /&gt;
and one university (UHP) 36 .&lt;br /&gt;
&lt;br /&gt;
At the medium level (research unit), Loria is a joint unit from UHP, Inria, CNRS and two&lt;br /&gt;
other universities. INRIA Lorraine is the name of the INRIA component in Nancy.&lt;br /&gt;
Some years before, CRIN was the acronym for a joint unit between CNRS and Universities&lt;br /&gt;
but without INRIA. CRIN does not exist at the present time, but a lot of papers or theses are&lt;br /&gt;
indexed by CRIN and must be handled if we need an “historical study”.&lt;br /&gt;
&lt;br /&gt;
At the bottom level (project team), most of the teams, like Cortex, are part of all upper&lt;br /&gt;
organisations. But things could be more complicated! For instance, YT (for Young Team) is&lt;br /&gt;
only recognised by universities and not by INRIA (and thus by Inria Lorraine). Orpailleur is&lt;br /&gt;
getting recognised by INRIA. Oméga is a joint team between Inria Loraine and INRIA Sophia&lt;br /&gt;
but not with Loria.&lt;br /&gt;
&lt;br /&gt;
A consistent metadata schema, such as LEAF[7] one, could offer a solid base which must be&lt;br /&gt;
completed by a strong study of affiliation links. An ARTIST working group intends to work&lt;br /&gt;
on this kind of relationship, by using for instance several links which could be issued from&lt;br /&gt;
something like a “taxonomy of affiliations”.&lt;br /&gt;
&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
This last case study deals with a more thematic aspect of a research policy survey. We have&lt;br /&gt;
chosen to speak about biodiversity which is becoming a strategic issue. For instance, the&lt;br /&gt;
European Commission launched BiodiverSA37 which aims at “setting up efficient trans-&lt;br /&gt;
national co-operation in the field of biodiversity research funding”.&lt;br /&gt;
&lt;br /&gt;
In this section we will study a topic which is not really on the agenda of this project but which&lt;br /&gt;
is considered as very close to its targets: how are distributed the activities of public research&lt;br /&gt;
laboratories with regard to the main axis of biodiverSA members? This information is&lt;br /&gt;
supposed to be contained in the publications and more specifically in the PhD theses.&lt;br /&gt;
&lt;br /&gt;
In order to illustrate the complexity of the problem, here are the figures on R&amp;amp;D biodiversity&lt;br /&gt;
funding in Europe:&lt;br /&gt;
*more than one hundred funding agencies[{{Article body/Link bibl|2}}].;&lt;br /&gt;
*several programs by agency, so several hundreds of programs;&lt;br /&gt;
*several project by program, so several thousands of projects;&lt;br /&gt;
*several results, such as theses, reports or articles by project, so ten thousand publications!&lt;br /&gt;
&lt;br /&gt;
BiodiverSA intends to create an inventory of all existing biodiversity research funding&lt;br /&gt;
programs which will be implemented in a “metadatabase” (on a CERIF38 basis). Vocabulary&lt;br /&gt;
aspects will play a fundamental role. More specifically classification (or taxonomy) tools&lt;br /&gt;
must be used with some computational constraints in order to produce a set of indicators.&lt;br /&gt;
&lt;br /&gt;
The BiodivERsA classification scheme is still being designed and it could be composed of&lt;br /&gt;
three parts.&lt;br /&gt;
&lt;br /&gt;
* A general scientific component based on ASRC (Australian Standard Research Classification). ASRC is tightly related to the Organisation for Economic Cooperation and Development (OECD) Proposed Standard Practice for Surveys of Research and Experimental Development.&lt;br /&gt;
* A specific component dedicated to biodiversity, built with a combination of several existing classifications;&lt;br /&gt;
* A complementary indexation based on keywords extracted from the “CBD Controlled Vocabulary”39 .&lt;br /&gt;
&lt;br /&gt;
In this context, how could we handle the main topic of this study and, for example, how build&lt;br /&gt;
an indicator based on PhD theses?&lt;br /&gt;
A first problem is to feed a CERIF compliant database which could be used by BiodivERsA&lt;br /&gt;
with something close to qualified Dublin Core. But the most important issue is the mapping of&lt;br /&gt;
the resources in the classification system. We could imagine that a few research laboratories&lt;br /&gt;
will use this classification system in order to be visible by funding agencies. In this case, the&lt;br /&gt;
indexer needs also to be cautious with the future computational usage of its elements. (This is&lt;br /&gt;
not the same that archiving or making browsing easier).&lt;br /&gt;
&lt;br /&gt;
But a very large amount of theses related to some particular aspect of biodiversity will not use&lt;br /&gt;
this schema and several terminological adaptations will be requested. They could be quite&lt;br /&gt;
easy if the theses are indexed with a well known vocabulary (MeSH for instance).&lt;br /&gt;
&lt;br /&gt;
In the other cases, a document content linguistic analysis should be done. Once again, several&lt;br /&gt;
vocabulary oriented resources are needed, and this last sample would illustrate the need of&lt;br /&gt;
complementary terminological repositories.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1793</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1793"/>
		<updated>2024-03-30T17:25:00Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Thematic survey about biodiversity */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
&lt;br /&gt;
One of INIST’s departments is specialised in building thematic portal or handling statistical&lt;br /&gt;
studies dealing with research policies (Figure 2). This entity is more and more implied in&lt;br /&gt;
defining institutional indicators, bringing INIST, like CCSD, to improve the quality of&lt;br /&gt;
metadata related to relationships between authors and affiliations.&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
We have just introduced a set of operators which seems to offer a complete set of library&lt;br /&gt;
oriented services. But, for historical reasons, they had been created quite independently. Thus&lt;br /&gt;
the reality could be “less than perfect”. This paper is written by several people, coming from&lt;br /&gt;
these organisations, who have realized that interoperability was an important issue, and who&lt;br /&gt;
are working on exchange formats, generally based on qualified Dublin Core. Is this approach&lt;br /&gt;
sufficient?&lt;br /&gt;
&lt;br /&gt;
We will now present several case-studies in which we go beyond the basic bibliographic&lt;br /&gt;
needs (deposit and retrieval) in order to introduce some research policy oriented needs.&lt;br /&gt;
&lt;br /&gt;
In the first case we will analyse the handling of a PhD thesis from the start until its&lt;br /&gt;
accessibility via OAI-PMH. The “previous designed” workflow shown in Figure 3 looks&lt;br /&gt;
simple: a thesis is managed by Cyberthèses, and then sent to STAR, and at last to CCSD to be&lt;br /&gt;
integrated within articles flow. We will consider a situation in which two initial organisations,&lt;br /&gt;
university and research institution (EPST, Etablissements Publics d’Enseignement et de&lt;br /&gt;
Recherche) are concerned.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Then we will study two cases of using metadata for doing institutional surveys or creating&lt;br /&gt;
thematic portals. We will suppose that metadata could have been indexed or upgraded by a&lt;br /&gt;
documentary centre such as INIST&lt;br /&gt;
&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
As mentioned before, French research institutions encourage researchers to deposit their work&lt;br /&gt;
in an open-access repository for greater visibility and added value of their scientific&lt;br /&gt;
production. These repositories include various types of documents or data: some published&lt;br /&gt;
articles but also expertises, reports, courses, lecture notes, conference papers, thesis, software&lt;br /&gt;
documentation or primary data (demographical for instance).&lt;br /&gt;
&lt;br /&gt;
This repository is a significant and valuable source of information for the evaluation of&lt;br /&gt;
researchers and their research units based on their scientific production. For instance,&lt;br /&gt;
researchers have to provide information about their scientific production to the evaluation&lt;br /&gt;
committee every four years. Why do the researchers or their units have to provide this kind of&lt;br /&gt;
information already available in the repository?&lt;br /&gt;
&lt;br /&gt;
INRA, like any other public research institution, could be willing to transfer data from&lt;br /&gt;
document repository to the application managing the evaluation files of the researchers and&lt;br /&gt;
research units. But, even if the metadata provided from the repository is useful, the evaluation&lt;br /&gt;
process requires new and more complex metadata.&lt;br /&gt;
&lt;br /&gt;
To simplify the study, let’s look at the most fundamental exportable data which are required&lt;br /&gt;
for the “evaluation” application.&lt;br /&gt;
 &lt;br /&gt;
;People: the researcher who has completed his/her PhD thesis in a research institution has to be identified by the evaluation committee as a researcher, a former PhD student and author of a thesis available in the repository. The researcher may have changed his/her name. Identifying the various statuses or names of a person in order to establish correspondences requires enriching the metadata related to persons.&lt;br /&gt;
;Structures: the research unit where the PhD student worked may be different from the unit he is working for later on as a researcher. Both structures are entitled to claim the search results presented in the PhD thesis, the first one as research work financial support and the second one as researcher’s affiliation. The “evaluation” application needs to identify the structure the way it was mentioned in the PhD theses with its equivalent in the institution structural network&lt;br /&gt;
;Partners: the variety of research institutions in France urges to set up a list of scientific partners and to describe the various collaborations. In this way, the PhD student’s enrolment university is mentioned in the thesis. The list of partners and/or the collaboration type will have to be completed.&lt;br /&gt;
&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
INIST experience shows that detecting relationships between research communities appears&lt;br /&gt;
to be a key point for research policy [3]. About PhD theses, the computation of affiliations of&lt;br /&gt;
jury’s members could set up several kinds of interesting indicators, for instance dealing with&lt;br /&gt;
“hidden” research communities.&lt;br /&gt;
&lt;br /&gt;
On a technical point of view, the problem is to extract from metadata several homogeneous&lt;br /&gt;
items, dealing with people or affiliation. It could be easy in a standardised world; but in&lt;br /&gt;
reality, a given institution could appear in a quite large number of different lexical forms.&lt;br /&gt;
In this purpose, authority files and terminological tables play an important part in the&lt;br /&gt;
normalisation of the bibliographic data before being handled in the computational process.&lt;br /&gt;
&lt;br /&gt;
In a first step, the authority files can be used to establish the correspondence between well&lt;br /&gt;
defined items, for example, the names of countries. The technique generally used to establish&lt;br /&gt;
the equivalent terms and normalise the data fields containing data which differ in terms of&lt;br /&gt;
typography (upper case or lower case, etc.) or flexion (plural, singular), is to find a&lt;br /&gt;
convergence to a simpler form, similar to a key with which the given form is associated.&lt;br /&gt;
&lt;br /&gt;
For the majority of indicators, the analytical unit (the object of the study) is a geographical or&lt;br /&gt;
institutional entity. Publications are assigned to these entities on the basis of an analysis of the&lt;br /&gt;
addresses of the authors. Variations in the way the names of countries are written are&lt;br /&gt;
numerically limited. Relating publications to institutions is a much more difficult task which&lt;br /&gt;
cannot be achieved by a simple analysis of the addresses of the authors appearing in the&lt;br /&gt;
publications. Very often, a wide variety of different lexical forms for a given entry is found.&lt;br /&gt;
&lt;br /&gt;
This presupposes the existence of geographical (postcodes, towns, regions, countries) and&lt;br /&gt;
institutional (code for the institution, classification of the organisations by sector, etc.)&lt;br /&gt;
authority files.&lt;br /&gt;
&lt;br /&gt;
As far as we look for a merely statistical indicator, with a medium quality, this kind of post&lt;br /&gt;
process is sufficient. But if we need precise computations, some very complex situations&lt;br /&gt;
could appear.&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Figure 4 illustrates a realistic situation in Nancy geographical area, as it was two years ago.&lt;br /&gt;
This example would just illustrate the complexity of what we could meet in practice. A single&lt;br /&gt;
list of affiliation names is not sufficient and some more sophisticated tools are requested.&lt;br /&gt;
At the upper level, we have two government funded research institutions (CNRS and INRIA)&lt;br /&gt;
and one university (UHP) 36 .&lt;br /&gt;
&lt;br /&gt;
At the medium level (research unit), Loria is a joint unit from UHP, Inria, CNRS and two&lt;br /&gt;
other universities. INRIA Lorraine is the name of the INRIA component in Nancy.&lt;br /&gt;
Some years before, CRIN was the acronym for a joint unit between CNRS and Universities&lt;br /&gt;
but without INRIA. CRIN does not exist at the present time, but a lot of papers or theses are&lt;br /&gt;
indexed by CRIN and must be handled if we need an “historical study”.&lt;br /&gt;
&lt;br /&gt;
At the bottom level (project team), most of the teams, like Cortex, are part of all upper&lt;br /&gt;
organisations. But things could be more complicated! For instance, YT (for Young Team) is&lt;br /&gt;
only recognised by universities and not by INRIA (and thus by Inria Lorraine). Orpailleur is&lt;br /&gt;
getting recognised by INRIA. Oméga is a joint team between Inria Loraine and INRIA Sophia&lt;br /&gt;
but not with Loria.&lt;br /&gt;
&lt;br /&gt;
A consistent metadata schema, such as LEAF[7] one, could offer a solid base which must be&lt;br /&gt;
completed by a strong study of affiliation links. An ARTIST working group intends to work&lt;br /&gt;
on this kind of relationship, by using for instance several links which could be issued from&lt;br /&gt;
something like a “taxonomy of affiliations”.&lt;br /&gt;
&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
This last case study deals with a more thematic aspect of a research policy survey. We have&lt;br /&gt;
chosen to speak about biodiversity which is becoming a strategic issue. For instance, the&lt;br /&gt;
European Commission launched BiodiverSA37 which aims at “setting up efficient trans-&lt;br /&gt;
national co-operation in the field of biodiversity research funding”.&lt;br /&gt;
&lt;br /&gt;
In this section we will study a topic which is not really on the agenda of this project but which&lt;br /&gt;
is considered as very close to its targets: how are distributed the activities of public research&lt;br /&gt;
laboratories with regard to the main axis of biodiverSA members? This information is&lt;br /&gt;
supposed to be contained in the publications and more specifically in the PhD theses.&lt;br /&gt;
&lt;br /&gt;
In order to illustrate the complexity of the problem, here are the figures on R&amp;amp;D biodiversity&lt;br /&gt;
funding in Europe:&lt;br /&gt;
*more than one hundred funding agencies[2];&lt;br /&gt;
*several programs by agency, so several hundreds of programs;&lt;br /&gt;
*several project by program, so several thousands of projects;&lt;br /&gt;
*several results, such as theses, reports or articles by project, so ten thousand publications!&lt;br /&gt;
&lt;br /&gt;
BiodiverSA intends to create an inventory of all existing biodiversity research funding&lt;br /&gt;
programs which will be implemented in a “metadatabase” (on a CERIF38 basis). Vocabulary&lt;br /&gt;
aspects will play a fundamental role. More specifically classification (or taxonomy) tools&lt;br /&gt;
must be used with some computational constraints in order to produce a set of indicators.&lt;br /&gt;
&lt;br /&gt;
The BiodivERsA classification scheme is still being designed and it could be composed of&lt;br /&gt;
three parts.&lt;br /&gt;
&lt;br /&gt;
* A general scientific component based on ASRC (Australian Standard Research Classification). ASRC is tightly related to the Organisation for Economic Cooperation and Development (OECD) Proposed Standard Practice for Surveys of Research and Experimental Development.&lt;br /&gt;
* A specific component dedicated to biodiversity, built with a combination of several existing classifications;&lt;br /&gt;
* A complementary indexation based on keywords extracted from the “CBD Controlled Vocabulary”39 .&lt;br /&gt;
&lt;br /&gt;
In this context, how could we handle the main topic of this study and, for example, how build&lt;br /&gt;
an indicator based on PhD theses?&lt;br /&gt;
A first problem is to feed a CERIF compliant database which could be used by BiodivERsA&lt;br /&gt;
with something close to qualified Dublin Core. But the most important issue is the mapping of&lt;br /&gt;
the resources in the classification system. We could imagine that a few research laboratories&lt;br /&gt;
will use this classification system in order to be visible by funding agencies. In this case, the&lt;br /&gt;
indexer needs also to be cautious with the future computational usage of its elements. (This is&lt;br /&gt;
not the same that archiving or making browsing easier).&lt;br /&gt;
&lt;br /&gt;
But a very large amount of theses related to some particular aspect of biodiversity will not use&lt;br /&gt;
this schema and several terminological adaptations will be requested. They could be quite&lt;br /&gt;
easy if the theses are indexed with a well known vocabulary (MeSH for instance).&lt;br /&gt;
&lt;br /&gt;
In the other cases, a document content linguistic analysis should be done. Once again, several&lt;br /&gt;
vocabulary oriented resources are needed, and this last sample would illustrate the need of&lt;br /&gt;
complementary terminological repositories.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1792</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1792"/>
		<updated>2024-03-30T17:21:44Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Institutional surveys */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
&lt;br /&gt;
One of INIST’s departments is specialised in building thematic portal or handling statistical&lt;br /&gt;
studies dealing with research policies (Figure 2). This entity is more and more implied in&lt;br /&gt;
defining institutional indicators, bringing INIST, like CCSD, to improve the quality of&lt;br /&gt;
metadata related to relationships between authors and affiliations.&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
We have just introduced a set of operators which seems to offer a complete set of library&lt;br /&gt;
oriented services. But, for historical reasons, they had been created quite independently. Thus&lt;br /&gt;
the reality could be “less than perfect”. This paper is written by several people, coming from&lt;br /&gt;
these organisations, who have realized that interoperability was an important issue, and who&lt;br /&gt;
are working on exchange formats, generally based on qualified Dublin Core. Is this approach&lt;br /&gt;
sufficient?&lt;br /&gt;
&lt;br /&gt;
We will now present several case-studies in which we go beyond the basic bibliographic&lt;br /&gt;
needs (deposit and retrieval) in order to introduce some research policy oriented needs.&lt;br /&gt;
&lt;br /&gt;
In the first case we will analyse the handling of a PhD thesis from the start until its&lt;br /&gt;
accessibility via OAI-PMH. The “previous designed” workflow shown in Figure 3 looks&lt;br /&gt;
simple: a thesis is managed by Cyberthèses, and then sent to STAR, and at last to CCSD to be&lt;br /&gt;
integrated within articles flow. We will consider a situation in which two initial organisations,&lt;br /&gt;
university and research institution (EPST, Etablissements Publics d’Enseignement et de&lt;br /&gt;
Recherche) are concerned.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Then we will study two cases of using metadata for doing institutional surveys or creating&lt;br /&gt;
thematic portals. We will suppose that metadata could have been indexed or upgraded by a&lt;br /&gt;
documentary centre such as INIST&lt;br /&gt;
&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
As mentioned before, French research institutions encourage researchers to deposit their work&lt;br /&gt;
in an open-access repository for greater visibility and added value of their scientific&lt;br /&gt;
production. These repositories include various types of documents or data: some published&lt;br /&gt;
articles but also expertises, reports, courses, lecture notes, conference papers, thesis, software&lt;br /&gt;
documentation or primary data (demographical for instance).&lt;br /&gt;
&lt;br /&gt;
This repository is a significant and valuable source of information for the evaluation of&lt;br /&gt;
researchers and their research units based on their scientific production. For instance,&lt;br /&gt;
researchers have to provide information about their scientific production to the evaluation&lt;br /&gt;
committee every four years. Why do the researchers or their units have to provide this kind of&lt;br /&gt;
information already available in the repository?&lt;br /&gt;
&lt;br /&gt;
INRA, like any other public research institution, could be willing to transfer data from&lt;br /&gt;
document repository to the application managing the evaluation files of the researchers and&lt;br /&gt;
research units. But, even if the metadata provided from the repository is useful, the evaluation&lt;br /&gt;
process requires new and more complex metadata.&lt;br /&gt;
&lt;br /&gt;
To simplify the study, let’s look at the most fundamental exportable data which are required&lt;br /&gt;
for the “evaluation” application.&lt;br /&gt;
 &lt;br /&gt;
;People: the researcher who has completed his/her PhD thesis in a research institution has to be identified by the evaluation committee as a researcher, a former PhD student and author of a thesis available in the repository. The researcher may have changed his/her name. Identifying the various statuses or names of a person in order to establish correspondences requires enriching the metadata related to persons.&lt;br /&gt;
;Structures: the research unit where the PhD student worked may be different from the unit he is working for later on as a researcher. Both structures are entitled to claim the search results presented in the PhD thesis, the first one as research work financial support and the second one as researcher’s affiliation. The “evaluation” application needs to identify the structure the way it was mentioned in the PhD theses with its equivalent in the institution structural network&lt;br /&gt;
;Partners: the variety of research institutions in France urges to set up a list of scientific partners and to describe the various collaborations. In this way, the PhD student’s enrolment university is mentioned in the thesis. The list of partners and/or the collaboration type will have to be completed.&lt;br /&gt;
&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
INIST experience shows that detecting relationships between research communities appears&lt;br /&gt;
to be a key point for research policy [3]. About PhD theses, the computation of affiliations of&lt;br /&gt;
jury’s members could set up several kinds of interesting indicators, for instance dealing with&lt;br /&gt;
“hidden” research communities.&lt;br /&gt;
&lt;br /&gt;
On a technical point of view, the problem is to extract from metadata several homogeneous&lt;br /&gt;
items, dealing with people or affiliation. It could be easy in a standardised world; but in&lt;br /&gt;
reality, a given institution could appear in a quite large number of different lexical forms.&lt;br /&gt;
In this purpose, authority files and terminological tables play an important part in the&lt;br /&gt;
normalisation of the bibliographic data before being handled in the computational process.&lt;br /&gt;
&lt;br /&gt;
In a first step, the authority files can be used to establish the correspondence between well&lt;br /&gt;
defined items, for example, the names of countries. The technique generally used to establish&lt;br /&gt;
the equivalent terms and normalise the data fields containing data which differ in terms of&lt;br /&gt;
typography (upper case or lower case, etc.) or flexion (plural, singular), is to find a&lt;br /&gt;
convergence to a simpler form, similar to a key with which the given form is associated.&lt;br /&gt;
&lt;br /&gt;
For the majority of indicators, the analytical unit (the object of the study) is a geographical or&lt;br /&gt;
institutional entity. Publications are assigned to these entities on the basis of an analysis of the&lt;br /&gt;
addresses of the authors. Variations in the way the names of countries are written are&lt;br /&gt;
numerically limited. Relating publications to institutions is a much more difficult task which&lt;br /&gt;
cannot be achieved by a simple analysis of the addresses of the authors appearing in the&lt;br /&gt;
publications. Very often, a wide variety of different lexical forms for a given entry is found.&lt;br /&gt;
&lt;br /&gt;
This presupposes the existence of geographical (postcodes, towns, regions, countries) and&lt;br /&gt;
institutional (code for the institution, classification of the organisations by sector, etc.)&lt;br /&gt;
authority files.&lt;br /&gt;
&lt;br /&gt;
As far as we look for a merely statistical indicator, with a medium quality, this kind of post&lt;br /&gt;
process is sufficient. But if we need precise computations, some very complex situations&lt;br /&gt;
could appear.&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Figure 4 illustrates a realistic situation in Nancy geographical area, as it was two years ago.&lt;br /&gt;
This example would just illustrate the complexity of what we could meet in practice. A single&lt;br /&gt;
list of affiliation names is not sufficient and some more sophisticated tools are requested.&lt;br /&gt;
At the upper level, we have two government funded research institutions (CNRS and INRIA)&lt;br /&gt;
and one university (UHP) 36 .&lt;br /&gt;
&lt;br /&gt;
At the medium level (research unit), Loria is a joint unit from UHP, Inria, CNRS and two&lt;br /&gt;
other universities. INRIA Lorraine is the name of the INRIA component in Nancy.&lt;br /&gt;
Some years before, CRIN was the acronym for a joint unit between CNRS and Universities&lt;br /&gt;
but without INRIA. CRIN does not exist at the present time, but a lot of papers or theses are&lt;br /&gt;
indexed by CRIN and must be handled if we need an “historical study”.&lt;br /&gt;
&lt;br /&gt;
At the bottom level (project team), most of the teams, like Cortex, are part of all upper&lt;br /&gt;
organisations. But things could be more complicated! For instance, YT (for Young Team) is&lt;br /&gt;
only recognised by universities and not by INRIA (and thus by Inria Lorraine). Orpailleur is&lt;br /&gt;
getting recognised by INRIA. Oméga is a joint team between Inria Loraine and INRIA Sophia&lt;br /&gt;
but not with Loria.&lt;br /&gt;
&lt;br /&gt;
A consistent metadata schema, such as LEAF[7] one, could offer a solid base which must be&lt;br /&gt;
completed by a strong study of affiliation links. An ARTIST working group intends to work&lt;br /&gt;
on this kind of relationship, by using for instance several links which could be issued from&lt;br /&gt;
something like a “taxonomy of affiliations”.&lt;br /&gt;
&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1791</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1791"/>
		<updated>2024-03-30T17:19:19Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Creating metadata: thinking about reusability */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
&lt;br /&gt;
One of INIST’s departments is specialised in building thematic portal or handling statistical&lt;br /&gt;
studies dealing with research policies (Figure 2). This entity is more and more implied in&lt;br /&gt;
defining institutional indicators, bringing INIST, like CCSD, to improve the quality of&lt;br /&gt;
metadata related to relationships between authors and affiliations.&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
We have just introduced a set of operators which seems to offer a complete set of library&lt;br /&gt;
oriented services. But, for historical reasons, they had been created quite independently. Thus&lt;br /&gt;
the reality could be “less than perfect”. This paper is written by several people, coming from&lt;br /&gt;
these organisations, who have realized that interoperability was an important issue, and who&lt;br /&gt;
are working on exchange formats, generally based on qualified Dublin Core. Is this approach&lt;br /&gt;
sufficient?&lt;br /&gt;
&lt;br /&gt;
We will now present several case-studies in which we go beyond the basic bibliographic&lt;br /&gt;
needs (deposit and retrieval) in order to introduce some research policy oriented needs.&lt;br /&gt;
&lt;br /&gt;
In the first case we will analyse the handling of a PhD thesis from the start until its&lt;br /&gt;
accessibility via OAI-PMH. The “previous designed” workflow shown in Figure 3 looks&lt;br /&gt;
simple: a thesis is managed by Cyberthèses, and then sent to STAR, and at last to CCSD to be&lt;br /&gt;
integrated within articles flow. We will consider a situation in which two initial organisations,&lt;br /&gt;
university and research institution (EPST, Etablissements Publics d’Enseignement et de&lt;br /&gt;
Recherche) are concerned.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Then we will study two cases of using metadata for doing institutional surveys or creating&lt;br /&gt;
thematic portals. We will suppose that metadata could have been indexed or upgraded by a&lt;br /&gt;
documentary centre such as INIST&lt;br /&gt;
&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
As mentioned before, French research institutions encourage researchers to deposit their work&lt;br /&gt;
in an open-access repository for greater visibility and added value of their scientific&lt;br /&gt;
production. These repositories include various types of documents or data: some published&lt;br /&gt;
articles but also expertises, reports, courses, lecture notes, conference papers, thesis, software&lt;br /&gt;
documentation or primary data (demographical for instance).&lt;br /&gt;
&lt;br /&gt;
This repository is a significant and valuable source of information for the evaluation of&lt;br /&gt;
researchers and their research units based on their scientific production. For instance,&lt;br /&gt;
researchers have to provide information about their scientific production to the evaluation&lt;br /&gt;
committee every four years. Why do the researchers or their units have to provide this kind of&lt;br /&gt;
information already available in the repository?&lt;br /&gt;
&lt;br /&gt;
INRA, like any other public research institution, could be willing to transfer data from&lt;br /&gt;
document repository to the application managing the evaluation files of the researchers and&lt;br /&gt;
research units. But, even if the metadata provided from the repository is useful, the evaluation&lt;br /&gt;
process requires new and more complex metadata.&lt;br /&gt;
&lt;br /&gt;
To simplify the study, let’s look at the most fundamental exportable data which are required&lt;br /&gt;
for the “evaluation” application.&lt;br /&gt;
 &lt;br /&gt;
;People: the researcher who has completed his/her PhD thesis in a research institution has to be identified by the evaluation committee as a researcher, a former PhD student and author of a thesis available in the repository. The researcher may have changed his/her name. Identifying the various statuses or names of a person in order to establish correspondences requires enriching the metadata related to persons.&lt;br /&gt;
;Structures: the research unit where the PhD student worked may be different from the unit he is working for later on as a researcher. Both structures are entitled to claim the search results presented in the PhD thesis, the first one as research work financial support and the second one as researcher’s affiliation. The “evaluation” application needs to identify the structure the way it was mentioned in the PhD theses with its equivalent in the institution structural network&lt;br /&gt;
;Partners: the variety of research institutions in France urges to set up a list of scientific partners and to describe the various collaborations. In this way, the PhD student’s enrolment university is mentioned in the thesis. The list of partners and/or the collaboration type will have to be completed.&lt;br /&gt;
&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ....&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1790</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1790"/>
		<updated>2024-03-30T17:16:22Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Three case-studies */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
&lt;br /&gt;
One of INIST’s departments is specialised in building thematic portal or handling statistical&lt;br /&gt;
studies dealing with research policies (Figure 2). This entity is more and more implied in&lt;br /&gt;
defining institutional indicators, bringing INIST, like CCSD, to improve the quality of&lt;br /&gt;
metadata related to relationships between authors and affiliations.&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
We have just introduced a set of operators which seems to offer a complete set of library&lt;br /&gt;
oriented services. But, for historical reasons, they had been created quite independently. Thus&lt;br /&gt;
the reality could be “less than perfect”. This paper is written by several people, coming from&lt;br /&gt;
these organisations, who have realized that interoperability was an important issue, and who&lt;br /&gt;
are working on exchange formats, generally based on qualified Dublin Core. Is this approach&lt;br /&gt;
sufficient?&lt;br /&gt;
&lt;br /&gt;
We will now present several case-studies in which we go beyond the basic bibliographic&lt;br /&gt;
needs (deposit and retrieval) in order to introduce some research policy oriented needs.&lt;br /&gt;
&lt;br /&gt;
In the first case we will analyse the handling of a PhD thesis from the start until its&lt;br /&gt;
accessibility via OAI-PMH. The “previous designed” workflow shown in Figure 3 looks&lt;br /&gt;
simple: a thesis is managed by Cyberthèses, and then sent to STAR, and at last to CCSD to be&lt;br /&gt;
integrated within articles flow. We will consider a situation in which two initial organisations,&lt;br /&gt;
university and research institution (EPST, Etablissements Publics d’Enseignement et de&lt;br /&gt;
Recherche) are concerned.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
&lt;br /&gt;
Then we will study two cases of using metadata for doing institutional surveys or creating&lt;br /&gt;
thematic portals. We will suppose that metadata could have been indexed or upgraded by a&lt;br /&gt;
documentary centre such as INIST&lt;br /&gt;
&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
 ...&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ....&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1789</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1789"/>
		<updated>2024-03-30T17:15:08Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* INIST: Metadata homogenisation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
&lt;br /&gt;
One of INIST’s departments is specialised in building thematic portal or handling statistical&lt;br /&gt;
studies dealing with research policies (Figure 2). This entity is more and more implied in&lt;br /&gt;
defining institutional indicators, bringing INIST, like CCSD, to improve the quality of&lt;br /&gt;
metadata related to relationships between authors and affiliations.&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ...&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
 ...&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ....&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1788</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1788"/>
		<updated>2024-03-30T17:13:47Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* STAR: logistic intermediary between local actors and wider actors. */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
''TEF (Thèses Electroniques Françaises)'' is a recommendation provided by an AFNOR33&lt;br /&gt;
working group (AFNOR CG46/CN357/GE5). It aims at offering a coherent and flexible&lt;br /&gt;
organisation for rich and normalised theses metadata: bibliographic metadata (DC), rights&lt;br /&gt;
metadata (METS Rights), administrative metadata relative to the diploma and preservation&lt;br /&gt;
metadata. Within TEF, FRBR 34 (Functional Requirements for Bibliographic Records) model&lt;br /&gt;
is used as a conceptual tool to untangle the notion of theses, METS as an XML wrapping to&lt;br /&gt;
bind the various metadata modules, Schematron35 as a precise and flexible validation tool to&lt;br /&gt;
enforce the business rules that come from the French context.&lt;br /&gt;
&lt;br /&gt;
STAR, as a tool, like TEF, as a data structure, plays as a go-between for the benefit of those&lt;br /&gt;
that produce and authenticate the theses and their metadata as well as for those that make use&lt;br /&gt;
of them.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ...&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
 ...&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ....&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1787</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1787"/>
		<updated>2024-03-30T17:12:44Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* STAR: logistic intermediary between local actors and wider actors. */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
&lt;br /&gt;
In the output process, the digital theses will be delivered to a national digital preservation&lt;br /&gt;
system which is handled by CINES (Centre Informatique National de l'Enseignement&lt;br /&gt;
Supérieur)31 . In addition, metadata will be converted to UNIMARC in order to be sent to&lt;br /&gt;
Sudoc union catalogue which hosts the theses national bibliography.&lt;br /&gt;
&lt;br /&gt;
Several complementary services (figure 1) will be offered to institutions of PhD defence:&lt;br /&gt;
* Sending to CCSD/HAL and other bibliographic databases;&lt;br /&gt;
* Full text indexing in SUDOC 32 (Système Universitaire de DOCumentation) academic portal;&lt;br /&gt;
* Building a permanent identifier (URI) and resolution for guaranteeing access in any location to a valid copy of the thesis.&lt;br /&gt;
Thus, through a unique deposit, a local institution will be able to provide long term&lt;br /&gt;
preservation and dissemination by many channels, with a high level of traceability in both&lt;br /&gt;
scientific and administrative aspects.&lt;br /&gt;
&lt;br /&gt;
STAR does not claim to dispense with specific tools or workflows set up by universities. It is&lt;br /&gt;
true that STAR will offer a web interface to those universities that don't possess any local&lt;br /&gt;
ETDs management tool. For the others, STAR will ingest locally generated metadata and&lt;br /&gt;
document files. These metadata will comply with the French exchange format TEF.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ...&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
 ...&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ....&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1786</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1786"/>
		<updated>2024-03-30T17:09:22Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Three case-studies */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 3.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ...&lt;br /&gt;
====Creating metadata: thinking about reusability====&lt;br /&gt;
 ...&lt;br /&gt;
====Institutional surveys====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 4.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ....&lt;br /&gt;
====Thematic survey about biodiversity====&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1785</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1785"/>
		<updated>2024-03-30T17:06:07Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Three case-studies */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
 ...&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|400px|thumb|Figure 3]]&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1784</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1784"/>
		<updated>2024-03-30T17:04:40Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* INIST: Metadata homogenisation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 2.jpg|center|300px|thumb|Figure 2]]&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=File:Wicri_DC_2006_fig_4.jpg&amp;diff=1783</id>
		<title>File:Wicri DC 2006 fig 4.jpg</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=File:Wicri_DC_2006_fig_4.jpg&amp;diff=1783"/>
		<updated>2024-03-30T17:01:55Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=File:Wicri_DC_2006_fig_3.jpg&amp;diff=1782</id>
		<title>File:Wicri DC 2006 fig 3.jpg</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=File:Wicri_DC_2006_fig_3.jpg&amp;diff=1782"/>
		<updated>2024-03-30T17:01:24Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=File:Wicri_DC_2006_fig_2.jpg&amp;diff=1781</id>
		<title>File:Wicri DC 2006 fig 2.jpg</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=File:Wicri_DC_2006_fig_2.jpg&amp;diff=1781"/>
		<updated>2024-03-30T17:00:58Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1780</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1780"/>
		<updated>2024-03-30T16:54:02Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* CCSD: open archive with institutional views */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's &amp;lt;ref&amp;gt; HAL stands for “hyper article en ligne” &amp;lt; http://hal.ccsd.cnrs.fr/?langue=en &amp;gt; &amp;lt;/ref&amp;gt; software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1779</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1779"/>
		<updated>2024-03-30T16:52:51Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* CCSD: open archive with institutional views */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
CCSD stands for “Centre de la Communication Scientifique Directe” and aims at promoting&lt;br /&gt;
direct scientific communication between researchers. Very close to ArXiv’s philosophy,&lt;br /&gt;
HAL's 29 software provides an interface for authors to upload into the CCSD database their&lt;br /&gt;
manuscripts of scholarly articles in all fields. Most of the French research organisations have&lt;br /&gt;
set up a global agreement for a common cooperation based on HAL which can offer an&lt;br /&gt;
institutional view for any participant.&lt;br /&gt;
&lt;br /&gt;
A specific service called TEL (thèses-EN-ligne) is dedicated to facilitating the self archiving&lt;br /&gt;
of thesis manuscripts, which are important documents for direct scientific communication&lt;br /&gt;
between scientists. TEL can be harvested through the OAI-PHM protocol and two metadata&lt;br /&gt;
formats are available: unqualified Dublin Core30 , and a specific CCSD one.&lt;br /&gt;
A particular feature of this format deals with formal and precise relationships between authors&lt;br /&gt;
and affiliations which are clearly identified in deposit procedure. This facility allows the&lt;br /&gt;
institutional views and illustrates the two main goals of CCSD: open archive with a research&lt;br /&gt;
management orientation.&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1778</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1778"/>
		<updated>2024-03-30T16:51:55Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Cyberthèses */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
Cyberthèses was born within a francophone program which was also extended to South&lt;br /&gt;
America. Cyberdocs, its related platform, is an open source software which supports an&lt;br /&gt;
assembly line starting from document writing to dissemination and archiving.&lt;br /&gt;
&lt;br /&gt;
The main members of Cyberthèses network in the Francophony are the following:&lt;br /&gt;
“Universidad de Chile” in Santiago27 , “Université de Dakar” (Senegal), “Université&lt;br /&gt;
d’Antananarivo” (Madagascar) and the National Institute of Agronomy of Algiers [1].&lt;br /&gt;
&lt;br /&gt;
In the Cyberthèses project each university is in charge of the conversion of its theses and&lt;br /&gt;
dissertations into an archiving format (e.g. TEI-lite in XML). At “Université de Lyon 2”, the&lt;br /&gt;
electronic registration and deposit are now included in the &amp;quot;charte des thèses&amp;quot; which defines&lt;br /&gt;
the relationship between the student and the institution. The deposit of a complete electronic&lt;br /&gt;
version of the dissertation is compulsory. The registration is still done by administration, but a&lt;br /&gt;
workflow software tool was developed which handles the actual deposit and the electronic&lt;br /&gt;
management of the document and its metadata (DC28 , ETDMS, OAI-PMH).&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1777</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1777"/>
		<updated>2024-03-30T16:50:42Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* The French context */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
The French administrative organisation is particularly complex because of the multiplicity of&lt;br /&gt;
complementary administrative frameworks (an example will be given further in section 4.2).&lt;br /&gt;
In the last 10 years, no ambitious program has been launched in France to structure scholarly&lt;br /&gt;
publishing at national level. Public institutions in charge of libraries and scholarly&lt;br /&gt;
communication such as ABES (Association for Libraries in Higher Education)25 and INIST&lt;br /&gt;
(Institute for Scientific and Technical Information)26 have essentially initiated operational&lt;br /&gt;
projects such as an integrated publishing chain from articles deposit to the extraction of key&lt;br /&gt;
indicators for research. Local initiatives are often disconnected from those operations&lt;br /&gt;
launched at national level.&lt;br /&gt;
&lt;br /&gt;
As a result, the focus of operations launched by French actors tends to be too narrow to enable&lt;br /&gt;
the implementation of a digital library for e-research, which would federate scholarly&lt;br /&gt;
communication at national level.&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1776</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1776"/>
		<updated>2024-03-30T16:49:58Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* The francophone context */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
Francophone e-research networks also face both organisational and linguistic challenges. For&lt;br /&gt;
the most part, francophone countries (more than 50 countries over 5 continents) are outside&lt;br /&gt;
Europe. They have extremely different research infrastructures. Several institutions contribute&lt;br /&gt;
in structuring this community. For instance, directly related to theses and dissertations, the&lt;br /&gt;
“Organisation Internationale de la Francophonie” (OIF)20 has funded Cyberthèses (see section&lt;br /&gt;
3.1). Several institutions such as the “Agence universitaire de la Francophonie” (AUF)21 and&lt;br /&gt;
programs related to research infrastructures such as “Système d’Information Scientifique et&lt;br /&gt;
Technique” (SIST)22 are also helping standardising scholarly publishing in the francophone&lt;br /&gt;
area.&lt;br /&gt;
&lt;br /&gt;
Many francophone countries actually use multiple languages. They need to implement&lt;br /&gt;
multilingual systems, with classic constraints in the case of Latin languages and more&lt;br /&gt;
complex ones in the case of the Arabic language for example. The IMIST (Moroccan Institute&lt;br /&gt;
for Scientific and Technical information)23 in Morocco will implement a bilingual union&lt;br /&gt;
catalogue for theses and dissertations24 .&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1775</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1775"/>
		<updated>2024-03-30T16:48:55Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* The European context */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
Nevertheless, the major initiatives to concretely build a framework for scholarly&lt;br /&gt;
communication were launched at national level. The JISC (Joint Information Systems&lt;br /&gt;
Committee) has funded projects such as Thesis Alive!16 and Daedalus 17 to promote the&lt;br /&gt;
electronic publishing of theses and dissertations in the UK and the integration of UK&lt;br /&gt;
institutions in the NDLTD network. SURF (higher education and research partnership&lt;br /&gt;
organisation for network services and information and communications technology) has&lt;br /&gt;
supported DARE (Digital Academic Repositories)18 project to modify the infrastructure of&lt;br /&gt;
provision of academic information in the Netherlands. However, similar initiatives to create&lt;br /&gt;
comprehensive frameworks for publishing scholarly material at national level do not exist in&lt;br /&gt;
all European countries.&lt;br /&gt;
&lt;br /&gt;
The European IST priority on Research Networking (IST 2.5.6) will face the challenge of&lt;br /&gt;
building a framework for publishing scholarly material, at European level. The DRIVER&lt;br /&gt;
project (2006-2008) coordinated by the University of Athens will help provide this necessary&lt;br /&gt;
infrastructure for European research. It will be based on the open infrastructure proposed in&lt;br /&gt;
the scope of the DELOS network of Excellence for digital libraries19 .&lt;br /&gt;
&lt;br /&gt;
In practice, European actors have extremely diverse administrative organisations, inherited&lt;br /&gt;
from the past. Interoperability between national systems will have to deal with the&lt;br /&gt;
heterogeneity of the structures of academic and research entities, their dependencies and&lt;br /&gt;
relations (as detailed below in section 4.2). Additionally, the implementation of a European&lt;br /&gt;
framework for e-research will have to face the challenge of multilingualism, with particular&lt;br /&gt;
impacts on metadata creation and the management of terminologies.&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1774</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1774"/>
		<updated>2024-03-30T16:47:14Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* The European context */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum&amp;lt;ref&amp;gt; &amp;lt; http://www.oaforum.org/ &amp;gt; &amp;lt;/ref&amp;gt; [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
&lt;br /&gt;
The standardisation of the European Research Systems is also supported by the Commission.&lt;br /&gt;
For instance, EuroCRIS &amp;lt;ref&amp;gt; &amp;lt; http://www.eurocris.org/en/ &amp;gt; &amp;lt;/ref&amp;gt; aims at “transforming research information into knowledge” while&lt;br /&gt;
maintaining and publishing the CERIF &amp;lt;ref&amp;gt; &amp;lt; http://www.cordis.lu/cerif/home.html &amp;gt; &amp;lt;/ref&amp;gt; (Common European Research Information Format)&lt;br /&gt;
recommendation&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1773</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1773"/>
		<updated>2024-03-30T16:43:03Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* International context of e-research */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.fcla.edu/dlini/etd.html &amp;gt; &amp;lt;/ref&amp;gt; , or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1772</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1772"/>
		<updated>2024-03-30T16:40:13Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* International context of e-research */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://www.loc.gov/standards/mods/ &amp;gt;&amp;lt;/ref&amp;gt; are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS&amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt; &amp;lt;/nowiki&amp;gt;http://www.tdl.org/projects/metadata/tdlappprofile.pdf &amp;gt;&amp;lt;/ref&amp;gt;. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1771</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1771"/>
		<updated>2024-03-30T16:31:41Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* INIST: Metadata homogenisation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
INIST (INstitut de l’Information Scientifique et Technique) is a documentary centre which&lt;br /&gt;
produces bibliographic databases (Pascal and Francis).&lt;br /&gt;
&lt;br /&gt;
This activity is in permanent evolution. Until fifteen years ago, bibliographic records were&lt;br /&gt;
manually produced in IS0 2709 format. In a first step, an equivalent SGML DTD was used in&lt;br /&gt;
order to modernise the production process. Now INIST aims at metadata homogenisation&lt;br /&gt;
towards a Dublin Core compliant xml schema (Exodic) with automatic indexing.&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1770</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1770"/>
		<updated>2024-03-30T16:28:59Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Conclusion */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “''What Is a Digital Library anyway, anymore''”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
&lt;br /&gt;
But “''what do we really want to do anyway, anymore?''”&lt;br /&gt;
&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
# Compromise between the national environment of theses and the international network.&lt;br /&gt;
# Compromise between the different practices of various actors to ensure reusability of metadata through many applications.&lt;br /&gt;
# Compromise between the needs specific to every kind of users: librarians, informetrics engineers, policy actors, social aspects in networked collaborations (with a particular point about evaluation: indeed the thesis status guarantees a validation process which is the last step of semantic web).&lt;br /&gt;
# Compromise between a focused look on theses and their integration in a larger environment which goes beyond the basic role of a library, even with a “digital” attribute.&lt;br /&gt;
&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1769</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1769"/>
		<updated>2024-03-30T16:26:16Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Conclusion */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
For this new experience (the writing of this paper), after the translation of “What Is a Digital&lt;br /&gt;
Library anyway, anymore”, we have chosen to work again from a quite technical point of&lt;br /&gt;
view. We have identified a large set of stuff40 , such as theses metadata, affiliation links,&lt;br /&gt;
vocabulary items, which could upgrade our services. We have underscored the fundamental&lt;br /&gt;
role of a set of repositories of various items and naming conventions which should complete&lt;br /&gt;
the classical bibliographic archives…&lt;br /&gt;
But “what do we really want to do anyway, anymore?”&lt;br /&gt;
Our common objective is to go further in the e-research or e-science movement and to&lt;br /&gt;
consider scientific and technical information regardless of the global needs of the research&lt;br /&gt;
organisations. As we are working in separate institutions which manage different objectives or&lt;br /&gt;
priorities, this job was not an easy one. Perhaps our most interesting result concerns the&lt;br /&gt;
identification of all compromises that we have to work with:&lt;br /&gt;
1. Compromise between the national environment of theses and the international&lt;br /&gt;
network.&lt;br /&gt;
2. Compromise between the different practices of various actors to ensure reusability of&lt;br /&gt;
metadata through many applications.&lt;br /&gt;
3. Compromise between the needs specific to every kind of users: librarians, informetrics&lt;br /&gt;
engineers, policy actors, social aspects in networked collaborations (with a particular&lt;br /&gt;
point about evaluation: indeed the thesis status guarantees a validation process which&lt;br /&gt;
is the last step of semantic web).&lt;br /&gt;
4. Compromise between a focused look on theses and their integration in a larger&lt;br /&gt;
environment which goes beyond the basic role of a library, even with a “digital”&lt;br /&gt;
attribute.&lt;br /&gt;
In summary, we would consider the theses as nodes within a constellation containing&lt;br /&gt;
“articles, dissertations, affiliations, vocabularies”, but also “patents, projects and numerical&lt;br /&gt;
results”; in other words all components of a CRIS (Current Research Information System) [8].&lt;br /&gt;
Because of a current lack of French or francophone federative research programme, such as&lt;br /&gt;
NSDL, ARTIST is trying to set up a place where field actors could experiment and exchange&lt;br /&gt;
information about new practices in producing Scientific or Technical Information.&lt;br /&gt;
We would like to consider this paper as a step towards a more regular activity. At present step&lt;br /&gt;
ARTIST’s services look like a “collective scientific blog” and now we intend to produce a&lt;br /&gt;
francophone electronic journal with peer review mechanism, “electronic style” and&lt;br /&gt;
sophisticated standardisation. The French language is not to be considered as a “limitation”&lt;br /&gt;
and we think that new concepts must be grown deeper in a native language training area&lt;br /&gt;
before international confrontation.&lt;br /&gt;
In this context, metadata experimentations give us a natural workshop for collaborative&lt;br /&gt;
activities that we intend to carry on in the framework of DCMI&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1768</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1768"/>
		<updated>2024-03-30T16:25:14Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Conclusion */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Acknowledgements===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Only the main contributors are listed as authors of this paper. We would like to acknowledge&lt;br /&gt;
several other people who have contributed by giving some advice or information (Francis&lt;br /&gt;
ANDRE, Catherine MOREL-PAIR, Clotilde ROUSSEL and Pierrette PAILLASSARD from&lt;br /&gt;
INIST; Amos DAVID from LORIA, Daniel CHARNAY from CCSD; Ghalia MRAHI from&lt;br /&gt;
IMIST; Estelle BALIAN from “BiodivErSA – Belgium Biodiversity Platform”, or helping in&lt;br /&gt;
the translation or revising process (Marc RUBIO and Catherine GUNET from INIST).&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1767</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1767"/>
		<updated>2024-03-30T16:23:01Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* Several structuring initiatives */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
 ...&lt;br /&gt;
====INIST: Metadata homogenisation====&lt;br /&gt;
 ...&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Three case-studies===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
===Conclusion===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1766</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1766"/>
		<updated>2024-03-30T16:17:45Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, [[Has author::Jean-Paul Ducasse]], [[Has author::Muriel Foulonneau]], [[Has author::Luc Grivel]], [[Has author::Diane Le Hénaff]], [[Has author::Yann Nicolas]]&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1765</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1765"/>
		<updated>2024-03-30T15:12:15Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, Jean-Paul Ducasse, [[Has author::Muriel Foulonneau]], Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1764</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1764"/>
		<updated>2024-03-30T15:11:04Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;, Jean-Paul Ducasse, [[Has author|Muriel Foulonneau]], Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1763</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1763"/>
		<updated>2024-03-30T15:09:45Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* STAR: logistic intermediary between local actors and wider actors. */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;,, Jean-Paul Ducasse, Muriel Foulonneau, Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
From 2006, the French Ministry of Education, which is responsible for PhD theses&lt;br /&gt;
infrastructure, will ask ABES, its bibliographic agency, to set up STAR (Signalement des&lt;br /&gt;
Thèses, Archivage et Recherche), a new service which will operate as a clearing house.&lt;br /&gt;
&lt;br /&gt;
In the input process, STAR will get theses and related metadata from the institutions entitled&lt;br /&gt;
to guarantee that the given document is true to the original which has been validated by the&lt;br /&gt;
jury.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1762</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1762"/>
		<updated>2024-03-30T15:09:05Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* STAR: logistic intermediary between local actors and wider actors. */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;,, Jean-Paul Ducasse, Muriel Foulonneau, Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
[[File:Wicri DC 2006 fig 1.jpg|center|300px|thumb|Figure 1]]&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=File:Wicri_DC_2006_fig_1.jpg&amp;diff=1761</id>
		<title>File:Wicri DC 2006 fig 1.jpg</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=File:Wicri_DC_2006_fig_1.jpg&amp;diff=1761"/>
		<updated>2024-03-30T15:07:24Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1760</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1760"/>
		<updated>2024-03-29T23:22:41Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;,, Jean-Paul Ducasse, Muriel Foulonneau, Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=Clifford Lynch{{!}}C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In ''D-lib Magazine'', July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1759</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1759"/>
		<updated>2024-03-29T23:20:36Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;,, Jean-Paul Ducasse, Muriel Foulonneau, Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=10&lt;br /&gt;
  |author=C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In D-lib Magazine, July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1758</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1758"/>
		<updated>2024-03-29T23:20:10Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;,, Jean-Paul Ducasse, Muriel Foulonneau, Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=C. Lynch. &lt;br /&gt;
  |text=Where Do We Go From Here? The Next Decade for Digital Libraries. In D-lib Magazine, July 2005&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1757</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1757"/>
		<updated>2024-03-29T23:14:18Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* The European context */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;,, Jean-Paul Ducasse, Muriel Foulonneau, Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
The European IST (Information Society Technologies) program, like the DLI programs in the&lt;br /&gt;
US, has focused on the research dimension of information technologies to create an open&lt;br /&gt;
digital library framework. Several projects, such as the Open Archives Forum13 [11], have&lt;br /&gt;
been funded by the European Commission to raise awareness of national players and to&lt;br /&gt;
investigate the technology issues related to scholarly communication.&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1756</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1756"/>
		<updated>2024-03-29T23:13:27Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* International context of e-research */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;,, Jean-Paul Ducasse, Muriel Foulonneau, Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University &amp;lt;ref&amp;gt;&amp;lt;nowiki&amp;gt;&amp;lt;&amp;lt;/nowiki&amp;gt; http://publications.uu.se/theses/index.xsql?lang=en&amp;gt;&amp;lt;/ref&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1755</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1755"/>
		<updated>2024-03-29T23:11:07Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
[[File:Dcmi-952108464.pdf|200px|right]]&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;,, Jean-Paul Ducasse, Muriel Foulonneau, Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University12).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=File:Dcmi-952108464.pdf&amp;diff=1754</id>
		<title>File:Dcmi-952108464.pdf</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=File:Dcmi-952108464.pdf&amp;diff=1754"/>
		<updated>2024-03-29T23:09:53Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1753</id>
		<title>Metadata towards an e-research cyberinfrastructure: The case of French PhD theses</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Metadata_towards_an_e-research_cyberinfrastructure:_The_case_of_French_PhD_theses&amp;diff=1753"/>
		<updated>2024-03-29T22:58:54Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: /* International context of e-research */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri working area|text=page in building step}}&lt;br /&gt;
&lt;br /&gt;
;Authors:[[Has first creator::Jacques Ducloy]]&amp;lt;sup&amp;gt;(i)&amp;lt;/sup&amp;gt;,, Jean-Paul Ducasse, Muriel Foulonneau, Luc Grivel, Diane Le Hénaff, Yann Nicolas&lt;br /&gt;
* (i) INIST / CNRS&lt;br /&gt;
;Abstract: This paper analyses metadata practices and needs in the French research community. It focuses on PhD theses whose life-cycle is totally controlled by the academic institutions. It uses information treatments dealing with setting up research policy as samples for an eresearch orientation. Several case-studies illustrate the fundamental role of various repositories containing affiliations, authorities or linguistic items. ARTIST, the collective author of this paper, is introduced.&lt;br /&gt;
==Archived version==&lt;br /&gt;
===Introduction===&lt;br /&gt;
{{Begin article body}}&lt;br /&gt;
This paper is the result of a collaborative work and was written by a networked team of&lt;br /&gt;
people, engineers or librarians, working in different organisations, in the framework of&lt;br /&gt;
ARTIST&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/ &amp;gt;&amp;lt;/ref&amp;gt; (Appropriation par la Recherche des Technologies de l’Information Scientifique et&lt;br /&gt;
Technique) project. &lt;br /&gt;
Our first experience was based on various contributions on a&lt;br /&gt;
terminological forum, about a translation&amp;lt;ref&amp;gt;&amp;lt; http://artist.inist.fr/article.php3?id_article=245 &amp;gt;&amp;lt;/ref&amp;gt; of “''What Is a Digital Library anyway, anymore''”, a&lt;br /&gt;
paper written by Carl Lagoze, and whose subject deals with the deep structure of a Digital&lt;br /&gt;
Library[{{Article body/Link bibl|9}}]. This paper is a new cooperative experience which would like to analyse how&lt;br /&gt;
metadata could help the French academic community in building a federative Digital Library.&lt;br /&gt;
&lt;br /&gt;
The annual issue of the “Academic Ranking of World Universities” [{{Article body/Link bibl|6}}] is causing discomfort&lt;br /&gt;
in those in charge of setting up research policies. Improving the quality of metadata items&lt;br /&gt;
such as affiliations is now considered as a key issue for improving the visibility of&lt;br /&gt;
universities. The researchers themselves are now permanently looking at impact factor. The&lt;br /&gt;
“publish or perish” notion is now used as a strong incentive for author self-archiving in&lt;br /&gt;
institutional repositories [{{Article body/Link bibl|4}}].&lt;br /&gt;
&lt;br /&gt;
Academic librarian and research communities begin to feel that metadata are not only useful&lt;br /&gt;
for information retrieval but could play a more strategic function. This new way of viewing is&lt;br /&gt;
perhaps a first step towards a more global analysis about the role of scholarly publishing in&lt;br /&gt;
what is called “cyberinfrastructure for e-science or e-research” [{{Article body/Link bibl|10}}].&lt;br /&gt;
&lt;br /&gt;
In this context, this paper will explore how metadata could be used in some activities dealing&lt;br /&gt;
with research policy in a francophone&amp;lt;ref&amp;gt;From the French speaking area&amp;lt;/ref&amp;gt; environment. We have chosen to focus on PhD theses&lt;br /&gt;
because their life-cycle is fully controlled by academic institutions; but a large part of the&lt;br /&gt;
discussion could be applied to all items of scholarly publishing.&lt;br /&gt;
&lt;br /&gt;
We will show that a precise research policy requires sophisticated metadata. In an open&lt;br /&gt;
archiving framework, the most popular among technical solutions, such as DSpace [{{Article body/Link bibl|12}}], or&lt;br /&gt;
Eprints&amp;lt;ref&amp;gt;&amp;lt; http://www.eprints.org/ &amp;gt;&amp;lt;/ref&amp;gt;, do not require a depositor to provide strongly structured metadata. Most&lt;br /&gt;
requirements are limited to a basic set of Dublin Core elements in order to be easily harvested.&lt;br /&gt;
PhD theses are naturally concerned by this goal of improving visibility [{{Article body/Link bibl|5}}]. We will show that&lt;br /&gt;
their initial life-cycle requires that metadata should not be merely descriptive but should&lt;br /&gt;
include some management elements. Indeed, most of the time and more specifically in a&lt;br /&gt;
French context, several institutions or organisations are concerned and must cooperate.&lt;br /&gt;
&lt;br /&gt;
As for all published items of research, theses metadata must be usable in any portal (national,&lt;br /&gt;
international, thematic…) that could increase their visibility. They should also be easily&lt;br /&gt;
handled by informetric tools in order to be picked out in a scientific or strategic watch or for&lt;br /&gt;
research policy oriented studies. At this level we will show that a key issue is the handling of&lt;br /&gt;
vocabularies and affiliations.&lt;br /&gt;
&lt;br /&gt;
In the first part of this paper, we will start by introducing the francophone environment. Then&lt;br /&gt;
we will present several structuring initiatives dealing with PhD thesis production, union&lt;br /&gt;
catalogues and institutional archives. Finally, we will discuss three case-studies showing&lt;br /&gt;
various aspects of metadata and vocabularies.&lt;br /&gt;
{{Article body/End}}&lt;br /&gt;
&lt;br /&gt;
===Digital libraries for e-research: an overview of European, francophone and French contexts===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
Francophone research institutions must position themselves in relation to a variety of existing&lt;br /&gt;
national and international frameworks.&lt;br /&gt;
&lt;br /&gt;
They take part in international standardisation initiatives. They have to take into account the&lt;br /&gt;
evolution of standards and practices in the United States and worldwide. Additionally, they&lt;br /&gt;
are part of both linguistic and regional networks. France and Belgium for instance are part of&lt;br /&gt;
both Europe and the francophone area (Francophony). Algeria, Morocco and Tunisia are part&lt;br /&gt;
of Francophony as well as of the Arabic language community.&lt;br /&gt;
&lt;br /&gt;
As a result, francophone research actors must coordinate with a number of initiatives in&lt;br /&gt;
multiple areas of cooperation. The metadata strategies adopted for scholarly publishing must&lt;br /&gt;
ensure interoperability of francophone scholarly material in all those networks. They must&lt;br /&gt;
reflect very diverse administrative situations in the different countries as well as in the&lt;br /&gt;
regional and international network infrastructures.&lt;br /&gt;
====International context of e-research====&lt;br /&gt;
The open access movement and the Open Archives Initiative have encouraged research&lt;br /&gt;
institutions to make available theses and dissertations on the Web. On the technical side, the&lt;br /&gt;
Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)&amp;lt;ref&amp;gt;&amp;lt; http://openarchives.org &amp;gt;&amp;lt;/ref&amp;gt; makes it possible to&lt;br /&gt;
share and exchange metadata about scholarly material. This has allowed the creation of an&lt;br /&gt;
open framework for publishing theses and dissertations. They are integrated into open&lt;br /&gt;
repositories and shared in larger networks. In France, this led to the creation of the Centre for&lt;br /&gt;
Direct Scholarly Communication&amp;lt;ref&amp;gt;&amp;lt; http://www.ccsd.cnrs.fr/accueil.php3 ?lang=en &amp;gt;&amp;lt;/ref&amp;gt; (CCSD), a major initiative aiming at reengineering the&lt;br /&gt;
processes of scholarly communication, as illustrated below ([[#CCSD: open archive with institutional views|section 3.2]]).&lt;br /&gt;
&lt;br /&gt;
In the United States, efforts to create an open digital library framework in the scope of the&lt;br /&gt;
Digital Library Initiative DLI-I and DLI-II funded by the National Science Foundation have&lt;br /&gt;
led to such major projects as the National Science Digital Library&amp;lt;ref&amp;gt;&amp;lt; http://www.nsdl.org &amp;gt;&amp;lt;/ref&amp;gt;. NSDL has contributed to&lt;br /&gt;
the promotion of standards and the development of services based on an open architecture for&lt;br /&gt;
digital libraries. The Networked Digital Library of Theses and Dissertations (NDLTD)&amp;lt;ref&amp;gt;&amp;lt;http://www.ndltd.org/index.en.html &amp;gt;&amp;lt;/ref&amp;gt; [13]&lt;br /&gt;
has developed an infrastructure, including processes and workflow for electronic publishing&lt;br /&gt;
of theses and dissertations. It has raised IPR issues related to ETD (electronic theses and&lt;br /&gt;
dissertations) publishing. It has also improved repositories technical interoperability by&lt;br /&gt;
encouraging the use of OAI-PMH and SRU servers. Finally it has improved metadata-related&lt;br /&gt;
interoperability by adopting the ETD metadata set (ETDMS) [4] developed as a Dublin Core&lt;br /&gt;
application profile. ETDMS is notably used in the Cyberthèses project (francophone portal for&lt;br /&gt;
ETD) further described in section 3.1. Alternative metadata formats such as MARC and&lt;br /&gt;
MODS (Metadata Object Description Schema maintained by the Library of Congress)9 are&lt;br /&gt;
also used. The Metadata Working Group of the Texas Digital Library has developed a&lt;br /&gt;
descriptive application profile for electronic theses and dissertations in MODS10. Finally, a&lt;br /&gt;
number of libraries embed descriptive metadata in METS wrappers (e.g. The Florida Center&lt;br /&gt;
for Library Automation11, or Uppsala University12).&lt;br /&gt;
&lt;br /&gt;
The ARTIST project, collective author of the present article, is notably in charge of tracking information on the multiplicity of existing metadata initiatives and their evolution in order to ensure that French and francophone actors benefit from those initiatives. It aims to better coordinate the standardisation efforts in the different networks.&lt;br /&gt;
&lt;br /&gt;
====The European context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The francophone context====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====The French context====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===Several structuring initiatives===&lt;br /&gt;
{{Article body/Begin}}&lt;br /&gt;
====Cyberthèses====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====CCSD: open archive with institutional views====&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
====STAR: logistic intermediary between local actors and wider actors.====&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
{{End article body}}&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 1&lt;br /&gt;
  |text=Y. Bakelli and S. Benrahmoun. Long-term preservation of ETDs in Algeria: discussion&lt;br /&gt;
through the CERIST Deposit system. In Proceedings of ETD2003. Berlin 2003.&lt;br /&gt;
&amp;lt;http://edoc.hu-berlin.de/conferences/etd2003/bakelli-yahia/HTML/bakelli.html &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 2&lt;br /&gt;
  |text= BiodivERsA. Compendium of Biodiversity Research Funding Agencies in Europe&lt;br /&gt;
&amp;lt;http://www.eurobiodiversa.org/rich_files/attachments/Compendium%201%20Feb%202006r&lt;br /&gt;
ev.doc &amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 3&lt;br /&gt;
  |author=Luc Grivel{{!}}L. Grivel&lt;br /&gt;
  |text=H. Fagherazzi, P. Fourneret and A. Zerouki. La conception de bases de&lt;br /&gt;
données infométriques hybrides : analyse de la pratique de trois observatoires européens. In&lt;br /&gt;
Journées SFBA proceedings Ile Rousse 99.&lt;br /&gt;
&amp;lt; http://archivesic.ccsd.cnrs.fr/sic_00000464.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 4&lt;br /&gt;
  |text=S. Harnad. Publish or Perish - Self-Archive to Flourish : The Green Route to Open&lt;br /&gt;
Access. In ERCIM News January 2006&lt;br /&gt;
&amp;lt;http://www.ercim.org/publication/Ercim_News/enw64/harnad.html&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 5&lt;br /&gt;
  |author=Diane Le Hénaff{{!}}D. Le Henaff&lt;br /&gt;
  |text=  and C. Thiolon. Gérer et diffuser des thèses électroniques : un choix&lt;br /&gt;
politique pour un enjeu scientifique. In Documentaliste - Sciences de l’information. 42(4-&lt;br /&gt;
5):272-280. October 2005.}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 6&lt;br /&gt;
  |text= Institute of Higher Education . Academic Ranking of World Universities - Shanghai Jiao&lt;br /&gt;
Tong University, 2005&lt;br /&gt;
&amp;lt; http://ed.sjtu.edu.cn/ranking.htm &amp;gt;}}&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id= 7&lt;br /&gt;
  |text=M. Kaiser. New Ways of Sharing and Using Authority Information. In D-lib Magazine,&lt;br /&gt;
September 2001&lt;br /&gt;
&amp;lt;http://www.dlib.org/dlib/november03/lieder/11lieder.html&amp;gt; }}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=8&lt;br /&gt;
  |text=K. Jeffery. CRIS + open access = the route to research knowledge on the GRID. In 71st IFLA General Conference and Council proceedings, Oslo, Norway, 2005 &amp;lt;http://www.ifla.org/IV/ifla71/papers/007e-Jeffery.pdf&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
{{Article body/Bibl&lt;br /&gt;
  |id=9&lt;br /&gt;
  |author=Carl Lagoze{{!}}C. Lagoze&lt;br /&gt;
  |author 2= D. Krafft&lt;br /&gt;
  |author 3 = S. Payette&lt;br /&gt;
  |author 4 = S. Jesuroga&lt;br /&gt;
  |text=What Is a Digital Library anyway, anymore? In ''D-lib Magazine''. November 2005.&amp;lt;br/&amp;gt;&amp;lt;http://dx.doi.org/10.1045/november2005-lagoze&amp;gt;}}&lt;br /&gt;
&lt;br /&gt;
===Notes===&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
&lt;br /&gt;
: https://dcpapers.dublincore.org/pubs/article/view/846&lt;br /&gt;
&lt;br /&gt;
__SHOWFACTBOX__&lt;br /&gt;
&lt;br /&gt;
[[fr:Métadonnées dans le contexte d’une cyberinfrastructure de la Recherche]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Main_Page&amp;diff=1752</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Main_Page&amp;diff=1752"/>
		<updated>2021-01-05T15:06:47Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri box&lt;br /&gt;
 |text=&amp;lt;center&amp;gt;'''&amp;lt;big&amp;gt;Welcome on Wicri/Information &amp;amp; Communication Sciences&amp;lt;/big&amp;gt;&amp;lt;br/&amp;gt;'''See also: [[Artist.en:Main Page|Artist]]&amp;lt;/center&amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
{|width=100%&lt;br /&gt;
|-&lt;br /&gt;
|[[Image:604px-Julia set 10th power.png|left|140px|:fr:Fractale|Fractale de Julia]]&lt;br /&gt;
|&amp;lt;big&amp;gt;'''TICRI'''&amp;lt;/big&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;big&amp;gt;'''T'''&amp;lt;/big&amp;gt; Technologies  for&lt;br /&gt;
* &amp;lt;big&amp;gt;'''I'''&amp;lt;/big&amp;gt;  Information used by&lt;br /&gt;
* &amp;lt;big&amp;gt;'''C'''&amp;lt;/big&amp;gt; Communities in&lt;br /&gt;
* &amp;lt;big&amp;gt;'''R'''&amp;lt;/big&amp;gt;  Research&lt;br /&gt;
* &amp;lt;big&amp;gt;'''I'''&amp;lt;/big&amp;gt;  and Innovation&lt;br /&gt;
|&lt;br /&gt;
{|style=&amp;quot;background:#ccccff;-moz-border-radius:0.8em; -webkit-border-radius:0.5em;&amp;quot; align=right&lt;br /&gt;
|-&lt;br /&gt;
|width=200px|{{Wicri global map&lt;br /&gt;
 |map size=280&lt;br /&gt;
 |size 1 = 32&lt;br /&gt;
 |size 2 = 26&lt;br /&gt;
 |size 3 = 24&lt;br /&gt;
 |size 4 = 22&lt;br /&gt;
 |size 5 = 18&lt;br /&gt;
}}&lt;br /&gt;
|-&lt;br /&gt;
|width=200px|{{Wicri network navbox}}&lt;br /&gt;
|}&lt;br /&gt;
|}&lt;br /&gt;
{{clr}}&lt;br /&gt;
{{Begin 2 columns}}&lt;br /&gt;
{{H2 Wicri|The wiki Wicri/Ticri (en)}}&lt;br /&gt;
{{To French box&lt;br /&gt;
 |text=This wiki is in the first stage of its building process. To enter the  [[wicri.fr:Accueil|wiki Ticri in French]].&lt;br /&gt;
}}&lt;br /&gt;
{{Skip 2 columns}}&lt;br /&gt;
{{DCMI Information}}&lt;br /&gt;
{{End 2 columns}}&lt;br /&gt;
&lt;br /&gt;
At this time, this wiki is only dedicated to French speaking communities working on this area in order to define an ontology.&lt;br /&gt;
&lt;br /&gt;
==Work in progress==&lt;br /&gt;
&lt;br /&gt;
Implementing semantic model by an adaptation of [[semweb:Template:Event|Semantic Web]].&lt;br /&gt;
&lt;br /&gt;
* [[Research events]]&lt;br /&gt;
* [[Scientific journals]]&lt;br /&gt;
* [[Digital information actors]] &lt;br /&gt;
&lt;br /&gt;
[[fr:Accueil]]&lt;br /&gt;
&lt;br /&gt;
==Setting up the wiki==&lt;br /&gt;
* [[Projects:Setting up the wiki]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
	<entry>
		<id>https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Main_Page&amp;diff=1751</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://wicri-demo.istex.fr/Wicri/Sic/en/index.php?title=Main_Page&amp;diff=1751"/>
		<updated>2021-01-05T15:06:00Z</updated>

		<summary type="html">&lt;p&gt;Jacques Ducloy: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Wicri box&lt;br /&gt;
 |text=&amp;lt;center&amp;gt;'''&amp;lt;big&amp;gt;Welcome on Wicri/Open Science&amp;lt;/big&amp;gt;&amp;lt;br/&amp;gt;'''See also: [[Artist.en:Main Page|Artist]]&amp;lt;/center&amp;gt;&lt;br /&gt;
}}&lt;br /&gt;
{|width=100%&lt;br /&gt;
|-&lt;br /&gt;
|[[Image:604px-Julia set 10th power.png|left|140px|:fr:Fractale|Fractale de Julia]]&lt;br /&gt;
|&amp;lt;big&amp;gt;'''TICRI'''&amp;lt;/big&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;big&amp;gt;'''T'''&amp;lt;/big&amp;gt; Technologies  for&lt;br /&gt;
* &amp;lt;big&amp;gt;'''I'''&amp;lt;/big&amp;gt;  Information used by&lt;br /&gt;
* &amp;lt;big&amp;gt;'''C'''&amp;lt;/big&amp;gt; Communities in&lt;br /&gt;
* &amp;lt;big&amp;gt;'''R'''&amp;lt;/big&amp;gt;  Research&lt;br /&gt;
* &amp;lt;big&amp;gt;'''I'''&amp;lt;/big&amp;gt;  and Innovation&lt;br /&gt;
|&lt;br /&gt;
{|style=&amp;quot;background:#ccccff;-moz-border-radius:0.8em; -webkit-border-radius:0.5em;&amp;quot; align=right&lt;br /&gt;
|-&lt;br /&gt;
|width=200px|{{Wicri global map&lt;br /&gt;
 |map size=280&lt;br /&gt;
 |size 1 = 32&lt;br /&gt;
 |size 2 = 26&lt;br /&gt;
 |size 3 = 24&lt;br /&gt;
 |size 4 = 22&lt;br /&gt;
 |size 5 = 18&lt;br /&gt;
}}&lt;br /&gt;
|-&lt;br /&gt;
|width=200px|{{Wicri network navbox}}&lt;br /&gt;
|}&lt;br /&gt;
|}&lt;br /&gt;
{{clr}}&lt;br /&gt;
{{Begin 2 columns}}&lt;br /&gt;
{{H2 Wicri|The wiki Wicri/Ticri (en)}}&lt;br /&gt;
{{To French box&lt;br /&gt;
 |text=This wiki is in the first stage of its building process. To enter the  [[wicri.fr:Accueil|wiki Ticri in French]].&lt;br /&gt;
}}&lt;br /&gt;
{{Skip 2 columns}}&lt;br /&gt;
{{DCMI Information}}&lt;br /&gt;
{{End 2 columns}}&lt;br /&gt;
&lt;br /&gt;
At this time, this wiki is only dedicated to French speaking communities working on this area in order to define an ontology.&lt;br /&gt;
&lt;br /&gt;
==Work in progress==&lt;br /&gt;
&lt;br /&gt;
Implementing semantic model by an adaptation of [[semweb:Template:Event|Semantic Web]].&lt;br /&gt;
&lt;br /&gt;
* [[Research events]]&lt;br /&gt;
* [[Scientific journals]]&lt;br /&gt;
* [[Digital information actors]] &lt;br /&gt;
&lt;br /&gt;
[[fr:Accueil]]&lt;br /&gt;
&lt;br /&gt;
==Setting up the wiki==&lt;br /&gt;
* [[Projects:Setting up the wiki]]&lt;/div&gt;</summary>
		<author><name>Jacques Ducloy</name></author>
		
	</entry>
</feed>