ALIRE (corpus Hypertext)
Organisation
- Corpus : les données de départ
- Tables : les tables de correspondances
- Server : les données Dilib intermédiaire
Site : le site
Compilations
. $DILIB_IMPORT/DilibMake/DilibUpdate.sh
gcc testPageListAuteur.c $DILIB_CC -o testPageListAuteur
gcc testPageAuteur.c $DILIB_CC -o testPageAuteur
gcc testCorpusPageDate.c $DILIB_CC -o testCorpusPageDate
gcc HfdStoreFile.c $DILIB_CC -o HfdStoreFile
gcc testPascal2site.c $DILIB_CC -o testPascal2site
gcc IsoDate2An.c $DILIB_CC -o IsoDate2An
gcc InistRecords2site.c $DILIB_CC -o InistRecords2site
gcc XmlRecordPutKeyNum.c $DILIB_CC -o XmlRecordPutKeyNum
gcc Hal2CorpusDocument.c $DILIB_CC -o Hal2CorpusDocument
gcc Hal2CorpusRecord.c $DILIB_CC -o Hal2CorpusRecord
gcc InistGroupAutAff.c Utf8Converter.c Utf8ConverterName.c Utf8ConverterJumpToAlpha.c $DILIB_CC -o InistGroupAutAff
gcc Hal2CorpusRecord.c Utf8Converter.c Utf8ConverterName.c Utf8ConverterJumpToAlpha.c $DILIB_CC -o Hal2CorpusRecord
lex TableTxt2Tab.lex
gcc lex.yy.c -ll -o TableTxt2Tab
./TableTxt2Tab < PersonneCorpus2Wiki.txt
./TableTxt2Tab < Cote2Pays.txt > Cote2Pays.tab
Création
Pour tester les caractères spéciaux :
PascalServer2xml < Hypertext/Corpus/inistServer.txt | SgmlSelect -g server/ID# -p @g1 -p @1 | PascalSgml2utf8 | DamHfdBuild -h Hypertext/Server/biblio
rm -rf Hypertext/Server/*
(
(
PascalFromIlib < Hypertext/Corpus/inistStandard.txt | ./XmlRecordPutKeyNum -b inist
PascalServer2xml < Hypertext/Corpus/inistServer.txt | ./XmlRecordPutKeyNum -b inist
) | sort | IndexBuildRec \
| SgmlSelect -g idx/l/e/inist -g idx/l/e/server -g idx/k# -p "@g3@g1@g2" \
| PascalSgml2utf8 | ./InistGroupAutAff
./Hal2CorpusDocument < Hypertext/Corpus/hal.xml | XmlUnIndent | ./XmlRecordPutKeyNum -b hal| ./Hal2CorpusRecord
) | DamHfdBuild -h Hypertext/Server/biblio
DamCat Hypertext/Server/biblio.hfd | SgmlSelect -s record/inist/pA/fA11/s1# -p @S -p @1 | sort -u | IndexFastBuildRec | DamHfdBuild -h Hypertext/Server/inist.aut.i
IndexBuildHid -h Hypertext/Server/inist.aut.i -p record/inist/pA/fA11/s1# -e kw
DamCat Hypertext/Server/inist.aut.i.hfd | SgmlFast -c l | SgmlSelect -g idx/f# -p @g1 -p @2 | sort -rn | XmlCut 1 | head -500 > Hypertext/Server/inist.aut.i.sort
DamCat Hypertext/Server/biblio.hfd | SgmlSelect -s record/dc/creator/key# -p @S -p @1 | sort -u | IndexFastBuildRec | DamHfdBuild -h Hypertext/Server/dc.aut.i
IndexBuildHid -h Hypertext/Server/dc.aut.i -p record/dc/creator/key# -e kw
DamCat Hypertext/Server/dc.aut.i.hfd | SgmlFast -c l | SgmlSelect -g idx/f# -p @g1 -p @2 | sort -rn | XmlCut 1 | head -500 > Hypertext/Server/dc.aut.i.sort
DamCat Hypertext/Server/biblio.hfd | SgmlSelect -s record/inist/pA/fC03@l=FRE/s0# -p @S -p @1 | sort -u | IndexFastBuildRec | DamHfdBuild -h Hypertext/Server/inist.mcl.i
IndexBuildHid -h Hypertext/Server/inist.mcl.i -p record/inist/pA/fC03@l=FRE/s0# -e kw
DamCat Hypertext/Server/inist.mcl.i.hfd | SgmlFast -c l | SgmlSelect -g idx/f# -p @g1 -p @2 | sort -rn | XmlCut 1 | head -500 > Hypertext/Server/inist.mcl.i.sort
===== A adapter =====
DamCat Hypertext/Server/biblio.hfd |./InistRecords2site -p Hypertext/param.xml | ./HfdStoreFile
DamCat Hypertext/Server/biblio.hfd | SgmlSelect -g record/inist/pA/fA21/s1# -p @g1 -p @1 |./IsoDate2An | sort | IndexFastBuildRec | DamHfdBuild -h Hypertext/Server/date.i
IndexBuildHid -h Hypertext/Server/date.i -p record/inist/pA/fA21/s1# -e kw
===== A adapter =====
DamCat Dendro/Server/DendroPascal.date.i.hfd | ./testCorpusPageDate | ./HfdStoreFile
IndexBuildUsual -h Hypertext/Server/DendroPascal -k record/inist/pA/fA11/s1# -i aut
DamCat Dendro/Server/DendroPascal.aut.i.hfd | SgmlSelect -g idx/kw# -p @g1 -p @1 > Dendro/Server/DendroPascal.aut2hfd.table
DamCat Dendro/Server/DendroPascal.aut.i.hfd | ./testPageAuteur | ./HfdStoreFile
DamCat Dendro/Server/DendroPascal.aut.i.hfd | SgmlSelect -g idx/f# -p @g1 -p @1 -p @2 | sort -rn | head -50 |./testPageListAuteur |./HfdStoreFile -f Dendro/Site/aut.i.hfd/sort.html
==Archive==
(
PascalFromIlib < Hypertext/Corpus/inistStandard.txt | SgmlSelect -g inist/pA/fA47/s0# -p @g1 -p @1
PascalServer2xml < Hypertext/Corpus/inistServer.txt | SgmlSelect -g server/ID# -p @g1 -p @1
) | sort | IndexBuildRec \
| SgmlSelect -e idx/l/e/inist -e idx/l/e/server -p "@g1@g2" \
| PascalSgml2utf8 | DamHfdBuild -h Hypertext/Server/biblio
XmlSelect -g documents -p @g1 < Hypertext/Corpus/hal.xml | XmlUnIndent | Sgmlselect -s documents/typdoc -p @s1 | SgmlSelect -g typdoc@name@# -s typdoc/document -p "@g1@s1" | ./XmlRecordPutKeyNum -b hal | SgmlSelect -p "@1@2"