idzebra-moved-to-github.git
17 years agooptimized code such that the RecWord structure recword is only
Marc Cromme [Thu, 15 Feb 2007 15:08:41 +0000 (15:08 +0000)]
optimized code such that the RecWord structure recword is only
initialized once for each to-be-indexed record, and not once for each
to-be-indexed term - at the expense of a bit of pointer passing when
recursively transversing the XML DOM tree

17 years agoremoved dead code pieces which are reminisences from the original
Marc Cromme [Thu, 15 Feb 2007 14:44:48 +0000 (14:44 +0000)]
removed dead code pieces which are reminisences from the original
alvis-style parsin and indexing stuff. Now only new dom indexing code
is present.

17 years agopretty formatting warning messages, always giving the file name and
Marc Cromme [Thu, 15 Feb 2007 14:33:41 +0000 (14:33 +0000)]
pretty formatting warning messages, always giving the file name and
the XML node path as informative parameters along

17 years agorewritten mod_dom instruction parsing code hooked into mod_dom indexing
Marc Cromme [Thu, 15 Feb 2007 13:01:00 +0000 (13:01 +0000)]
rewritten mod_dom instruction parsing code hooked into mod_dom indexing
new stylesheets added, one for PI based indexing, and one for <z:index> based indexing
segmentation fault traced and fixed
test framework updated to use new mod_dom parsing

17 years agoadded 'static' declaration to functiondefinitions
Marc Cromme [Wed, 14 Feb 2007 16:43:37 +0000 (16:43 +0000)]
added 'static' declaration to functiondefinitions

17 years agochanging attribute 'action' to 'type' for better confrmance with Alvis
Marc Cromme [Wed, 14 Feb 2007 16:38:41 +0000 (16:38 +0000)]
changing attribute 'action' to 'type' for better confrmance with Alvis
filter syntax

17 years agoindenting entire file according to the rules stated in the very end of
Marc Cromme [Wed, 14 Feb 2007 16:31:37 +0000 (16:31 +0000)]
indenting entire file according to the rules stated in the very end of
the file, using emacs M-x indent-region, and manual line breaking afterwards

17 years agocontinued hooking in tinfo and recctr, still need to do real indexing
Marc Cromme [Wed, 14 Feb 2007 16:16:15 +0000 (16:16 +0000)]
continued hooking in tinfo and recctr, still need to do real indexing

17 years agoremoved wanings by zillions of (const char *) casts and the like
Marc Cromme [Wed, 14 Feb 2007 15:42:24 +0000 (15:42 +0000)]
removed wanings by zillions of (const char *) casts and the like

17 years agoremoved the crappy PI and <z:index> parsing code comitted yesterday
Marc Cromme [Wed, 14 Feb 2007 15:23:33 +0000 (15:23 +0000)]
removed the crappy PI and <z:index> parsing code comitted yesterday
replaced with clean parsing logic developped outside mod_dom.c
needs to take care of all new warnings due to stricter compile flags
finally, needs to be hooked into actual indexing of records

17 years agoremoved unnecessary out-commented code lines
Marc Cromme [Tue, 13 Feb 2007 12:19:37 +0000 (12:19 +0000)]
removed unnecessary out-commented code lines

17 years agofacturized DOM XML indexing code out into function
Marc Cromme [Tue, 13 Feb 2007 11:37:02 +0000 (11:37 +0000)]
facturized DOM XML indexing code out into function
static void extract_doc_alvis(struct filter_info *tinfo,
                              struct recExtractCtrl *recctr,
                              xmlDocPtr doc)
This is the function to be re-written using both PI and <z:index> instructions,
and also fixing the bug of index type 'p' and '0' chop-over of merged content.

17 years agoexperimental processing-instruction based indexing XSLT added
Marc Cromme [Mon, 12 Feb 2007 14:00:20 +0000 (14:00 +0000)]
experimental processing-instruction based indexing XSLT added

17 years agoavoiding unnecesasary unused namespace declarations in output documents
Marc Cromme [Mon, 12 Feb 2007 13:58:12 +0000 (13:58 +0000)]
avoiding unnecesasary unused namespace declarations in output documents

17 years agoadded parsing function 'parse_pi_zebra_20' for processing-instruction parsing and...
Marc Cromme [Mon, 12 Feb 2007 13:24:31 +0000 (13:24 +0000)]
added parsing function 'parse_pi_zebra_20' for processing-instruction parsing and 'format_pi_zebra_err' for error or wanrning formatting. Those are yet not called, and need to be build into the XML parsing in the DOM module.

17 years agoFixed bug #884: Entity declarations in input are lost at retrieval time.
Adam Dickmeiss [Mon, 12 Feb 2007 10:33:50 +0000 (10:33 +0000)]
Fixed bug #884: Entity declarations in input are lost at retrieval time.

17 years agoFixed serious bug in mf_open which made it fail to see an already existing
Adam Dickmeiss [Sat, 10 Feb 2007 18:37:42 +0000 (18:37 +0000)]
Fixed serious bug in mf_open which made it fail to see an already existing
metafile. The bug was introduced in mfile 1.70.

17 years agobuildconf.sh part of dist.
Adam Dickmeiss [Sat, 10 Feb 2007 12:46:54 +0000 (12:46 +0000)]
buildconf.sh part of dist.

17 years agocorrected DEPRECIATED to DEPRECATED
Marc Cromme [Wed, 7 Feb 2007 13:33:17 +0000 (13:33 +0000)]
corrected DEPRECIATED to DEPRECATED

17 years agoadded debian libidzebra-2.0-mod-dom package
Marc Cromme [Wed, 7 Feb 2007 13:19:35 +0000 (13:19 +0000)]
added debian libidzebra-2.0-mod-dom package

17 years agomaking 'dox' target phony
Marc Cromme [Wed, 7 Feb 2007 12:50:13 +0000 (12:50 +0000)]
making 'dox' target phony

17 years agoImplemented new filter 'dom'. See test/xslt/dom-config*xml for examples.
Adam Dickmeiss [Wed, 7 Feb 2007 12:08:54 +0000 (12:08 +0000)]
Implemented new filter 'dom'. See test/xslt/dom-config*xml for examples.
This, like alvis, performs indexing and retrieval using XSLT. But Unlike
alvis, it allows multiple XSLT steps to be performed and does ISO2709

17 years agoThe configuration, fileverboselimit, has a value of 1000. When
Adam Dickmeiss [Tue, 6 Feb 2007 09:34:56 +0000 (09:34 +0000)]
The configuration, fileverboselimit, has a value of 1000. When
reached a message is logged. Bug #845.

17 years agoOmit sort info: bug #844.
Adam Dickmeiss [Tue, 6 Feb 2007 09:33:31 +0000 (09:33 +0000)]
Omit sort info: bug #844.

17 years agoMore compact statistics
Adam Dickmeiss [Tue, 6 Feb 2007 09:32:50 +0000 (09:32 +0000)]
More compact statistics

17 years agodropped section on future directions
Marc Cromme [Mon, 5 Feb 2007 14:32:31 +0000 (14:32 +0000)]
dropped section on future directions

17 years agospll checked
Marc Cromme [Mon, 5 Feb 2007 14:05:26 +0000 (14:05 +0000)]
spll checked

17 years agofromatting of feature tables updated
Marc Cromme [Mon, 5 Feb 2007 14:02:27 +0000 (14:02 +0000)]
fromatting of feature tables updated

17 years agofeature table updated
Marc Cromme [Mon, 5 Feb 2007 13:35:12 +0000 (13:35 +0000)]
feature table updated

17 years agocleaning a bit. more cleaning needed
Marc Cromme [Fri, 2 Feb 2007 14:42:44 +0000 (14:42 +0000)]
cleaning a bit. more cleaning needed

17 years agomore feature info. tables still look like a grande disaster, but the content is there...
Marc Cromme [Fri, 2 Feb 2007 14:34:20 +0000 (14:34 +0000)]
more feature info. tables still look like a grande disaster, but the content is there - more or less. needs pretty formating and tweaking

17 years agoFixed bug in zebrasrv: the default module path and default module path
Adam Dickmeiss [Fri, 2 Feb 2007 13:48:13 +0000 (13:48 +0000)]
Fixed bug in zebrasrv: the default module path and default module path
was not set recognized.

17 years agoUse YAZ_BIB1_SYSTEM_ERROR_IN_PRESENTING_RECORDS everywhere where
Adam Dickmeiss [Fri, 2 Feb 2007 12:16:38 +0000 (12:16 +0000)]
Use YAZ_BIB1_SYSTEM_ERROR_IN_PRESENTING_RECORDS everywhere where
this diagnostic is returned. Put more appropriate addinfo in case
of filter load failure during retrieval.

17 years agoFix DEFAULT_PROFILE_PATH
Adam Dickmeiss [Fri, 2 Feb 2007 12:07:33 +0000 (12:07 +0000)]
Fix DEFAULT_PROFILE_PATH

17 years agoreplaces acronymes in XML text with new defined acronyme entities
Marc Cromme [Fri, 2 Feb 2007 11:10:08 +0000 (11:10 +0000)]
replaces acronymes in XML text with new defined acronyme entities

17 years agoadded acronyme entities
Marc Cromme [Fri, 2 Feb 2007 09:58:39 +0000 (09:58 +0000)]
added acronyme entities

17 years agosome more typos corrected
Marc Cromme [Thu, 1 Feb 2007 21:26:30 +0000 (21:26 +0000)]
some more typos corrected

17 years agocorrected typos
Marc Cromme [Thu, 1 Feb 2007 21:18:53 +0000 (21:18 +0000)]
corrected typos

17 years agoadded Alvis 'XML'
Marc Cromme [Thu, 1 Feb 2007 21:08:52 +0000 (21:08 +0000)]
added Alvis 'XML'

17 years agoplaced Alvis filter module before GRS-1 in arch chapter
Marc Cromme [Thu, 1 Feb 2007 21:08:12 +0000 (21:08 +0000)]
placed Alvis filter module before GRS-1 in arch chapter

17 years agoplacing Alvis filter chapter before GRS-1 filter chapter
Marc Cromme [Thu, 1 Feb 2007 21:04:15 +0000 (21:04 +0000)]
placing Alvis filter chapter before GRS-1 filter chapter

17 years agofirst shot on tabulated feature overview - much needs to be done yet
Marc Cromme [Thu, 1 Feb 2007 20:49:05 +0000 (20:49 +0000)]
first shot on tabulated feature overview - much needs to be done yet

17 years agoNew
Mike Taylor [Wed, 31 Jan 2007 12:26:50 +0000 (12:26 +0000)]
New

17 years agoBump version to 2.0.11
Adam Dickmeiss [Wed, 24 Jan 2007 18:00:39 +0000 (18:00 +0000)]
Bump version to 2.0.11

17 years agoDepend on YAZ 2.1.48 or later
Adam Dickmeiss [Wed, 24 Jan 2007 16:05:25 +0000 (16:05 +0000)]
Depend on YAZ 2.1.48 or later

17 years agoTowards 2.0.10. ZEBRA.2.0.10
Adam Dickmeiss [Wed, 24 Jan 2007 15:23:58 +0000 (15:23 +0000)]
Towards 2.0.10.

17 years agoStaticrank indexing is now an index register type defined in default.idx
Adam Dickmeiss [Mon, 22 Jan 2007 18:15:02 +0000 (18:15 +0000)]
Staticrank indexing is now an index register type defined in default.idx
via directive 'staticrank'. The 'staticrank' directive for grs is no longer
supported (was only implemented for Zebra 2.0.8).

17 years agoNew
Mike Taylor [Mon, 22 Jan 2007 11:02:12 +0000 (11:02 +0000)]
New

17 years agoAvoid full rset count for rset_count. Proper break for result set
Adam Dickmeiss [Wed, 17 Jan 2007 15:35:47 +0000 (15:35 +0000)]
Avoid full rset count for rset_count. Proper break for result set
sort/rank.

17 years agoChange prototype of busyhandler
Adam Dickmeiss [Wed, 17 Jan 2007 13:51:36 +0000 (13:51 +0000)]
Change prototype of busyhandler

17 years agodocumentation updated according to suggestion from zebra list
Marc Cromme [Wed, 17 Jan 2007 13:31:36 +0000 (13:31 +0000)]
documentation updated according to suggestion from zebra list

17 years agoExperiment with check for dead sessions.
Adam Dickmeiss [Wed, 17 Jan 2007 13:22:53 +0000 (13:22 +0000)]
Experiment with check for dead sessions.

17 years agoFor searches, allow truncmax value to be controlled with attribute 13.
Adam Dickmeiss [Wed, 17 Jan 2007 12:59:38 +0000 (12:59 +0000)]
For searches, allow truncmax value to be controlled with attribute 13.
If given, that overrides the value of 'truncmax'.

17 years agoFor truncations being limited (abort at truncmax terms), zebrasrv returns
Adam Dickmeiss [Tue, 16 Jan 2007 15:31:23 +0000 (15:31 +0000)]
For truncations being limited (abort at truncmax terms), zebrasrv returns
SearchResponse with resultSetStatus=subset.

17 years agoFor estimated hit counts, Zebra returns resultSetStatus=estimtate as
Adam Dickmeiss [Tue, 16 Jan 2007 15:01:15 +0000 (15:01 +0000)]
For estimated hit counts, Zebra returns resultSetStatus=estimtate as
part of SearchResponse. Requires YAZ 2.1.47 or later. Bump Zebra
to version 2.0.9.

17 years agoTowards 2.0.8. ZEBRA.2.0.8
Adam Dickmeiss [Mon, 15 Jan 2007 20:14:37 +0000 (20:14 +0000)]
Towards 2.0.8.

17 years agoNoted some changes to NEWS
Adam Dickmeiss [Mon, 15 Jan 2007 20:11:44 +0000 (20:11 +0000)]
Noted some changes to NEWS

17 years ago2007.
Adam Dickmeiss [Mon, 15 Jan 2007 20:08:23 +0000 (20:08 +0000)]
2007.

17 years agoUse dblatex instead pdfjadetex to generate PDF
Adam Dickmeiss [Mon, 15 Jan 2007 20:04:34 +0000 (20:04 +0000)]
Use dblatex instead pdfjadetex to generate PDF

17 years ago2007.
Adam Dickmeiss [Mon, 15 Jan 2007 15:10:13 +0000 (15:10 +0000)]
2007.

17 years agoAdded releaseinfo
Adam Dickmeiss [Mon, 15 Jan 2007 15:01:35 +0000 (15:01 +0000)]
Added releaseinfo

17 years agoProductname from ZEBRA to zebra in refentries
Adam Dickmeiss [Mon, 15 Jan 2007 14:55:50 +0000 (14:55 +0000)]
Productname from ZEBRA to zebra in refentries

17 years agoRename to table ID to unique and proper ID.
Adam Dickmeiss [Mon, 15 Jan 2007 14:51:04 +0000 (14:51 +0000)]
Rename to table ID to unique and proper ID.

17 years agoPut author elements in authorgroup.
Adam Dickmeiss [Mon, 15 Jan 2007 14:48:51 +0000 (14:48 +0000)]
Put author elements in authorgroup.

17 years agoFixed bug #815: Extended Services "delete" requires a dummy "record" to
Adam Dickmeiss [Mon, 15 Jan 2007 13:01:26 +0000 (13:01 +0000)]
Fixed bug #815: Extended Services "delete" requires a dummy "record" to
be set. The fix, surprisingly, is to allow SUTRS for "update". The problem
is that Z39.50 mandates that *some* record to be sent (including the
case where a record is to be deleted by ID). However, ZOOM-C sends a "void"
SUTRS record in this case.. This record is not parsed by the server.

17 years agoSuggest fakeroot
Adam Dickmeiss [Tue, 9 Jan 2007 09:11:22 +0000 (09:11 +0000)]
Suggest fakeroot

17 years agoNull value for res_set makes it do nothing
Adam Dickmeiss [Fri, 5 Jan 2007 16:11:25 +0000 (16:11 +0000)]
Null value for res_set makes it do nothing

17 years agoRemoved zebra_timing_t. Using yaz_timing_t instead. Requires YAZ 2.1.45.
Adam Dickmeiss [Fri, 5 Jan 2007 10:45:11 +0000 (10:45 +0000)]
Removed zebra_timing_t. Using yaz_timing_t instead. Requires YAZ 2.1.45.

17 years agoImplemented bug #781: Easier tracking of result sets. We only do this
Adam Dickmeiss [Fri, 22 Dec 2006 19:47:36 +0000 (19:47 +0000)]
Implemented bug #781: Easier tracking of result sets. We only do this
when mkstemp is available (most moderun Unixes supports this). The PID
is included in filename : tempdir/zrs_PID_XXXXXX.

17 years agoAdded support for specification of staticrank for grs-class of filters.
Adam Dickmeiss [Fri, 22 Dec 2006 13:57:25 +0000 (13:57 +0000)]
Added support for specification of staticrank for grs-class of filters.
This is enabled by using 'staticrank indexname' in .abs. Contents of
indexname (elm ... indexname:w) is then used as value for staticrank.

17 years agoUsing atoizint for staticrank
Adam Dickmeiss [Fri, 22 Dec 2006 13:57:03 +0000 (13:57 +0000)]
Using atoizint for staticrank

17 years agoAdded function atoizint with a atoi/atoll wrapper.
Adam Dickmeiss [Fri, 22 Dec 2006 13:48:20 +0000 (13:48 +0000)]
Added function atoizint with a atoi/atoll wrapper.

17 years agoAdd timing for zebra_start/zebra_stop.
Adam Dickmeiss [Fri, 22 Dec 2006 12:14:25 +0000 (12:14 +0000)]
Add timing for zebra_start/zebra_stop.

17 years agoUse atoll/atoi for ascii to zint conversion in alvis filter.
Adam Dickmeiss [Fri, 22 Dec 2006 12:13:39 +0000 (12:13 +0000)]
Use atoll/atoi for ascii to zint conversion in alvis filter.

17 years agoDo not warn for truncmax limit
Adam Dickmeiss [Wed, 20 Dec 2006 14:29:04 +0000 (14:29 +0000)]
Do not warn for truncmax limit

17 years agoDo not truncate more than at most 'truncmax' terms in one single
Adam Dickmeiss [Wed, 20 Dec 2006 14:19:21 +0000 (14:19 +0000)]
Do not truncate more than at most 'truncmax' terms in one single
term with right-truncation/regular attribute. By default, 'truncmax',
is 10000.

17 years agoFix reference to uninitialized var (bug introduced in previous commit).
Adam Dickmeiss [Wed, 20 Dec 2006 08:37:49 +0000 (08:37 +0000)]
Fix reference to uninitialized var (bug introduced in previous commit).

17 years agoUsing zebra::index::field:s rather than zebra::sort::field for retrieval
Adam Dickmeiss [Tue, 19 Dec 2006 16:57:38 +0000 (16:57 +0000)]
Using zebra::index::field:s rather than zebra::sort::field for retrieval
of sort keys.

17 years agoFix bad ptr in yaz_log debug msg
Adam Dickmeiss [Tue, 19 Dec 2006 13:55:05 +0000 (13:55 +0000)]
Fix bad ptr in yaz_log debug msg

17 years agoAvoid file sync for read-only files.
Adam Dickmeiss [Tue, 19 Dec 2006 13:17:34 +0000 (13:17 +0000)]
Avoid file sync for read-only files.

17 years agoChanged assert in isamb, since we have larger keys
Adam Dickmeiss [Tue, 19 Dec 2006 00:25:41 +0000 (00:25 +0000)]
Changed assert in isamb, since we have larger keys

17 years agoImplemented sorting via the ISAMB system. To enable, use sortindex:b
Adam Dickmeiss [Mon, 18 Dec 2006 23:40:06 +0000 (23:40 +0000)]
Implemented sorting via the ISAMB system. To enable, use sortindex:b
in zebra.cfg. The sort keys can also be fetched using zebra::sort:field .

17 years agoDisable check: bug #730
Adam Dickmeiss [Sat, 16 Dec 2006 09:04:28 +0000 (09:04 +0000)]
Disable check: bug #730

17 years agoDefine YAZ_HAVE_XML2
Adam Dickmeiss [Fri, 15 Dec 2006 21:11:37 +0000 (21:11 +0000)]
Define YAZ_HAVE_XML2

17 years agoFixed bug #767: Zebra indexing stylesheet fails to work. The problem
Adam Dickmeiss [Wed, 13 Dec 2006 13:05:45 +0000 (13:05 +0000)]
Fixed bug #767: Zebra indexing stylesheet fails to work. The problem
was that the root node was not of type element and split_level = 0 (deal with
whole record / no split). We make split level 0 a special case (again) and
use extract_full for this.

17 years agoMore digits in output for total docs
Adam Dickmeiss [Tue, 12 Dec 2006 17:33:35 +0000 (17:33 +0000)]
More digits in output for total docs

17 years agoRename isb to isb_postings.
Adam Dickmeiss [Tue, 12 Dec 2006 15:23:45 +0000 (15:23 +0000)]
Rename isb to isb_postings.

17 years agoPossible fix of two bugs (1) bug #759: Crash during retrieval of some
Adam Dickmeiss [Tue, 12 Dec 2006 15:22:26 +0000 (15:22 +0000)]
Possible fix of two bugs (1) bug #759: Crash during retrieval of some
records (2) bug #765.

17 years agoAdded gnuplot scripts for dictisam timings, filesystems comparisons.
Adam Dickmeiss [Tue, 12 Dec 2006 13:54:33 +0000 (13:54 +0000)]
Added gnuplot scripts for dictisam timings, filesystems comparisons.

17 years agoAll output to stdout for easy feet into gnuplot. The following is now
Adam Dickmeiss [Tue, 12 Dec 2006 13:51:23 +0000 (13:51 +0000)]
All output to stdout for easy feet into gnuplot. The following is now
logged: user,real,sys for dict+isam, postings, documents, distinct
words, new words, dict splits, isamb leaf splits, isamb interior splits.

17 years agoAdded functions isamb_set_int_count, isamb-set_cache_size,
Adam Dickmeiss [Tue, 12 Dec 2006 13:46:41 +0000 (13:46 +0000)]
Added functions isamb_set_int_count, isamb-set_cache_size,
isamb_get_int_splits, isamb_get_leaf_splits, iasmb_set_int_count.

17 years agoAdded statistics functions dict_get_no_{lookup,insert,split}.
Adam Dickmeiss [Tue, 12 Dec 2006 13:42:23 +0000 (13:42 +0000)]
Added statistics functions dict_get_no_{lookup,insert,split}.

17 years agoRemove unused variable (dst_buf)
Adam Dickmeiss [Mon, 11 Dec 2006 17:08:03 +0000 (17:08 +0000)]
Remove unused variable (dst_buf)

17 years agoAllow MARC line format to be read.
Adam Dickmeiss [Mon, 11 Dec 2006 15:08:55 +0000 (15:08 +0000)]
Allow MARC line format to be read.

17 years agoUsing timing utility
Adam Dickmeiss [Mon, 11 Dec 2006 10:02:14 +0000 (10:02 +0000)]
Using timing utility

17 years agoAdded a utility for doing timings.
Adam Dickmeiss [Mon, 11 Dec 2006 09:50:36 +0000 (09:50 +0000)]
Added a utility for doing timings.

17 years agoMore appropriate usage
Adam Dickmeiss [Sun, 10 Dec 2006 21:02:28 +0000 (21:02 +0000)]
More appropriate usage

17 years agoMake MARC indexer with different ISAM strategy.
Adam Dickmeiss [Sun, 10 Dec 2006 21:00:56 +0000 (21:00 +0000)]
Make MARC indexer with different ISAM strategy.

17 years agoUse rand() twice on platforms where RAND_MAX < 2^16.
Adam Dickmeiss [Sun, 10 Dec 2006 20:59:52 +0000 (20:59 +0000)]
Use rand() twice on platforms where RAND_MAX < 2^16.

17 years agoAdded new test which makes variations to ISAM entry size.
Adam Dickmeiss [Sun, 10 Dec 2006 11:54:29 +0000 (11:54 +0000)]
Added new test which makes variations to ISAM entry size.