Marc Cromme [Wed, 21 Feb 2007 13:38:22 +0000 (13:38 +0000)]
started explaining each dom filter pipeline
Marc Cromme [Wed, 21 Feb 2007 12:29:52 +0000 (12:29 +0000)]
added figure of workflow on DOM XML filter
Marc Cromme [Tue, 20 Feb 2007 15:02:18 +0000 (15:02 +0000)]
small changes to format
Marc Cromme [Tue, 20 Feb 2007 14:57:00 +0000 (14:57 +0000)]
added proper namespace in example config
Marc Cromme [Tue, 20 Feb 2007 14:53:25 +0000 (14:53 +0000)]
some more changes, more to come
Marc Cromme [Tue, 20 Feb 2007 14:28:31 +0000 (14:28 +0000)]
added initial DOM XML filter documentation. Much is missing yet ...
Adam Dickmeiss [Sun, 18 Feb 2007 21:53:22 +0000 (21:53 +0000)]
Fixed bug #898: xslt tests fail on several platforms. Problem was
that test for zs:index node crashed for absent namespace (href==NULL).
Added all .xslt-files in use in test/xslt tests.
Also fixed memory leak in use of xmlGetNodePath.
Adam Dickmeiss [Sun, 18 Feb 2007 21:50:52 +0000 (21:50 +0000)]
Fixed minor memory leak
Marc Cromme [Thu, 15 Feb 2007 15:41:16 +0000 (15:41 +0000)]
changed to respect correct index instructions in new DOM filter
Marc Cromme [Thu, 15 Feb 2007 15:08:41 +0000 (15:08 +0000)]
optimized code such that the RecWord structure recword is only
initialized once for each to-be-indexed record, and not once for each
to-be-indexed term - at the expense of a bit of pointer passing when
recursively traversing the XML DOM tree
Marc Cromme [Thu, 15 Feb 2007 14:44:48 +0000 (14:44 +0000)]
removed dead code pieces which are remnants of the original
alvis-style parsing and indexing stuff. Now only new dom indexing code
is present.
Marc Cromme [Thu, 15 Feb 2007 14:33:41 +0000 (14:33 +0000)]
pretty formatting of warning messages, always giving the file name and
the XML node path as informative parameters
Marc Cromme [Thu, 15 Feb 2007 13:01:00 +0000 (13:01 +0000)]
rewritten mod_dom instruction parsing code hooked into mod_dom indexing
new stylesheets added, one for PI based indexing, and one for <z:index> based indexing
segmentation fault traced and fixed
test framework updated to use new mod_dom parsing
Marc Cromme [Wed, 14 Feb 2007 16:43:37 +0000 (16:43 +0000)]
added 'static' declaration to function definitions
Marc Cromme [Wed, 14 Feb 2007 16:38:41 +0000 (16:38 +0000)]
changing attribute 'action' to 'type' for better conformance with Alvis
filter syntax
Marc Cromme [Wed, 14 Feb 2007 16:31:37 +0000 (16:31 +0000)]
indenting entire file according to the rules stated in the very end of
the file, using emacs M-x indent-region, and manual line breaking afterwards
Marc Cromme [Wed, 14 Feb 2007 16:16:15 +0000 (16:16 +0000)]
continued hooking in tinfo and recctr, still need to do real indexing
Marc Cromme [Wed, 14 Feb 2007 15:42:24 +0000 (15:42 +0000)]
removed warnings by zillions of (const char *) casts and the like
Marc Cromme [Wed, 14 Feb 2007 15:23:33 +0000 (15:23 +0000)]
removed the crappy PI and <z:index> parsing code committed yesterday
replaced with clean parsing logic developed outside mod_dom.c
needs to take care of all new warnings due to stricter compile flags
finally, needs to be hooked into actual indexing of records
Marc Cromme [Tue, 13 Feb 2007 12:19:37 +0000 (12:19 +0000)]
removed unnecessary out-commented code lines
Marc Cromme [Tue, 13 Feb 2007 11:37:02 +0000 (11:37 +0000)]
factored DOM XML indexing code out into function
static void extract_doc_alvis(struct filter_info *tinfo,
struct recExtractCtrl *recctr,
xmlDocPtr doc)
This is the function to be re-written using both PI and <z:index> instructions,
and also fixing the bug of index type 'p' and '0' chop-over of merged content.
Marc Cromme [Mon, 12 Feb 2007 14:00:20 +0000 (14:00 +0000)]
experimental processing-instruction based indexing XSLT added
Marc Cromme [Mon, 12 Feb 2007 13:58:12 +0000 (13:58 +0000)]
avoiding unnecessary unused namespace declarations in output documents
Marc Cromme [Mon, 12 Feb 2007 13:24:31 +0000 (13:24 +0000)]
added parsing function 'parse_pi_zebra_20' for processing-instruction parsing and 'format_pi_zebra_err' for error or warning formatting. Those are not yet called, and need to be built into the XML parsing in the DOM module.
Adam Dickmeiss [Mon, 12 Feb 2007 10:33:50 +0000 (10:33 +0000)]
Fixed bug #884: Entity declarations in input are lost at retrieval time.
Adam Dickmeiss [Sat, 10 Feb 2007 18:37:42 +0000 (18:37 +0000)]
Fixed serious bug in mf_open which made it fail to see an already existing
metafile. The bug was introduced in mfile 1.70.
Adam Dickmeiss [Sat, 10 Feb 2007 12:46:54 +0000 (12:46 +0000)]
buildconf.sh part of dist.
Marc Cromme [Wed, 7 Feb 2007 13:33:17 +0000 (13:33 +0000)]
corrected DEPRECIATED to DEPRECATED
Marc Cromme [Wed, 7 Feb 2007 13:19:35 +0000 (13:19 +0000)]
added debian libidzebra-2.0-mod-dom package
Marc Cromme [Wed, 7 Feb 2007 12:50:13 +0000 (12:50 +0000)]
making 'dox' target phony
Adam Dickmeiss [Wed, 7 Feb 2007 12:08:54 +0000 (12:08 +0000)]
Implemented new filter 'dom'. See test/xslt/dom-config*xml for examples.
This, like alvis, performs indexing and retrieval using XSLT. But unlike
alvis, it allows multiple XSLT steps to be performed and does ISO2709
Adam Dickmeiss [Tue, 6 Feb 2007 09:34:56 +0000 (09:34 +0000)]
The configuration setting, fileverboselimit, has a value of 1000. When
it is reached, a message is logged. Bug #845.
Adam Dickmeiss [Tue, 6 Feb 2007 09:33:31 +0000 (09:33 +0000)]
Omit sort info: bug #844.
Adam Dickmeiss [Tue, 6 Feb 2007 09:32:50 +0000 (09:32 +0000)]
More compact statistics
Marc Cromme [Mon, 5 Feb 2007 14:32:31 +0000 (14:32 +0000)]
dropped section on future directions
Marc Cromme [Mon, 5 Feb 2007 14:05:26 +0000 (14:05 +0000)]
spell checked
Marc Cromme [Mon, 5 Feb 2007 14:02:27 +0000 (14:02 +0000)]
formatting of feature tables updated
Marc Cromme [Mon, 5 Feb 2007 13:35:12 +0000 (13:35 +0000)]
feature table updated
Marc Cromme [Fri, 2 Feb 2007 14:42:44 +0000 (14:42 +0000)]
cleaning a bit. more cleaning needed
Marc Cromme [Fri, 2 Feb 2007 14:34:20 +0000 (14:34 +0000)]
more feature info. tables still look like a grand disaster, but the content is there - more or less. needs pretty formatting and tweaking
Adam Dickmeiss [Fri, 2 Feb 2007 13:48:13 +0000 (13:48 +0000)]
Fixed bug in zebrasrv: the default module path was not
recognized.
Adam Dickmeiss [Fri, 2 Feb 2007 12:16:38 +0000 (12:16 +0000)]
Use YAZ_BIB1_SYSTEM_ERROR_IN_PRESENTING_RECORDS everywhere where
this diagnostic is returned. Put more appropriate addinfo in case
of filter load failure during retrieval.
Adam Dickmeiss [Fri, 2 Feb 2007 12:07:33 +0000 (12:07 +0000)]
Fix DEFAULT_PROFILE_PATH
Marc Cromme [Fri, 2 Feb 2007 11:10:08 +0000 (11:10 +0000)]
replaced acronyms in XML text with newly defined acronym entities
Marc Cromme [Fri, 2 Feb 2007 09:58:39 +0000 (09:58 +0000)]
added acronym entities
Marc Cromme [Thu, 1 Feb 2007 21:26:30 +0000 (21:26 +0000)]
some more typos corrected
Marc Cromme [Thu, 1 Feb 2007 21:18:53 +0000 (21:18 +0000)]
corrected typos
Marc Cromme [Thu, 1 Feb 2007 21:08:52 +0000 (21:08 +0000)]
added Alvis 'XML'
Marc Cromme [Thu, 1 Feb 2007 21:08:12 +0000 (21:08 +0000)]
placed Alvis filter module before GRS-1 in arch chapter
Marc Cromme [Thu, 1 Feb 2007 21:04:15 +0000 (21:04 +0000)]
placing Alvis filter chapter before GRS-1 filter chapter
Marc Cromme [Thu, 1 Feb 2007 20:49:05 +0000 (20:49 +0000)]
first shot on tabulated feature overview - much needs to be done yet
Mike Taylor [Wed, 31 Jan 2007 12:26:50 +0000 (12:26 +0000)]
New
Adam Dickmeiss [Wed, 24 Jan 2007 18:00:39 +0000 (18:00 +0000)]
Bump version to 2.0.11
Adam Dickmeiss [Wed, 24 Jan 2007 16:05:25 +0000 (16:05 +0000)]
Depend on YAZ 2.1.48 or later
Adam Dickmeiss [Wed, 24 Jan 2007 15:23:58 +0000 (15:23 +0000)]
Towards 2.0.10.
Adam Dickmeiss [Mon, 22 Jan 2007 18:15:02 +0000 (18:15 +0000)]
Staticrank indexing is now an index register type defined in default.idx
via directive 'staticrank'. The 'staticrank' directive for grs is no longer
supported (was only implemented for Zebra 2.0.8).
Mike Taylor [Mon, 22 Jan 2007 11:02:12 +0000 (11:02 +0000)]
New
Adam Dickmeiss [Wed, 17 Jan 2007 15:35:47 +0000 (15:35 +0000)]
Avoid full rset count for rset_count. Proper break for result set
sort/rank.
Adam Dickmeiss [Wed, 17 Jan 2007 13:51:36 +0000 (13:51 +0000)]
Change prototype of busyhandler
Marc Cromme [Wed, 17 Jan 2007 13:31:36 +0000 (13:31 +0000)]
documentation updated according to suggestion from zebra list
Adam Dickmeiss [Wed, 17 Jan 2007 13:22:53 +0000 (13:22 +0000)]
Experiment with check for dead sessions.
Adam Dickmeiss [Wed, 17 Jan 2007 12:59:38 +0000 (12:59 +0000)]
For searches, allow truncmax value to be controlled with attribute 13.
If given, that overrides the value of 'truncmax'.
Adam Dickmeiss [Tue, 16 Jan 2007 15:31:23 +0000 (15:31 +0000)]
For truncations being limited (abort at truncmax terms), zebrasrv returns
SearchResponse with resultSetStatus=subset.
Adam Dickmeiss [Tue, 16 Jan 2007 15:01:15 +0000 (15:01 +0000)]
For estimated hit counts, Zebra returns resultSetStatus=estimate as
part of SearchResponse. Requires YAZ 2.1.47 or later. Bump Zebra
to version 2.0.9.
Adam Dickmeiss [Mon, 15 Jan 2007 20:14:37 +0000 (20:14 +0000)]
Towards 2.0.8.
Adam Dickmeiss [Mon, 15 Jan 2007 20:11:44 +0000 (20:11 +0000)]
Noted some changes to NEWS
Adam Dickmeiss [Mon, 15 Jan 2007 20:08:23 +0000 (20:08 +0000)]
2007.
Adam Dickmeiss [Mon, 15 Jan 2007 20:04:34 +0000 (20:04 +0000)]
Use dblatex instead of pdfjadetex to generate PDF
Adam Dickmeiss [Mon, 15 Jan 2007 15:10:13 +0000 (15:10 +0000)]
2007.
Adam Dickmeiss [Mon, 15 Jan 2007 15:01:35 +0000 (15:01 +0000)]
Added releaseinfo
Adam Dickmeiss [Mon, 15 Jan 2007 14:55:50 +0000 (14:55 +0000)]
Productname from ZEBRA to zebra in refentries
Adam Dickmeiss [Mon, 15 Jan 2007 14:51:04 +0000 (14:51 +0000)]
Rename table ID to a unique and proper ID.
Adam Dickmeiss [Mon, 15 Jan 2007 14:48:51 +0000 (14:48 +0000)]
Put author elements in authorgroup.
Adam Dickmeiss [Mon, 15 Jan 2007 13:01:26 +0000 (13:01 +0000)]
Fixed bug #815: Extended Services "delete" requires a dummy "record" to
be set. The fix, surprisingly, is to allow SUTRS for "update". The problem
is that Z39.50 mandates that *some* record be sent (including the
case where a record is to be deleted by ID). However, ZOOM-C sends a "void"
SUTRS record in this case. This record is not parsed by the server.
Adam Dickmeiss [Tue, 9 Jan 2007 09:11:22 +0000 (09:11 +0000)]
Suggest fakeroot
Adam Dickmeiss [Fri, 5 Jan 2007 16:11:25 +0000 (16:11 +0000)]
Null value for res_set makes it do nothing
Adam Dickmeiss [Fri, 5 Jan 2007 10:45:11 +0000 (10:45 +0000)]
Removed zebra_timing_t. Using yaz_timing_t instead. Requires YAZ 2.1.45.
Adam Dickmeiss [Fri, 22 Dec 2006 19:47:36 +0000 (19:47 +0000)]
Implemented bug #781: Easier tracking of result sets. We only do this
when mkstemp is available (most modern Unixes support this). The PID
is included in the filename: tempdir/zrs_PID_XXXXXX.
Adam Dickmeiss [Fri, 22 Dec 2006 13:57:25 +0000 (13:57 +0000)]
Added support for specification of staticrank for grs-class of filters.
This is enabled by using 'staticrank indexname' in .abs. Contents of
indexname (elm ... indexname:w) is then used as value for staticrank.
Adam Dickmeiss [Fri, 22 Dec 2006 13:57:03 +0000 (13:57 +0000)]
Using atoizint for staticrank
Adam Dickmeiss [Fri, 22 Dec 2006 13:48:20 +0000 (13:48 +0000)]
Added function atoizint, an atoi/atoll wrapper.
Adam Dickmeiss [Fri, 22 Dec 2006 12:14:25 +0000 (12:14 +0000)]
Add timing for zebra_start/zebra_stop.
Adam Dickmeiss [Fri, 22 Dec 2006 12:13:39 +0000 (12:13 +0000)]
Use atoll/atoi for ascii to zint conversion in alvis filter.
Adam Dickmeiss [Wed, 20 Dec 2006 14:29:04 +0000 (14:29 +0000)]
Do not warn for truncmax limit
Adam Dickmeiss [Wed, 20 Dec 2006 14:19:21 +0000 (14:19 +0000)]
Do not truncate more than 'truncmax' terms in one single
term with right-truncation/regular attribute. By default, 'truncmax'
is 10000.
Adam Dickmeiss [Wed, 20 Dec 2006 08:37:49 +0000 (08:37 +0000)]
Fix reference to uninitialized var (bug introduced in previous commit).
Adam Dickmeiss [Tue, 19 Dec 2006 16:57:38 +0000 (16:57 +0000)]
Using zebra::index::field:s rather than zebra::sort::field for retrieval
of sort keys.
Adam Dickmeiss [Tue, 19 Dec 2006 13:55:05 +0000 (13:55 +0000)]
Fix bad ptr in yaz_log debug msg
Adam Dickmeiss [Tue, 19 Dec 2006 13:17:34 +0000 (13:17 +0000)]
Avoid file sync for read-only files.
Adam Dickmeiss [Tue, 19 Dec 2006 00:25:41 +0000 (00:25 +0000)]
Changed assert in isamb, since we have larger keys
Adam Dickmeiss [Mon, 18 Dec 2006 23:40:06 +0000 (23:40 +0000)]
Implemented sorting via the ISAMB system. To enable, use sortindex:b
in zebra.cfg. The sort keys can also be fetched using zebra::sort:field .
Adam Dickmeiss [Sat, 16 Dec 2006 09:04:28 +0000 (09:04 +0000)]
Disable check: bug #730
Adam Dickmeiss [Fri, 15 Dec 2006 21:11:37 +0000 (21:11 +0000)]
Define YAZ_HAVE_XML2
Adam Dickmeiss [Wed, 13 Dec 2006 13:05:45 +0000 (13:05 +0000)]
Fixed bug #767: Zebra indexing stylesheet fails to work. The problem
was that the root node was not of type element and split_level = 0 (deal with
whole record / no split). We make split level 0 a special case (again) and
use extract_full for this.
Adam Dickmeiss [Tue, 12 Dec 2006 17:33:35 +0000 (17:33 +0000)]
More digits in output for total docs
Adam Dickmeiss [Tue, 12 Dec 2006 15:23:45 +0000 (15:23 +0000)]
Rename isb to isb_postings.
Adam Dickmeiss [Tue, 12 Dec 2006 15:22:26 +0000 (15:22 +0000)]
Possible fix of two bugs (1) bug #759: Crash during retrieval of some
records (2) bug #765.
Adam Dickmeiss [Tue, 12 Dec 2006 13:54:33 +0000 (13:54 +0000)]
Added gnuplot scripts for dictisam timings, filesystems comparisons.
Adam Dickmeiss [Tue, 12 Dec 2006 13:51:23 +0000 (13:51 +0000)]
All output to stdout for easy feed into gnuplot. The following is now
logged: user,real,sys for dict+isam, postings, documents, distinct
words, new words, dict splits, isamb leaf splits, isamb interior splits.
Adam Dickmeiss [Tue, 12 Dec 2006 13:46:41 +0000 (13:46 +0000)]
Added functions isamb_set_int_count, isamb_set_cache_size,
isamb_get_int_splits, isamb_get_leaf_splits.