Marc Cromme [Wed, 29 Nov 2006 09:01:53 +0000 (09:01 +0000)]
shortened zebra::meta:.sysno lement XML format
Marc Cromme [Tue, 28 Nov 2006 14:37:45 +0000 (14:37 +0000)]
added example of config file
Marc Cromme [Tue, 28 Nov 2006 14:18:26 +0000 (14:18 +0000)]
added entry on encoding directive
Marc Cromme [Tue, 28 Nov 2006 13:05:57 +0000 (13:05 +0000)]
added more information on character mapping
added entry on equivalent directive
added short examples
Adam Dickmeiss [Tue, 28 Nov 2006 11:42:13 +0000 (11:42 +0000)]
Towards 2.0.6.
Adam Dickmeiss [Tue, 28 Nov 2006 08:44:10 +0000 (08:44 +0000)]
Issue warning if staticrank is < 0. And set to 0
Adam Dickmeiss [Tue, 28 Nov 2006 08:43:53 +0000 (08:43 +0000)]
Added assert
Adam Dickmeiss [Mon, 27 Nov 2006 21:50:15 +0000 (21:50 +0000)]
Zebra 2 RPM with the following packages: idzebra-2.0, libidzebra-2.0,
libidzebra-2.0-modules, libidzebra-2.0-devel.
Adam Dickmeiss [Mon, 27 Nov 2006 14:27:12 +0000 (14:27 +0000)]
Require YAZ 2.1.40 or later
Adam Dickmeiss [Mon, 27 Nov 2006 10:40:33 +0000 (10:40 +0000)]
Avoid indexing of zebraInfo stuff. Problem is that explain.abs has lines
elm (4,2) targetInfo ExplainCategory
and it will zebraInfo child as ExplainCategory. This is now prevented
by making a zebraInfo node with no children.
Adam Dickmeiss [Mon, 27 Nov 2006 10:10:14 +0000 (10:10 +0000)]
Better debugging for indexing.. Introduction of zebra_log_dict_entry.
Adam Dickmeiss [Mon, 27 Nov 2006 10:09:48 +0000 (10:09 +0000)]
Fixed bug #736: Updates gets slower. The problem was that duplicate
name tags (with database name in it) was generated for attributeDetails.
This caused indexes to grow and grow for each update iteration.
Adam Dickmeiss [Mon, 27 Nov 2006 09:17:01 +0000 (09:17 +0000)]
Disable hit snippets for now. It does not work well and it slows retrieval
down a lot.
Adam Dickmeiss [Sat, 25 Nov 2006 09:15:19 +0000 (09:15 +0000)]
Fixed bug #735: Delete fails to process rest of file after a record that
does not exist,
Adam Dickmeiss [Fri, 24 Nov 2006 19:48:23 +0000 (19:48 +0000)]
Added new object files to list
Adam Dickmeiss [Fri, 24 Nov 2006 13:25:42 +0000 (13:25 +0000)]
Rephase a bit.
Adam Dickmeiss [Fri, 24 Nov 2006 13:25:02 +0000 (13:25 +0000)]
Mention --disable-shared option as a way to prevent modules to be
built.
Adam Dickmeiss [Fri, 24 Nov 2006 13:05:11 +0000 (13:05 +0000)]
Moved section about special retrieval zebra:: to Architecture chapter.
Added table with supported zebra:: elements.
Marc Cromme [Fri, 24 Nov 2006 12:21:31 +0000 (12:21 +0000)]
updated test since non-used attributes are not longer printed out
Adam Dickmeiss [Fri, 24 Nov 2006 11:35:23 +0000 (11:35 +0000)]
Using wrbuf rather than sprintf. Allow all syntaxes for ::data fetch.
Adam Dickmeiss [Thu, 23 Nov 2006 21:38:26 +0000 (21:38 +0000)]
Added check for sys/utsname.h.
Adam Dickmeiss [Thu, 23 Nov 2006 21:35:02 +0000 (21:35 +0000)]
Added Include of string.h
Adam Dickmeiss [Thu, 23 Nov 2006 21:34:22 +0000 (21:34 +0000)]
Omit test with multiple processes and update on FreeBSD. Bug #733.
Adam Dickmeiss [Thu, 23 Nov 2006 18:42:48 +0000 (18:42 +0000)]
Make buildconf.sh work for FreeBSD
Adam Dickmeiss [Thu, 23 Nov 2006 14:01:21 +0000 (14:01 +0000)]
Use size of rec->info stuff in meta record fetch. Failing that will
result in a SEGV or similar.
Marc Cromme [Thu, 23 Nov 2006 09:03:50 +0000 (09:03 +0000)]
implemented full metadata display of fast element set names
implemented more test of fast element set names, including zebra::meta and zebra::data element set names
updated documentation on fast element set names
Adam Dickmeiss [Wed, 22 Nov 2006 14:12:49 +0000 (14:12 +0000)]
Added include of yaz/xmalloc.h.
Adam Dickmeiss [Wed, 22 Nov 2006 14:06:53 +0000 (14:06 +0000)]
Remove function tl_start_log.
Adam Dickmeiss [Wed, 22 Nov 2006 11:13:17 +0000 (11:13 +0000)]
Added a few memset for structures that are later written to disk.
This makes valgind happy on platforms which aligns members in structs.
Adam Dickmeiss [Wed, 22 Nov 2006 10:26:12 +0000 (10:26 +0000)]
Added initializer value for two locals
Adam Dickmeiss [Wed, 22 Nov 2006 09:42:27 +0000 (09:42 +0000)]
For build script: -p enables profiling.
Adam Dickmeiss [Tue, 21 Nov 2006 22:17:49 +0000 (22:17 +0000)]
Get rid of SYSNO which is zint anyway. Removed various prototypes
of unused/undefined functions.
Adam Dickmeiss [Tue, 21 Nov 2006 17:55:47 +0000 (17:55 +0000)]
Added
Adam Dickmeiss [Tue, 21 Nov 2006 17:48:08 +0000 (17:48 +0000)]
Make threading for index sort configurable.
Adam Dickmeiss [Tue, 21 Nov 2006 14:54:12 +0000 (14:54 +0000)]
Remove log msg
Adam Dickmeiss [Tue, 21 Nov 2006 14:32:38 +0000 (14:32 +0000)]
Split it_key stuff into it_key.c. For indexing, sort key chunks in a separate
thread (POSIX threads only).
Adam Dickmeiss [Mon, 20 Nov 2006 13:59:35 +0000 (13:59 +0000)]
Increase reckeys hash size from 1023 to 32767.
Adam Dickmeiss [Mon, 20 Nov 2006 13:59:13 +0000 (13:59 +0000)]
Remove obsolete SORT_EXTRA code (no longer in use).
Adam Dickmeiss [Fri, 17 Nov 2006 17:32:45 +0000 (17:32 +0000)]
Contact address for Zebra is zebra-help@indexdata.dk
Marc Cromme [Fri, 17 Nov 2006 14:54:00 +0000 (14:54 +0000)]
updated docs to reflect change to 'zebra::index::' special elem set name
Marc Cromme [Fri, 17 Nov 2006 14:24:56 +0000 (14:24 +0000)]
added tests fror wrong element set names which are not recognized
Marc Cromme [Fri, 17 Nov 2006 13:47:22 +0000 (13:47 +0000)]
preventing later memory acess violation when one leaves retrieve.c:zebra_special_fetch by error condition before setting *rec_lenp by setting int len = 0; upon initialization.
Marc Cromme [Fri, 17 Nov 2006 08:20:17 +0000 (08:20 +0000)]
removing unnecessary comments
not setting the *rec_lenp = 0; at retrieve.c:127 to provoke test failure with segfault in the last unit test of test/api/t16.c , which is commented out. It seems like *rec_lenp = wrbuf_len(wrbuf); at retrieve.c:247 leaves rec_lenp in an uninitialized state, which causes a segfault later. More investigation needed.
Marc Cromme [Thu, 16 Nov 2006 21:58:54 +0000 (21:58 +0000)]
protected wrong buffer-length write by initializing to 0, but there is more debugging needed: there is still something fishy with the last out-commented test of test/api/t16.c.
Marc Cromme [Thu, 16 Nov 2006 20:44:28 +0000 (20:44 +0000)]
added tests for zebra::data, zebra::meta::sysno and zebra::index::title:p element sets. Last one segfaults and need further debugging, it is commented out.
Adam Dickmeiss [Thu, 16 Nov 2006 13:27:54 +0000 (13:27 +0000)]
For alvis filter make XML pass-through if stylesheet is attribute
is omitted (not specified).
Adam Dickmeiss [Thu, 16 Nov 2006 12:48:28 +0000 (12:48 +0000)]
Added new test for check for zebra:: element set names.
Adam Dickmeiss [Thu, 16 Nov 2006 11:11:36 +0000 (11:11 +0000)]
Fix setting of recordAttr->recordSize in case of storeData method.
Adam Dickmeiss [Thu, 16 Nov 2006 11:10:46 +0000 (11:10 +0000)]
Output format = input for zebra::data fetch (raw fetch).
Adam Dickmeiss [Thu, 16 Nov 2006 10:49:11 +0000 (10:49 +0000)]
idzebra:: stuff functional.
Adam Dickmeiss [Thu, 16 Nov 2006 10:48:35 +0000 (10:48 +0000)]
Annotate zerbaExplain_lookup_attr_str
Marc Cromme [Wed, 15 Nov 2006 14:09:43 +0000 (14:09 +0000)]
added suggestions for better workflow in comments
Marc Cromme [Tue, 14 Nov 2006 14:32:13 +0000 (14:32 +0000)]
re-organized record retrieval such that special element set names
zebra::index display all indexed terms from all indexes
zebra::index::title display all terms from only title index
zebra::index::title:p display all terms from only title index type p
zebra::meta::sysno display only zebra internal record sysno
zebra::data dislpay raw record binary data as SUTRS
all work
More testing with zebra:.data is needed
Sitll missing zebra::meta::filename, zebra::meta::count, etc ..
Better error messages would be an asset...
Adam Dickmeiss [Tue, 14 Nov 2006 12:41:19 +0000 (12:41 +0000)]
Added small test of mfile sub system
Adam Dickmeiss [Tue, 14 Nov 2006 12:11:48 +0000 (12:11 +0000)]
Avoid abort. EXTRA_CHECK defined and annotated
Adam Dickmeiss [Tue, 14 Nov 2006 12:03:48 +0000 (12:03 +0000)]
Use assert rather than abort
Adam Dickmeiss [Tue, 14 Nov 2006 10:45:34 +0000 (10:45 +0000)]
Changed test to pass for Libxml 2.6.16.
Adam Dickmeiss [Tue, 14 Nov 2006 10:03:21 +0000 (10:03 +0000)]
Fix mf_write call in cf_commit_flat.
Adam Dickmeiss [Tue, 14 Nov 2006 08:12:06 +0000 (08:12 +0000)]
Work on bug #550: Avoid exit. In particular the mfile/cfile/bfile has
been refactored quite a bit. Remaining exit uses zebra_exit.
Removed redundant apitest.c.
Marc Cromme [Mon, 13 Nov 2006 14:53:40 +0000 (14:53 +0000)]
added section on special zebra:: elemset names, and updated docs to follow suit with changes alvis oai filter example
Marc Cromme [Mon, 13 Nov 2006 13:53:49 +0000 (13:53 +0000)]
written XML formatting for new zebra-specific elem set 'zebra::' which lets you dump the exact content of indexed fields in records.
Using for example elemset 'zebra::dc_title' dumps only the physical string index 'dc_title' fields in the virtual, from indexes constructed, record. Usisn 'zebra::dc_title:w' dumps only te subset of those which are indexed as type 'w'.
Added test record for oai-alvis example with utf-8 sequences and nasty XML escapes. Testing works on GRS-1 filters, but yet not on 'Alvis' filters, the direct YAZ function zebra_storekeys_fetch in index/retrieve.c seems not be called at all .. more debugging needed.
Adam Dickmeiss [Mon, 13 Nov 2006 09:07:05 +0000 (09:07 +0000)]
Parse element spec zebra::field,type to make Zebra return storekeys for
records.
Adam Dickmeiss [Fri, 10 Nov 2006 13:10:31 +0000 (13:10 +0000)]
Test for bug #730.
Adam Dickmeiss [Fri, 10 Nov 2006 12:57:49 +0000 (12:57 +0000)]
In tl_init_data check for return code from zebra_add_record.
Adam Dickmeiss [Fri, 10 Nov 2006 12:56:26 +0000 (12:56 +0000)]
Fixed bug #730: Bad XML for update with alvis filter does not fail as
it should. The Libxml2 function xmlReadIO does not report errors so
we use xml reader interface always for ALVIS filter.
Adam Dickmeiss [Thu, 9 Nov 2006 14:39:24 +0000 (14:39 +0000)]
Renamed function zebra_snippets_rec_keys to zebra_rec_keys_to_snippets.
Factor storedata retrieval out of zebra_record_fetch.
Adam Dickmeiss [Wed, 8 Nov 2006 22:08:26 +0000 (22:08 +0000)]
Change mfile_/cfile-functions to return error code(s) rather than exit.
Added bf_read2 / bf_write2 which does not call exit for errors.
Adam Dickmeiss [Wed, 8 Nov 2006 22:06:50 +0000 (22:06 +0000)]
Removed a few redundant comments.
Adam Dickmeiss [Wed, 8 Nov 2006 22:05:43 +0000 (22:05 +0000)]
Use blanks rather than NULs in block (bfile test)
Adam Dickmeiss [Wed, 8 Nov 2006 13:05:33 +0000 (13:05 +0000)]
Changed test to that bfs is reconstructed after bfs_reset.
Adam Dickmeiss [Wed, 8 Nov 2006 12:59:27 +0000 (12:59 +0000)]
Added test of bfile/cfile/mfile system
Adam Dickmeiss [Mon, 6 Nov 2006 20:48:42 +0000 (20:48 +0000)]
Filter 'text' loadable as most other filter types. 'grs.sgml' always
static.
Adam Dickmeiss [Sun, 5 Nov 2006 19:57:40 +0000 (19:57 +0000)]
Only enable ALVIS if Libxml2 2.6.15 or later is in use.
Adam Dickmeiss [Fri, 3 Nov 2006 23:17:08 +0000 (23:17 +0000)]
Fixed bug #723: Enable EXSLT for alvis. We use whatever YAZ
finds out for Unix confiugre (YAZ_HAVE_EXSLT & YAZ_HAVE_XSLT).
Adam Dickmeiss [Thu, 2 Nov 2006 10:47:05 +0000 (10:47 +0000)]
Log level -v indexdetails logs all terms indexed for each record.
Mike Taylor [Wed, 1 Nov 2006 18:05:53 +0000 (18:05 +0000)]
Ignore another generated binary
Adam Dickmeiss [Mon, 30 Oct 2006 16:21:29 +0000 (16:21 +0000)]
Use f(void) instead of f()
Adam Dickmeiss [Mon, 30 Oct 2006 14:05:30 +0000 (14:05 +0000)]
Fixed bug #234: mbox filter hangs when seeing a non-mbox filter.
Also, again (!), fixed the way regx determines EOF; now EOF
is signalled if nore rules matches (except BEGIN which always matches).
Adam Dickmeiss [Mon, 30 Oct 2006 11:18:26 +0000 (11:18 +0000)]
Revert last change for extract.c, due to tests failing. We can't use
offset for determining whether a record stream indexing is finished
or not. Indeed filters, like ALVIS, reads large chunks at a time.
The grs.{regx,tcl} filter instead is modified to return the
"right thing", i.e. NULL data1 tree (signalling EOF / no more data).
Adam Dickmeiss [Sun, 29 Oct 2006 20:35:58 +0000 (20:35 +0000)]
For record file stream indexing, stop looping if offset is unmodified
by call to filter. This appears as bug #234, but really isn't.
Adam Dickmeiss [Sun, 29 Oct 2006 18:01:26 +0000 (18:01 +0000)]
Use f(void) rather than f()
Adam Dickmeiss [Sun, 29 Oct 2006 17:20:00 +0000 (17:20 +0000)]
Fixed several prototypes.. Most changes are f() to f(void).
Adam Dickmeiss [Sun, 29 Oct 2006 17:19:07 +0000 (17:19 +0000)]
Added -Wstrict-prototypes for buildconf.sh -d mode. Added -o mode which
uses -O3 plus warning options.
Adam Dickmeiss [Sun, 29 Oct 2006 17:18:05 +0000 (17:18 +0000)]
Removed unused function rpn_scan_ver1
Adam Dickmeiss [Fri, 27 Oct 2006 11:05:14 +0000 (11:05 +0000)]
Initialize variable which could be un-initialized
Adam Dickmeiss [Thu, 26 Oct 2006 23:49:14 +0000 (23:49 +0000)]
Change prototype for data1_absyn_getmarctab, data1_absyn_getelements.
Adam Dickmeiss [Thu, 26 Oct 2006 23:45:46 +0000 (23:45 +0000)]
Fixed bug #710: Duplicate keys for CDATA in xelm/melm rules. For X-Path
termlist indexing, the sequence number is updated to the maximum sequence
number generated.
Adam Dickmeiss [Mon, 23 Oct 2006 09:31:48 +0000 (09:31 +0000)]
Added a check to verify that private OIDs works
Adam Dickmeiss [Mon, 23 Oct 2006 09:01:42 +0000 (09:01 +0000)]
Removed no longer in use zebrazv.cfg
Adam Dickmeiss [Mon, 16 Oct 2006 11:03:47 +0000 (11:03 +0000)]
Towards 2.0.4
Adam Dickmeiss [Mon, 16 Oct 2006 07:04:29 +0000 (07:04 +0000)]
Build-Depends yaz > 2.1.36
Adam Dickmeiss [Sun, 15 Oct 2006 07:28:21 +0000 (07:28 +0000)]
More news items
Adam Dickmeiss [Thu, 12 Oct 2006 13:06:00 +0000 (13:06 +0000)]
Fixed bug #47: Commit needs to check for roll-back.
Adam Dickmeiss [Thu, 12 Oct 2006 12:28:42 +0000 (12:28 +0000)]
Fixed bug #696: Less than Date searches returns unexpected hit count.
Adam Dickmeiss [Thu, 12 Oct 2006 12:27:28 +0000 (12:27 +0000)]
Allow character ordinal 1 in [..] ranges to specify "raw" value.
Adam Dickmeiss [Thu, 12 Oct 2006 10:13:33 +0000 (10:13 +0000)]
Fix crash that occurred if mc-ldr rules was used in conjunction with
missing .mar file.
Adam Dickmeiss [Wed, 11 Oct 2006 20:19:31 +0000 (20:19 +0000)]
Added tstpass test
Adam Dickmeiss [Wed, 11 Oct 2006 20:18:47 +0000 (20:18 +0000)]
Fixed bug 672: Trailing characters in password are ignored. By
default htpasswd uses DES encryption of maximum key length 8. This
means that a password can only have 8 significant digits. Function
passwd_db_auth now returns -2 if password is greater than 8.
Adam Dickmeiss [Wed, 11 Oct 2006 12:37:23 +0000 (12:37 +0000)]
More on regular expressions in grs.regx filters.
Adam Dickmeiss [Wed, 11 Oct 2006 12:23:24 +0000 (12:23 +0000)]
Fixed bug #658: update docs on zebra.cfg settings.