Adam Dickmeiss [Fri, 7 Dec 2007 10:01:53 +0000 (10:01 +0000)]
Suggest yaz-doc instead of Recommend for libyaz3-dev
Adam Dickmeiss [Fri, 30 Nov 2007 11:44:46 +0000 (11:44 +0000)]
Adjustments to make YAZ compile as C++ code.
Adam Dickmeiss [Fri, 30 Nov 2007 11:43:47 +0000 (11:43 +0000)]
Changed return value of function icu_sortkey8_from_utf16 to void.
The return value is useless and error is signalled by status
parameter. This breaks binary compatibility for that function but
it is not in use anywhere.
Adam Dickmeiss [Fri, 30 Nov 2007 11:42:04 +0000 (11:42 +0000)]
Use oid_class rather than int for OID class.
Changed oclass parameter for yaz_oid_add function from type int to
enum oid_class. This should not break binary compatibility.
Changed oid member in struct yaz_oid_entry to enum oid_class.
Adam Dickmeiss [Wed, 21 Nov 2007 13:53:20 +0000 (13:53 +0000)]
Use schema in Z39.50 Update.
If schema is given, supply it as part of Z39.50 Extended Service
update. Patch from Jacob Poulsen.
Adam Dickmeiss [Wed, 21 Nov 2007 09:04:46 +0000 (09:04 +0000)]
Man pages in Reference section.
All man pages are combined in one Reference section, manref.xml,
which is extracted via XSLT from each individual man page.
Adam Dickmeiss [Fri, 16 Nov 2007 08:01:47 +0000 (08:01 +0000)]
Update test for new wrbuf_puts_escaped functions
Adam Dickmeiss [Thu, 15 Nov 2007 08:45:52 +0000 (08:45 +0000)]
Introduce wrbuf_{write,puts}_escaped.
Introduce wrbuf_{write,puts}_escaped. Removed wrbuf_verbose_str
which now has the new name wrbuf_write_escaped. wrbuf_verbose_str
has not been used outside YAZ in released software so this should
be safe. Added a few Doxygen comments for wrbuf.h as well.
Adam Dickmeiss [Thu, 15 Nov 2007 08:36:31 +0000 (08:36 +0000)]
Added yaz_puts_escaped
Adam Dickmeiss [Thu, 15 Nov 2007 08:33:15 +0000 (08:33 +0000)]
Version 3.0.17.
Adam Dickmeiss [Thu, 15 Nov 2007 08:28:08 +0000 (08:28 +0000)]
ICU utility prints sortkey as well
Adam Dickmeiss [Wed, 14 Nov 2007 21:03:59 +0000 (21:03 +0000)]
Doxygen comments
Adam Dickmeiss [Mon, 12 Nov 2007 11:48:13 +0000 (11:48 +0000)]
More on the ICU wrapper.
Adam Dickmeiss [Mon, 12 Nov 2007 11:43:47 +0000 (11:43 +0000)]
Fix spelling
Adam Dickmeiss [Mon, 12 Nov 2007 11:41:53 +0000 (11:41 +0000)]
yaz-icu man page/program in yaz-icu package only.
The yaz-icu man page and program is part of package yaz-icu (not
yaz). Descriptions in debian/control fixed to keep linda happy.
Adam Dickmeiss [Mon, 12 Nov 2007 11:14:48 +0000 (11:14 +0000)]
yaz-icu part of packages
Adam Dickmeiss [Mon, 12 Nov 2007 11:13:05 +0000 (11:13 +0000)]
Added yaz-icu man page
Adam Dickmeiss [Mon, 12 Nov 2007 11:12:19 +0000 (11:12 +0000)]
Update to use transform step
Adam Dickmeiss [Mon, 12 Nov 2007 11:11:16 +0000 (11:11 +0000)]
Renamed 'normalize' rule to 'transform'.
The ICU chain step, normalize is now called transform because
that's what ICU calls it. And we better keep things aligned.
Adam Dickmeiss [Mon, 12 Nov 2007 08:58:16 +0000 (08:58 +0000)]
Towards 3.0.16
Adam Dickmeiss [Mon, 12 Nov 2007 08:57:45 +0000 (08:57 +0000)]
Using iochan_event_loop (WIN32)
Adam Dickmeiss [Mon, 12 Nov 2007 08:47:22 +0000 (08:47 +0000)]
Program yaz-icu is noinst rather than bin.
Program yaz-icu is noinst rather than bin for now. Will be installed in
bin when a man page has been created, bug #1797.
Adam Dickmeiss [Mon, 12 Nov 2007 08:43:49 +0000 (08:43 +0000)]
Added note about yaz_poll
Adam Dickmeiss [Mon, 12 Nov 2007 08:41:56 +0000 (08:41 +0000)]
Rename function event_loop to iochan_event_loop.
Adam Dickmeiss [Sat, 10 Nov 2007 08:59:31 +0000 (08:59 +0000)]
Do not handle EINTR in yaz_poll.
Change yaz_poll so that it does not loop around select/poll if
errno=EINTR is set. This sitution is handled in caller, such
as event_loop used by GFS which must take further action if EINTR
is set. If EINTR is handled by yaz_poll it does not get caught.
Adam Dickmeiss [Fri, 9 Nov 2007 22:08:14 +0000 (22:08 +0000)]
Change timeout parameter for yaz_poll.
Change timeout paramater from seconds to seconds+nano-seconds for
yaz_poll. Also make yaz_poll_{poll,select} static (internal).
Adam Dickmeiss [Fri, 9 Nov 2007 21:52:38 +0000 (21:52 +0000)]
Remove include of sys/select.h from tcpip.c . It is not needed.
Adam Dickmeiss [Fri, 9 Nov 2007 19:00:42 +0000 (19:00 +0000)]
Use yaz_poll everywhere.. And fix syntax error!
Adam Dickmeiss [Fri, 9 Nov 2007 19:00:08 +0000 (19:00 +0000)]
Describe new functions.
Adam Dickmeiss [Fri, 9 Nov 2007 18:49:19 +0000 (18:49 +0000)]
Enable use of yaz_poll always in event_loop.
Adam Dickmeiss [Fri, 9 Nov 2007 18:47:50 +0000 (18:47 +0000)]
YAZ GFS uses yaz_poll.
The event_loop function uses by YAZ' Generic Frontend Server uses
the yaz_poll function - instead of select.
Adam Dickmeiss [Fri, 9 Nov 2007 16:46:43 +0000 (16:46 +0000)]
Implemented yaz_poll - select/poll wrapper.
Adam Dickmeiss [Thu, 8 Nov 2007 18:02:04 +0000 (18:02 +0000)]
Reindent
Adam Dickmeiss [Thu, 8 Nov 2007 17:22:48 +0000 (17:22 +0000)]
Allow any locale for icu_chain_create.
Adam Dickmeiss [Thu, 8 Nov 2007 17:20:32 +0000 (17:20 +0000)]
Use xmalloc instead of malloc.
Adam Dickmeiss [Thu, 8 Nov 2007 17:20:11 +0000 (17:20 +0000)]
Ignore test program
Adam Dickmeiss [Thu, 8 Nov 2007 17:15:13 +0000 (17:15 +0000)]
Avoid locale member in icu_tokenizer.
The locale member of icu_tokenizer is not needed.
Adam Dickmeiss [Thu, 8 Nov 2007 17:08:10 +0000 (17:08 +0000)]
Function icu_casemap_create no longer takes locale parameter.
Function icu_casemap_create no longer takes locale parameter. Instead,
icu_casemap_casemap takes a locale parameter.
Adam Dickmeiss [Thu, 8 Nov 2007 13:35:13 +0000 (13:35 +0000)]
Refactor ICU WRT locale.
The locale parameter is no longer a parameter for icu_chain_xml_config
and locale is read as an attribute for top XML node. The icu_casemap
locale is no longer a member and a parater for icu_casemap_casemap
instead. This saves a copy of a locale in each casemap.
Adam Dickmeiss [Thu, 8 Nov 2007 09:29:38 +0000 (09:29 +0000)]
Public header icu.h, private header icu_I18N.h .
Adam Dickmeiss [Thu, 8 Nov 2007 09:07:14 +0000 (09:07 +0000)]
Indent
Adam Dickmeiss [Thu, 8 Nov 2007 09:03:11 +0000 (09:03 +0000)]
Changed header - part of YAZ not Pazpar2
Adam Dickmeiss [Thu, 8 Nov 2007 08:17:17 +0000 (08:17 +0000)]
For ICU enabled YAZ defined YAZ_HAVE_ICU instead of HAVE_ICU.
Adam Dickmeiss [Wed, 7 Nov 2007 10:19:12 +0000 (10:19 +0000)]
For ICU, set ICU status to OK for some public functions.
The ICU status code, UErrorCode, is now set to U_ZERO_ERROR in
the preamble of functions icu_chain_create, icu_chain_xml_config
and icu_chain_next_token.
Adam Dickmeiss [Wed, 7 Nov 2007 09:50:24 +0000 (09:50 +0000)]
ICU element index and sortkey no longer available.
Adam Dickmeiss [Tue, 6 Nov 2007 10:27:30 +0000 (10:27 +0000)]
For ICU, locale, is a char ptr rather than uint8_t ptr.
The type uint8_t is a ICU type that we don't want to use if we
can avoid it.
Adam Dickmeiss [Thu, 1 Nov 2007 10:32:43 +0000 (10:32 +0000)]
Fixed printf of pointer
Adam Dickmeiss [Thu, 1 Nov 2007 10:29:28 +0000 (10:29 +0000)]
Fixed link to OID tools
Adam Dickmeiss [Wed, 31 Oct 2007 21:58:07 +0000 (21:58 +0000)]
Fixed bug #1122: Certain record IDs are not found by searches.
The CQL parser preserves terms completely and \\s are no longer
removed. This changes CQL output for some test cases and so these
are updated accordingly. The CQL to PQF transform only treats
* and ? operators if they aren't escaped. A memory leak was also
fixed for the ANSIZ39.58 pattern case.
Marc Cromme [Mon, 29 Oct 2007 13:50:57 +0000 (13:50 +0000)]
changed ICU chain to return all tokens, also those which became empty under normalization. These might still have some display terms attached to them, which are to be used by the snippet display stuff or the scan term display stuff.
Marc Cromme [Mon, 29 Oct 2007 10:22:22 +0000 (10:22 +0000)]
renamed ICU chain functions to the following names relation the operation
to the actually processed token
int icu_chain_token_number(struct icu_chain * chain);
const char * icu_chain_token_display(struct icu_chain * chain);
const char * icu_chain_token_norm(struct icu_chain * chain);
const char * icu_chain_token_sortkey(struct icu_chain * chain);
Adam Dickmeiss [Mon, 29 Oct 2007 09:16:32 +0000 (09:16 +0000)]
Extended auth command so that NULL User, pass may be set.
This is to solve bug #1820.
Marc Cromme [Thu, 25 Oct 2007 10:04:32 +0000 (10:04 +0000)]
added special case with an empty ICU chain for 'raw' index processing without call of transliterators, case maps or tokenizators. It's optimized such that the following happens:
if (!chain->sort), no utf8 to utf16 translation occurs, and the original cstring is returned for the icu_chain-get_norm8() call.
if (chain->sort), utf8 to utf16 translation is made, the original cstring is returned for the icu_chain-get_norm8() call, and the utf16->sortkey conversion is run previous of calling icu_chain-get_sort8().
Marc Cromme [Thu, 25 Oct 2007 08:42:21 +0000 (08:42 +0000)]
simplified test case XML config root element tags
Marc Cromme [Thu, 25 Oct 2007 08:40:06 +0000 (08:40 +0000)]
removed dead code from ICU files
Marc Cromme [Thu, 25 Oct 2007 08:32:50 +0000 (08:32 +0000)]
added sortkey flag to icu_chain constructor
Adam Dickmeiss [Wed, 24 Oct 2007 22:10:24 +0000 (22:10 +0000)]
Avoid per-target CPPFLAGS.
If we use per-target flags, AM_PROG_CC_C_O is required or we just
dont it.
Marc Cromme [Wed, 24 Oct 2007 14:48:17 +0000 (14:48 +0000)]
Isolated ICU locale out of XML config file. Locale is now the second argument to chain = icu_chain_xml_config(xml_node, (uint8_t *) "en", &status);
Adam Dickmeiss [Wed, 24 Oct 2007 13:50:02 +0000 (13:50 +0000)]
Added yaz_match_glob
Marc Cromme [Wed, 24 Oct 2007 13:23:34 +0000 (13:23 +0000)]
fixed wrong token count when tokens disappear with ICU normalization
Marc Cromme [Wed, 24 Oct 2007 07:41:48 +0000 (07:41 +0000)]
fixed memory violation in ICU chain code, restructuring tokenizer to allow for non-empty tokens still ongoing work
Marc Cromme [Tue, 23 Oct 2007 07:51:57 +0000 (07:51 +0000)]
surpressed ICU test tokenization output
Adam Dickmeiss [Mon, 22 Oct 2007 17:52:09 +0000 (17:52 +0000)]
File m4/ac_check_icu.m4 part of dist.
Adam Dickmeiss [Mon, 22 Oct 2007 17:32:07 +0000 (17:32 +0000)]
Make ICU stuff compatible with old C comps. YAZ License.
Adam Dickmeiss [Mon, 22 Oct 2007 12:50:52 +0000 (12:50 +0000)]
Fixed lalibs output for ICU mode.
Adam Dickmeiss [Mon, 22 Oct 2007 12:40:04 +0000 (12:40 +0000)]
Debian package 3.0.15-1 with ICU.
Adam Dickmeiss [Mon, 22 Oct 2007 12:21:38 +0000 (12:21 +0000)]
Added ICU chain component - which used to be part of Pazpar2.
Adam Dickmeiss [Tue, 16 Oct 2007 10:45:53 +0000 (10:45 +0000)]
Removed material about Object identifiers for YAZ 2.
Adam Dickmeiss [Mon, 15 Oct 2007 20:45:46 +0000 (20:45 +0000)]
Added a record for testing of bug #1778.
Adam Dickmeiss [Mon, 15 Oct 2007 20:45:05 +0000 (20:45 +0000)]
Fixed bug #1778: Extended ASCII not escaped always (UTF-8 to MARC-8).
When in non-ASCII mode, YAZ now escapes combined character sequences.
A test was added in tsticonv.c.
Adam Dickmeiss [Mon, 15 Oct 2007 12:00:31 +0000 (12:00 +0000)]
Increase intermediate buffers for wrbuf+iconv.
The intermediate buffer for wrbuf+iconv is increased from 12 to 16
bytes. There may be MARC-8 sequence overflows overwise - with at
least reset (flush).
Adam Dickmeiss [Fri, 12 Oct 2007 14:22:19 +0000 (14:22 +0000)]
Re-organized MARC-8 charset creating code a bit.
Re-organized MARC-8 charset creating code a bit.. A define + code to work
on bug #1778 was added.
Adam Dickmeiss [Tue, 9 Oct 2007 06:00:55 +0000 (06:00 +0000)]
Revised types for functions cs_complete_auto, etc.
The functions cs_complete_auto, cs_complete_auto_head takes 'const char *'
as buffer rather than 'const unsigned char *'. This makes some casts
unnecessary. This also fixes tst_comstack which used to test
cs_complete_http. This function, howver is no longer public, so
cs_complete_auto is used instead.
Adam Dickmeiss [Sun, 7 Oct 2007 08:53:26 +0000 (08:53 +0000)]
Added HTTP tunnel facility for COMSTACK, bug #1752.
This is a facility that allows a Web proxy, such as squid, to tunnel
Z39.50 traffic. This facility is "transparent" to must applications
using YAZ. It's enabled by using pseudo transport connect: followed bý
the we proxy address, followed by command, then follwed by he regular
"virtual" addresss. For example,
connect:webproxy.com:3128,tcp:z3950.loc.gov:7090/voyager .
Adam Dickmeiss [Fri, 5 Oct 2007 16:46:55 +0000 (16:46 +0000)]
Implemented function cs_create_host.
Implemented function cs_create_host parses a YAZ host component. This
is to be used for HTTP Connect tunneling (bug #1752).
Adam Dickmeiss [Fri, 5 Oct 2007 15:11:36 +0000 (15:11 +0000)]
Changed blank (0x20) handling conversion to MARC-8.
The conversion to MARC-8 now always escapes the BLANK character outside
Basic ASCII. This is stricly speaking not necessary - blank is part
of "all" pages (sets). But some tools cannot handle it. YAZ, however,
treats blank as blank in all pages when MARC-8 is read (converted from).
Adam Dickmeiss [Wed, 26 Sep 2007 19:14:04 +0000 (19:14 +0000)]
Bump to 3.0.15.
Adam Dickmeiss [Tue, 25 Sep 2007 07:34:03 +0000 (07:34 +0000)]
Combine test from tstmarciso.sh and tstmarcxml.sh to tstmarc.sh .. Mostly
because the latter requires output from the former (so make -j2 could
break).
Adam Dickmeiss [Mon, 24 Sep 2007 22:22:13 +0000 (22:22 +0000)]
Update this test due to fix for bug #1666.
Adam Dickmeiss [Mon, 24 Sep 2007 12:51:10 +0000 (12:51 +0000)]
Fixed bug with shadowing of variable in lookup_marc8. Bug introduced
in previous commit (1.44).
Adam Dickmeiss [Sun, 23 Sep 2007 07:40:13 +0000 (07:40 +0000)]
Document more yaz-marcdump options
Adam Dickmeiss [Sun, 23 Sep 2007 07:21:51 +0000 (07:21 +0000)]
Added MARC-8 encoded Arabic record, marc9.marc, to test that blanks are
preserved.
Added UTF-8 encoded Chinese record, marccol3.u8.marc, which has the
Ideographic space sequence it.
Adam Dickmeiss [Sat, 22 Sep 2007 18:56:22 +0000 (18:56 +0000)]
Added test for conversion of utf8 e38080 to marc8 212321 (Ideographic
space).
Adam Dickmeiss [Sat, 22 Sep 2007 18:55:02 +0000 (18:55 +0000)]
Remove duplicates (already existing) in flat entries for MARC-8
tries. Previously only duplicates in index tries were treated.
This ensures that only first sequence (of multiple) is used for
conversion and takes precedence. For example utf E38080 is
converted then converted to marc8 212321 (not marc8 212320).
Adam Dickmeiss [Sat, 22 Sep 2007 18:49:55 +0000 (18:49 +0000)]
Treat space (20X) as present in all MARC-8 charsets
Adam Dickmeiss [Fri, 21 Sep 2007 16:04:48 +0000 (16:04 +0000)]
Fixed compilation for absent Libxml2.
Adam Dickmeiss [Fri, 21 Sep 2007 09:24:17 +0000 (09:24 +0000)]
Towards 3.0.14
Adam Dickmeiss [Fri, 21 Sep 2007 07:10:08 +0000 (07:10 +0000)]
Initialize diag_messÃage member of ZOOM_record
Adam Dickmeiss [Thu, 20 Sep 2007 17:22:45 +0000 (17:22 +0000)]
Fixed bug in conversion to XML from MARC: Non-XML characters in control
fields were not removed.
Adam Dickmeiss [Tue, 18 Sep 2007 19:19:16 +0000 (19:19 +0000)]
Removed reference to url.srw
Adam Dickmeiss [Tue, 18 Sep 2007 13:05:18 +0000 (13:05 +0000)]
Versino 3.0.13
Adam Dickmeiss [Mon, 17 Sep 2007 19:18:27 +0000 (19:18 +0000)]
Make an MARC conversion trie for each characterSet section rather than
codeTable. This is ensure we can deal with G0/G1 sequences. Bug #1464.
Adam Dickmeiss [Mon, 17 Sep 2007 19:15:22 +0000 (19:15 +0000)]
Log iconv tests that fail
Adam Dickmeiss [Mon, 17 Sep 2007 19:14:26 +0000 (19:14 +0000)]
Added wrbuf_verbose_str.
Adam Dickmeiss [Fri, 14 Sep 2007 21:39:36 +0000 (21:39 +0000)]
Use updated codetables.xml as provided by Larry Dixson (bug #1464).
Adam Dickmeiss [Fri, 14 Sep 2007 20:55:57 +0000 (20:55 +0000)]
Bump version to 3.0.13
Adam Dickmeiss [Wed, 12 Sep 2007 06:49:37 +0000 (06:49 +0000)]
Towards 3.0.12
Adam Dickmeiss [Tue, 11 Sep 2007 16:41:37 +0000 (16:41 +0000)]
More news
Adam Dickmeiss [Tue, 11 Sep 2007 16:36:51 +0000 (16:36 +0000)]
ZOOM C announces SRU version 1.2.
Adam Dickmeiss [Tue, 11 Sep 2007 15:42:43 +0000 (15:42 +0000)]
More news