CHANGELOG

   1
   2 --- 1.3.5 2003/02/20
   3
   4 Updated for newer version of YAZ (introduction of string schema).
   5
   6 Directory examples/zthes now part of distribution (was missing
   7 in previous release).
   8
   9 New .abs directive, systag, that control where to put retrieval
  10 information. The directive takes two arguments: system tag, element name.
  11 System tag is one of : rank, sysno, size.
  12
  13 --- 1.3.4 2002/11/26
  14
  15 Perl Filter and Perl API. By Peter Popovics.
  16
  17 For zebra.cfg, if no profilePath is specified, directory
  18  (prefix)/share/idzebra/tab
  19 is used.
  20
  21 Zebra Examples in examples . Zebra tests in test.
  22
  23 Bug fix: sort index was not properly modified on
  24 record updates/deletes.
  25
  26 Fix handling of character entities for sgml filter.
  27
  28 Move data1 to Zebra (used to be part of YAZ).
  29
  30 --- 1.3.3 2002/10/05
  31
  32 Fix character encoding of scan response terms.
  33
  34 Fix character decoding of scan request terms.
  35
  36 Fix ESpec handling (requires YAZ 1.9.1)
  37
  38 Fix searches for complete fields.
  39
  40 --- 1.3.2 2002/09/09
  41
  42 When name zebra is used in a filename or directory 'idzebra' is used
  43 instead to avoid confusion with GNU zebra (routing software).
  44
  45 Zebra server stops with a fatal error if config file cannot be read.
  46
  47 New config setting, followLinks, that controls whether update of files
  48 should follow symbolic. Set it to 1 (for enable) or 0 (to disable).
  49 By default symbolic links are followed.
  50
  51 Fix MARC transfer . MARC fields had wrong data for multiple fields.
  52
  53 XML record reader moved from YAZ to Zebra, to make YAZ less
  54 dependant on external libraries.
  55
  56 Zebra uses yaz_iconv which is mini iconv library supporting UTF-8,
  57 UCS4, ISO-8859-1. This means that Zebra does UNICODE even
  58 on systems that doesn't offer iconv.
  59
  60 XML record reader supports external system entities.
  61
  62 --- 1.3.1 2002/08/20
  63
  64 New .abs-directive "xpath" that takes one argument: "enable"
  65 or "disable" to enable and disable XPath -indexing. If no "xpath"
  66 direcive is found in .abs-file , XPath-indexing is disabled to ensure
  67 backwards compatibility. For missing .abs-files XPath-indexing is
  68 enabled so that such records are searchable.
  69
  70 Zebra warns about missing .abs-file only once (for each type).
  71
  72 Fixed a bug in file update where already-inserted files could
  73 be treated as "new".
  74
  75 --- 1.3.0 2002/08/05
  76
  77 Zebra license changed to GNU GPL.
  78
  79 XPath-like queries used when RPN string attributes are used, eg.
  80    @attr 1=/portal/title sometitle
  81    @attr 1=/portal/title[@xml:lang=da] danishtitle
  82    @attr 1=/portal/title/@xml:lang da
  83    @attr 1=//title sometitle
  84
  85 Zebra uses UTF-8 internally:
  86 1) New setting "encoding" for zebra.cfg that specifies encoding for
  87 OCTET terms in queries and record encoding for most transfer syntaxes
  88 (except those that use International Strings, such as GRS-1).
  89 2) The encoding of International strings is UTF-8 by default. It
  90 may be changed by character set negotiation. If character set
  91 negotiation is in effect and if records are selected for conversion
  92 these'll be converted to the selected character set - thus overriding
  93 the encoding setting in zebra.cfg.
  94 3) New directive "encoding" in .abs-files. This specifies the external
  95 character encoding for files indexed by zebra. However, if records
  96 themselves have an XML header that specifies and encoding that'll be used
  97 instead.
  98
  99 XML filter (-t grs.xml).
 100
 101 Multiple registers. New setting in resource 'root' that holds base
 102 directory for register(s). A group a databases may be put in separate
 103 register in directory root/reg by using db name 'reg/db1' ... 'reg/dbN'.
 104
 105 --- 1.1.1 2002/03/21
 106
 107 Fixes for Digital Unix
 108
 109 Implemented hits per term using USR:SearchResult-1.
 110
 111 New Zebra API. Locking system re-implemented.
 112
 113 --- 1.1.stable 2002/02/20
 114
 115 Rank weight can be controlled with attribute type 9. Default
 116 value is 34. Recommended values between 1-36.
 117
 118 --- 1.1 2001/10/25
 119
 120 Updated for YAZ version 1.8.
 121
 122 Added support for termsets - a result set of terms matching
 123 a given query. For @attr 8=<set> creates termset named <set>.
 124
 125 Added support for raw retrieval. Element Set Name R forces the
 126 text filter which returns the record in its original form.
 127
 128 Added numerical sort - triggered by structure=numeric (4=109).
 129
 130 Remote record import using Z39.50 Extended Services and Segments.
 131
 132 Fixed bug where updating a database with user-defined attributes
 133 could corrupt the register (bad storeKeys).
 134
 135 Multi-threaded version.
 136
 137 Fixed bug regarding proximity.
 138
 139 Documentation updates.
 140
 141 Fixed bug in record retrieval module that occured on 64-bit OSF
 142 architectures.
 143
 144 --- 1.0.1 2000/2/10
 145
 146 Fixed bug in makefile for WIN32.
 147
 148 Fixed bug in configure script - used bash-specific features.
 149
 150 --- 1.0 1999/12/10
 151
 152 Added support for multiple records in one file for filter grs.sgml.
 153
 154 Changed record index structure. New layout is incompatible with
 155 previous releases. Added setting "recordcompression" to control
 156 compression of records. Possible values are "none" (no
 157 compression) and bzip2 (compression using libbz2).
 158
 159 Added XML transfer syntax support for retrieval of structured records.
 160 Schema in CompSpec is recognised in retrieval of structured records.
 161
 162 Changed Tcl record filter so that it attemps to read  <filt>.tflt. If
 163 that fails, the filter reads the file <filt>.flt (regx style filter).
 164
 165 Implemented new Tcl record filter -  use grs.tcl.<filter> to enable it.
 166 Zebra's configure script automatically attempts to locate Tcl. For
 167 manual Tcl configuration use option --with-tclconfig=<path> to specify
 168 where Tcl's library files are located.
 169
 170 Implemented "compression" of Dictionary and ISAM system. Dictionary
 171 format HAS changed.
 172
 173 Added "tagsysno" directive to zebra.cfg to control under which tag the
 174 system ID is placed. Use tagsysno: 0 to disable Zebra's system number
 175 entirely.
 176
 177 Added "tagrank" as above.
 178
 179 Changed file naming scheme for register files from <name>.mf.<no> to
 180 <name>-<no>.mf.
 181
 182 Implemented "position"-flag for register type (as defined in
 183 default.idx). When set to zero no position (or seqence number) is
 184 saved in register for each word occurrence, thus saving some register
 185 space.
 186
 187 Implemented database mapping. Using mapdb one can specify a database
 188 to be mapped to one or more physical databases. Usage:
 189 mapdb <fromdb> <todb> ..
 190
 191 Added SOIF-filter. Thanks to Peter Valkenburg.
 192
 193 For the regx-filter "end element -record" may trigger a mark-of-record
 194 if outer level is reached.
 195
 196 Tag sets may be typed in the reference to it. From the .abs-file the
 197 "tagset" directive takes a third optional integer type for the tag set
 198 referenced. From a .tag-file the "include" directive takes a third
 199 optional type as well. The old "type" directive in the tag set itself
 200 is still recognized but acts as the default type for the tag set.
 201
 202 Zebra supports the specification of arbitrary attributes sets, schemas
 203 and tag sets, because of the change in YAZ' OID management system.
 204
 205 Fixed bug in Sort that caused it NOT to use character mapping as it
 206 should.
 207
 208 Zebra now uses GNU configure to generate Makefile(s).
 209
 210 Added un-optimised support for left and left/right truncation attributes.
 211
 212 Added support for relational operators on text when using RPN queries.
 213
 214 Added support for sort specifications in RPN queries. Type 7 specifies
 215 'sort' where value 1=ascending, value 2=descending. The use attribute
 216 specifies the field criteria as usual.  The term specifies priority
 217 where 0=first, 1=second, ...
 218
 219 Changed the way use attributes are specified in the recordId
 220 specification.
 221
 222 Maximum number of databases in one Zebra register increased.
 223
 224 New setting, databasePath, which specifies that first directory during
 225 update traversal is the database name (instead of a fixed one).
 226
 227 New setting, explainDatabase, which specifies that databases are
 228 EXPLAIN aware.
 229
 230 Modified Zebra so that it works with ASN.1 compiled code for YAZ.
 231
 232 Implemented EXPLAIN database maintenance. Zebra automatically
 233 generate - and update CategoryList, TargetInfo, DatabaseInfo,
 234 AttributeSetInfo and AttributeDetails records at this stage. The
 235 records may be transferred as GRS-1, SUTRS or Explain.
 236
 237 Fixed register spec so that colon isn't treated as size separator
 238 unless followed by [0-9+-] in order to allow DOS drive specifications.
 239
 240 Fixed two bugs in ISAMC system.
 241
 242 Changed the way Zebra keeps its maintenance information about attribute
 243 sets, available attributes, etc.. Records in "SGML" notation using an
 244 EXPLAIN schema is now used when appropriate.
 245
 246 Bug fix: Index didn't handle update/insert/delete of the same record
 247 (i.e. same recordId) in one run (one invocation of zebraidx). Only the
 248 first occurence of a record is considered.
 249
 250 Most searches now return correct number of hits.
 251
 252 New modular ranking system. Interested programmers are encouraged to
 253 inspect rank1.c and improve the algorithm.
 254
 255 Bug fix: Lock files weren't removed as they should on NT.
 256
 257 Implemented Z39.50 Sort. Zebra's sort handler uses use attributes to
 258 specify a "sort register". Refer to the gils sample records which refer
 259 to index type "s" which is specified as "sort" in the default.idx file.
 260 Each sort criteria can either be Ascending or Descending and at most
 261 three sort elements can be specified.
 262
 263 Bug fix: Character mapping didn't work for text files.
 264
 265 --- 1.0b1 1998/1/29
 266
 267 Simple ranked searches now return correct number of hits.
 268
 269 The test option (-s) only makes a read-lock on the index as well
 270 as using read-only operations anywhere.
 271
 272 Moved towards generic character mapping. Configuration file default.idx
 273 specifies character map files for register types w, p, u, etc.
 274
 275 Implemented "begin variant" for the sgml.regx - filter.
 276
 277 Fixed a few memory leaks.
 278
 279 Added support for C++, headers uses extern "C" for public definitions.
 280
 281 Bug fix: The show records facility (-s) only displayed information for
 282 the first record in a file (and not for every record in the file).
 283
 284 Added option "-f <n>" to limit the logging of record operations. After
 285 <n> records has been processed no logging is performed (unless errors
 286 occur).
 287
 288 Bug fix: the compressed ISAM system didn't handle update operations
 289 correctly.
 290
 291 Added setting, "maxResultSetSize", to hold the number of records to
 292 save in a result set.
 293
 294 Bug fix: Complete phrase did't work for search operations.
 295
 296 Bug fix: temporary result sets weren't deleted.
 297
 298 Reduced disk space for saved keys (storeKeys = 1).
 299
 300 Added optional, physical ANY (key replication)
 301
 302 Implemented proximity operator in search.
 303
 304 Bug fix: the path name buffers used by file match traversal routines
 305 have been extended to support long file names.
 306
 307 New C(ompressed) ISAM system. To enable it, specify "isam: c" in the
 308 configuration file. The resulting register without "storeKeys" is about
 309 half the size, and the memory used by zebraidx during phase 2 (merge) is
 310 reduced to a minimum.
 311
 312 Reworked the way Regexp-2 queries with error tolerance are handled and
 313 specified. The documentation has been updated accordingly.
 314
 315 Bug fix: Zebrasrv didn't search correctly when queries contained masking
 316 characters. This bug was introduced in 1.0a8.
 317
 318 Zebrasrv now tag records with the proper database name.
 319
 320 New settings, memMax and keyTmpDir.
 321
 322 Changed name of setting lockDir (previously called lockPath) and
 323 setTmpDir (previously called tempSetPath).
 324
 325 Generalized and changed record type specifications. In short, there are:
 326        text                plain SUTRS
 327        grs.sgml            structured, "SGML-like" syntax
 328        grs.regx.<filter>   structured, Regular expression filter
 329        grs.marc.<abs>      Reads *MARC records in the ISO2709 format. <abs>
 330                            is the name of an abstract syntax file.
 331 Bug fix: Result sets weren't sorted in operations involving boolean
 332 operations with "ranked" operands.
 333
 334 --- 1.0a8 1996/6/6
 335
 336 Added national character-handling subsystem.
 337
 338 Various fixes.
 339
 340 Small modifications to input filters and profiles.
 341
 342 Added support for SOIF syntax (with private OID).
 343
 344 --- 1.0a7 1996/5/16
 345
 346 Fixed buffer-size problem in indexing.
 347
 348 Added compression to temporary files for updating.
 349
 350 Added phrase registers.
 351
 352 Added dynamic mapping of search attribute to multiple termlists (ANY).
 353
 354 Scan support in multiple databases/registers.
 355
 356 Configuration settings are case-insensitive and single dash (-)
 357 characters are ignored in comparisons.
 358
 359 The index processing ignores empty files - warning given.
 360
 361 New option to zebraidx (-V) displays version information.
 362
 363 --- 1.0a6 1996/2/24
 364
 365 Fixed problem in file-update system.
 366
 367 Fixed problem in shadow system; register was sometimes corrupted after
 368 a commit operation.
 369
 370 --- 1.0a5 1996/2/10
 371
 372 Fixed problems in the ISAM subsystem. Caused difficulties when updating
 373 existing registers.
 374
 375 Fixed small problem in SUTRS-filter. A newline was sometimes inserted before
 376 the rank and record number.
 377
 378 Fixed bug in the isam subsystem - caused a malfunction when accessing
 379 words which occurred more than 10000 times.
 380
 381 Distribution should now include YAZ (Z39.50 protocol stack) to simplify
 382 installation.
 383
 384 Server can now run under inetd. Use option -i, and -w <directory> to
 385 set working directory to desired location.
 386
 387 New zebraidx command: clean - removes temporary shadow files.
 388
 389 Fixed bug in ISAM system. Occurred rarely during register updates.
 390
 391 Logging during index merge phase is improved. The remaining running
 392 time is estimated.
 393
 394 Temporary files generated by zebraidx are removed after each run.
 395
 396 Bug fix: Dictionary didn't handle 8-bit characters correctly; was obvious
 397 when doing scan operations in dictionaries with European characters.
 398
 399 --- 1.0a4 1996/01/11
 400
 401 A whole slew of updates, to make the first publicized release. Get the doc
 402 and check it out.
 403
 404 --- 1.0a3 1995/12/06
 405
 406 Memory-problems in ISAM fixed. More blocktypes added to the default setup
 407 to increase performance on larger databases.
 408
 409 Various minor changes in data management system.
 410
 411 --- 1.0a2 1995/12/05
 412
 413 A couple of portability-problems resolved.
 414
 415 Changed some malloc() to xmalloc().
 416
 417 --- 1.0a1 1995/11/28
 418
 419 First release.