[Dspace-general] Sort order of author names with diacritics

Frank wcs at iist.unu.edu
Mon Apr 5 22:29:03 EDT 2004


We just started using DSpace to build a repository of research reports and
have been very impressed so far.

But we have a problem: author names involving diacritical marks are sorted
very strangely.  For example, the name Ozler, where the O has an umlaut,
entered as Ö sorts between Azam and Azzoni.  It seems that the O-umlaut
sorts as an A.  Similar is Soricut when the S has a cedilla (Ş).
Soricut sorts between Aoki and Appleton: again the diacritic sorts as an A.

These names were input using the bulk input method, using the standard
&#nnn; encoding.

If such names are entered in forms using this encoding the resulting names
sort before any unaccented characters: looks like they are sorted as
starting with &.

The database was created with Unicode enabled.

We would be grateful for any help.

Regards,

Frank Wong



More information about the Dspace-general mailing list