Skip to content

Mirror of the CHISE Project's IDS database, to preserve it for use with the Tsukurimashou Project's IDSgrep

License

Notifications You must be signed in to change notification settings

zhangchn/chise-ids

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

On March 5, 2013, it was announced on the CHISE Project's mailing lists that
most of that project's servers would cease operation on March 11.  It is not
clear to me whether that is a temporary interruption or a permanent end to
the project's Web presence.  No replacement servers, nor time at which the
original servers might come up again, were mentioned.  The Japanese-language
posting sounded less final to me than the English-language posting, but that
could simply be a cultural politeness thing, and my Japanese isn't reliable
enough to be certain anyway.  Given that the project doesn't seem to have
updated in years, I fear the worst.  Since I maintain a project of my own
for which CHISE's IDS database is useful, I'm placing a snapshot (taken
March 6) of the CHISE IDS database here in my own space, against the
possibility of the original repository's present or future abandonment.

The original repository location from which I took the snapshot was:
   http://git.chise.org/git/chise/ids.git

The CHISE project's home (but this is one of the servers they said was going
down) is currently at:
   http://www.chise.org/

The intended purpose of this snapshot is to support IDSgrep, which is part
of the Tsukurimashou Project, whose home page is here:
   http://tsukurimashou.sourceforge.jp/

I want to be at most a curator, not a maintainer, of this data; don't expect
me to be able to answer questions about its original context within CHISE,
which is a large, complicated project with goals different from mine and
documented almost exclusively in Japanese.  I also don't plan to maintain a
snapshot of other parts of CHISE, only this part which is relevant to my own
project.  俺は通りすがりのバニーボーイです※ - I'm a passerby who is
redistributing the data under its free license.  I've added a copy of that
license, the GNU GPL version 2, which wasn't actually included in the
original repository but is referenced in the README below.

Matthew Skala
[email protected]
March 6, 2013

※ This is a pop culture reference; if you have to ask, you won't understand
the answer.

Material below these hyphens is the original README from CHISE:
------------------------------------------------------------------------

This directory holds the CHISE-IDS package which contains data and
utilities about structures of Han Ideographs (漢字).


* How to install

Please install XEmacs CHISE before install this package.

If XEmacs CHISE is installed in your system, please type

    % make install

in the directory of the CHISE-IDS distribution.


[Note] If you don't have XEmacs CHISE, the CHISE-base package may be
useful.  It is an installer package to install CHISE functionalities
including libchise, XEmacs CHISE, the CHISE-IDS package, some other
Emacs Lisp utilities, some fonts, etc.  It is available at:

	http://www.chise.org/dist/base/

In addition, if you use Mac OS X with Fink,

http://corpus.kanji.zinbun.kyoto-u.ac.jp/cgi-bin/gitweb.cgi?p=corpus/env.git;a=blob_plain;f=install-base-system_mac-fink.sh;hb=HEAD

is available and easier than plain CHISE-base package.


* IDS files

The following files contains the data about structures of Han
Ideographs (漢字).
    
    IDS-UCS-Basic.txt	CJK Unified Ideographs (U+4E00 〜 U+9FA5)
			of ISO/IEC 10646-1:2000
    IDS-UCS-Ext-A.txt	CJK Unified Ideographs Extension A
			(U+3400 〜 U+4DB5, U+FA1F and U+FA23)
			of ISO/IEC 10646-1:2000
    IDS-UCS-Compat.txt	CJK Compatibility Ideographs
			(U+F900 〜 U+FA2D, except U+FA1F and U+FA23)
			of ISO/IEC 10646-1:2000
    IDS-UCS-Ext-B-1.txt	CJK Unified Ideographs Extension B [part 1]
			(U-00020000 〜 U-00021FFF)
			of ISO/IEC 10646-2:2001
    IDS-UCS-Ext-B-2.txt	CJK Unified Ideographs Extension B [part 2]
			(U-00022000 〜 U-00023FFF)
			of ISO/IEC 10646-2:2001
    IDS-UCS-Ext-B-3.txt	CJK Unified Ideographs Extension B [part 3]
			(U-00024000 〜 U-00025FFF)
			of ISO/IEC 10646-2:2001
    IDS-UCS-Ext-B-4.txt	CJK Unified Ideographs Extension B [part 4]
			(U-00026000 〜 U-00027FFF)
			of ISO/IEC 10646-2:2001
    IDS-UCS-Ext-B-5.txt	CJK Unified Ideographs Extension B [part 5]
			(U-00028000 〜 U-00029FFF)
			of ISO/IEC 10646-2:2001
    IDS-UCS-Ext-B-6.txt	CJK Unified Ideographs Extension B [part 6]
			(U-0002A000 〜 U-0002A6D6)
			of ISO/IEC 10646-2:2001
    IDS-UCS-Compat-Supplement.txt
			CJK Compatibility Ideographs Supplement
			(U-0002F800 〜 U-0002FA1D)
			of ISO/IEC 10646-2:2001

These files are encoded by UTF-8.  The format of each line of the
files is:

    <CODEPOINT><TAB><CHARACTER><TAB><IDS>

or

    ;; <COMMENTS>

Each element means

    <TAB>	<HORIZONTAL TABULATION> (U+0009)
    <CODEPOINT>	code point
		U+hhhh		Hex form of UCS code point
				(U+0000 〜 U+FFFF)
		U-hhhhhhhh	Hex form of UCS code point
				(U+00000000 〜 U+7FFFFFFF)
    <CHARACTER>	character corresponding with <CODEPOINT>
    <IDS>	Ideographic Description Sequence
		(based on ISO/IEC 10646-1:2000 F.3.1; however
		Compatibility Ideographs and non-UCS Ideographs are
		also allowed)
    <COMMENTS>	comment


* License

This package is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2, or (at your option)
any later version.

This package is distributed in the hope that it will be useful, but
WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
General Public License for more details.

You should have received a copy of the GNU General Public License
along with this package; see the file COPYING.  If not, write to
the Free Software Foundation, Inc., 59 Temple Place - Suite 330,
Boston, MA 02111-1307, USA.


* Acknowledgment

The developing of the package was supported by the “Exploratory
Software Project” of Information-technology Promotion Agency, Japan.
Some data in the IDS-UCS* files are derived and expanded from the CDP
database developped by C.C. Hsieh and his team at Academia Sinica in
Taipei, Taiwan.

About

Mirror of the CHISE Project's IDS database, to preserve it for use with the Tsukurimashou Project's IDSgrep

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published