Chinese Text Project

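Annotation client

The annotation client is a ctext plugin designed to make semantic annotation of classical texts efficient. Before reading these instructions, you are advised to consult the basic principles and goals of semantic annotation.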

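Installing the plugin

Before using the annotation client, please create a free account and log in. Once logged in, click here to add the annotation client to your account, then click the "Install" link.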

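Loading a text

The simplest way to load a text is to open the text you want to annotate on this site and then click the "Annotate" link on the right-hand side. (If you cannot find the "Annotate" link, please check again that the plugin has been installed.)

The text will be loaded into the client, together with any annotations that already exist in it.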

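Automatic annotation

The annotation client can use data from the data wiki knowledge base to list content in the text that may need annotating. Click the "Automatic annotation" button in the client. After a few seconds, the system will display the candidate annotations it has found in the text. Automatically created candidate annotations are shown with a dark gray background, with a colored underline indicating the default annotation type. All annotations with a gray background are "unconfirmed" or "candidate" annotations: they are recommended automatically by the system but are not necessarily correct, so they are not stored when the data is saved or downloaded.

For each candidate annotation, the client provides relevant information for judging whether the annotation is correct and which entity it should be linked to. For example, when "乾德" appears in a text, these two characters may refer to the era "乾德" of 宋太祖, to the era "乾德" of 前蜀后主, or even to 李乾德, the son of 李日尊, or to some other entity. The main work of annotation is for the user to state explicitly, using the context and an understanding of the text, which entity the text refers to at this point.

Clicking a candidate annotation makes the client display one or more related entities or date annotations. Each row represents a different annotation or entity, with its type (e.g. era, person, place) shown beside its name. In addition, each choice provides several link options.

During annotation, these links can be used to work out whether any of the items represents the correct entity. Clicking the corresponding "Y" link confirms an object as the target of the annotation. Once confirmed, the annotation's background becomes colored, indicating that it is a confirmed annotation. If the wrong object was chosen, click the "Change" link and select the correct one, or click "X" to delete the annotation.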

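Manual annotation

Click and drag with the mouse over unannotated text to create a new annotation for the selected characters. New annotations can only be created over text that contains no existing annotation; if an annotation is already present, first delete it by clicking the "X" link at its top right corner.

After a new annotation is created, the system lists entities with related names. If the list contains the correct choice, proceed as for automatic annotation; if no suitable option is listed, use one of the following methods:

  1. Search for an entity by keyword - Type a keyword (e.g. the entity's full name, an alternative name, or its identifier) into the "Search" input box. If the correct entity is found, click "Y" to confirm it.
  2. Create a new entity - Click the link corresponding to the type of entity and annotation you want to create. This will appear as a new confirmed annotation. When you subsequently save the text to ctext, a new entity will be created.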
Always try to confirm that a matching entity does not exist before creating a new one.

Annotating dates

Please read the background notes on date annotation before annotating dates. When an annotation is created that is of the "date" type, additional fields will be available in the popup box, labeled Year, Month, and Day. These must be set to correspond to the meaning of the date in its actual context. For example, a date "三年二月甲子" should have Year set to 3, Month set to 2, and Day set to 甲子. The same should be done in cases where the context of the date makes clear what these values should be, even if the literal date does not contain them. For example, if the text containing the previous example date then went on to refer to a date "庚午", the correct annotation for that date would be Year 3, Month 2, Day 庚午. When a date refers only to a month and not a particular day, the Day field should be set to "N/A"; similarly, when a date refers only to a year, both Month and Day should be set to "N/A".
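The rules above for filling in Year, Month, and Day can be sketched in Python. This is only an illustration under stated assumptions, not the client's actual code: the names `DateFields`, `cn_to_int`, and `resolve_date` are hypothetical, only sexagenary days and numerals below 100 are handled, and the context is simply the previously resolved date.

```python
import re
from dataclasses import dataclass
from typing import Optional, Union

NA = "N/A"
STEMS = "甲乙丙丁戊己庚辛壬癸"
BRANCHES = "子丑寅卯辰巳午未申酉戌亥"
DIGITS = {c: i for i, c in enumerate("一二三四五六七八九", start=1)}

# Optional year, optional month, optional sexagenary day, in that order.
DATE_RE = re.compile(
    r"(?:(?P<year>[元一二三四五六七八九十]+)年)?"
    r"(?:(?P<month>[正一二三四五六七八九十]+)月)?"
    rf"(?P<day>[{STEMS}][{BRANCHES}])?"
)


@dataclass(frozen=True)
class DateFields:
    """Hypothetical container for the three fields of a date annotation."""
    year: Union[int, str]
    month: Union[int, str]
    day: str


def cn_to_int(s: str) -> int:
    """Convert a Chinese numeral below 100 (or 元 / 正, both meaning 1)."""
    if s in ("元", "正"):
        return 1
    tens, has_ten, ones = s.partition("十")
    if has_ten:
        return (DIGITS[tens] if tens else 1) * 10 + (DIGITS[ones] if ones else 0)
    return DIGITS[s]


def resolve_date(literal: str, context: Optional[DateFields] = None) -> DateFields:
    """Fill Year/Month/Day from the literal, falling back on the context."""
    m = DATE_RE.fullmatch(literal)
    if not literal or m is None:
        raise ValueError(f"unsupported date literal: {literal!r}")
    year_s, month_s, day = m.group("year", "month", "day")
    if year_s is not None:
        # A literal year resets the month: a bare year has Month "N/A".
        year = cn_to_int(year_s)
        month = cn_to_int(month_s) if month_s else NA
    else:
        if context is None:
            raise ValueError("no year in the literal and no context to inherit from")
        year = context.year
        month = cn_to_int(month_s) if month_s else context.month
    return DateFields(year, month, day if day else NA)


first = resolve_date("三年二月甲子")   # Year 3, Month 2, Day 甲子
later = resolve_date("庚午", first)    # inherits Year 3 and Month 2
```

A real implementation would also need to handle numeric days such as "十四日" and intercalary months, which this sketch leaves out.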

In addition to year, month, and day, every date annotation must be linked to an era entity (or, for rulers whose reign dates are used without eras, a person entity). The annotation client will offer suggestions based on confirmed era references occurring prior to the selected date in the text. So the simplest way to correctly annotate a date like "大中祥符三年四月十四日" is to first confirm an annotation marking "大中祥符" as referring to the era 大中祥符, and then confirm the entity and values that the client suggests for the date. In some cases, the correct era (and other values) will not be identified automatically, and it will be necessary to supply the correct values. To choose a different era, type the era name into the "Search" box, and confirm the correct selection.
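As a rough illustration of how suggestions could be drawn from earlier confirmed era references, the following Python sketch collects the confirmed era annotations that precede a date. The `Annotation` class, the `suggest_eras` function, and the nearest-first ordering are assumptions made for this example; the client's real ranking is not documented here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Annotation:
    """Hypothetical annotation record: character offset, text, type, status."""
    start: int
    text: str
    kind: str        # e.g. "era", "date", "person"
    confirmed: bool


def suggest_eras(annotations: List[Annotation], date_start: int) -> List[Annotation]:
    """Return confirmed era annotations located before the date, nearest first."""
    prior = [
        a for a in annotations
        if a.kind == "era" and a.confirmed and a.start < date_start
    ]
    return sorted(prior, key=lambda a: a.start, reverse=True)


annotations = [
    Annotation(start=0, text="大中祥符", kind="era", confirmed=True),
    Annotation(start=40, text="天禧", kind="era", confirmed=False),  # still a candidate
    Annotation(start=90, text="乾兴", kind="era", confirmed=True),   # after the date
]
suggestions = suggest_eras(annotations, date_start=50)
print([a.text for a in suggestions])
```

Only the first annotation qualifies in this example: the second is unconfirmed and the third occurs after the date. This is why confirming the era annotation first makes the date easier to annotate.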

Saving and exporting

If you are confident that your changes are correct and in accordance with these guidelines, you can contribute them by clicking the "Save to ctext" link (this link is only shown after you have made changes to the annotations). Only confirmed annotations are saved, so it is not necessary to remove unconfirmed annotations suggested by the annotation client.

You can also save a local copy of your annotated text in XML, by clicking the "Export as XML" link. This will allow your web browser to download a file containing your annotations. As in the case of saving to ctext, only confirmed annotations are saved.
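The "only confirmed annotations are saved" rule can be pictured with a short Python sketch that serializes a text while skipping unconfirmed annotations. The element name `entity`, the `type` attribute, and the dictionary keys are purely hypothetical and do not describe the actual XML produced by "Export as XML"; the sketch also assumes annotations never overlap.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List


def export_confirmed(text: str, annotations: List[Dict]) -> str:
    """Serialize text to XML, wrapping only confirmed annotations in elements."""
    root = ET.Element("text")
    cursor = 0
    last = None  # most recently added child element, if any
    confirmed = sorted(
        (a for a in annotations if a["confirmed"]), key=lambda a: a["start"]
    )
    for a in confirmed:
        plain = text[cursor:a["start"]]
        # Plain text goes before the first child or after the previous child.
        if last is None:
            root.text = plain
        else:
            last.tail = plain
        last = ET.SubElement(root, "entity", {"type": a["type"]})
        last.text = text[a["start"]:a["end"]]
        cursor = a["end"]
    rest = text[cursor:]
    if last is None:
        root.text = rest
    else:
        last.tail = rest
    return ET.tostring(root, encoding="unicode")


text = "庚子钦差大臣林则徐道卒"
annotations = [
    {"start": 0, "end": 2, "type": "date", "confirmed": False},   # dropped
    {"start": 6, "end": 9, "type": "person", "confirmed": True},  # kept
]
print(export_confirmed(text, annotations))
```

The unconfirmed date candidate leaves no trace in the output, which is why there is no need to delete candidates by hand before saving or exporting.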

Annotating using the keyboard

In some materials, certain types of annotation may be repetitive, and the task of moving to the next annotation and approving it can be inefficient using a mouse. To help in these cases, certain keys on the keyboard can be used together with mouse actions to make the task more efficient:

By default, these keys move through all defined annotations. In some cases - particularly annotation of dates in historical texts - it may be useful to set these keys to move over only certain types of annotation, such as eras and dates. This can be done by deselecting tag types from the list at the top right of the annotation client: only those annotations which are of the selected types (or for which the first suggested candidate is of one of the selected types) will be included in the keyboard navigation functions.
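The filtering rule for keyboard navigation can be sketched as follows. The class and function names are hypothetical, and the sketch assumes that navigation stops at the end of the text rather than wrapping around, which the description above does not specify.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class Annotation:
    """Hypothetical record: a confirmed type, or a ranked list of candidate types."""
    text: str
    kind: Optional[str] = None                       # set once confirmed
    candidate_kinds: List[str] = field(default_factory=list)


def is_included(a: Annotation, selected: Set[str]) -> bool:
    """An annotation takes part in navigation if its own type is selected,
    or if its first suggested candidate is of a selected type."""
    if a.kind in selected:
        return True
    return bool(a.candidate_kinds) and a.candidate_kinds[0] in selected


def next_index(annotations: List[Annotation], current: int, selected: Set[str]) -> Optional[int]:
    """Index of the next annotation after `current` that passes the filter."""
    for i in range(current + 1, len(annotations)):
        if is_included(annotations[i], selected):
            return i
    return None


annotations = [
    Annotation("大中祥符", kind="era"),
    Annotation("林则徐", kind="person"),
    Annotation("三年二月甲子", candidate_kinds=["date", "person"]),
    Annotation("乾德", candidate_kinds=["person", "era"]),
]
selected = {"era", "date"}
print(next_index(annotations, 0, selected))
```

With only eras and dates selected, the person annotation is skipped, and so is the last candidate, because its first suggestion is a person even though an era appears lower in its list.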

Extracting knowledge claims

Texts that have been partially or completely marked up can be used as evidence for knowledge claims. A knowledge claim represents a piece of information about an historical entity - such as a person or place. For instance, a primary source might contain information like this:

庚子钦差大臣林则徐道卒,
This annotated fragment - from the 清史稿 - can be used as primary source evidence that the person 林则徐 (ctext:186523) died on a particular date: specifically, 道光三十年十一月庚子 (which corresponds to 15 December 1850 in the Gregorian calendar).

In the annotation client, there are two ways of creating new knowledge claims:

  1. Manual extraction - using the mouse, drag to select the exact sentence or sentence fragment that supports the claim you want to add (note: this fragment must contain at least one annotation). The annotation client will suggest candidate subjects for your claim; click the appropriate subject - for instance in the above example, we would click on "林则徐". The client will then display on the right a form allowing you to add a claim about the selected entity based on your chosen evidence; select the appropriate verb (e.g. 'died-date') and target (e.g. 道光三十年十一月庚子 1850/12/15 [date:583078/30/11/37] ), and click "Add" to save the new claim.
  2. Automatic extraction - click the "Extract" button. Suggested claims will be extracted and highlighted in the text. To review and add a claim, click the "►" icon at the left of the highlighted section of text. If a complete claim has been identified, and you are satisfied that the evidence highlighted supports the claim, click the "Save" link to save it to the data wiki.
In either case, when annotating please make sure to follow the annotation conventions, and in general try to use existing annotations as a guide.
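To make the shape of a knowledge claim concrete, here is a minimal Python sketch of a claim as a subject, verb, target, and supporting evidence. The `Claim` class and its field names are hypothetical and not the data wiki's schema; the identifiers are copied from the example above. The helper `sexagenary_index` rests on a further assumption: the final "37" in the example target happens to equal the position of 庚子 in the sixty-day cycle (counting 甲子 as 1), but whether the target encodes the day this way is not confirmed by this guide.

```python
from dataclasses import dataclass
from typing import Tuple

STEMS = "甲乙丙丁戊己庚辛壬癸"
BRANCHES = "子丑寅卯辰巳午未申酉戌亥"


def sexagenary_index(day: str) -> int:
    """1-based position of a stem-branch pair in the 60-cycle (甲子 = 1)."""
    for n in range(60):
        if STEMS[n % 10] + BRANCHES[n % 12] == day:
            return n + 1
    raise ValueError(f"not a sexagenary pair: {day!r}")


@dataclass(frozen=True)
class Claim:
    """Hypothetical knowledge claim backed by an annotated text fragment."""
    subject: str                    # entity the claim is about
    verb: str                       # e.g. "died-date"
    target: str                     # value or entity the verb points to
    evidence: str                   # sentence or fragment supporting the claim
    annotated_spans: Tuple[str, ...]  # annotations found inside the evidence

    def __post_init__(self) -> None:
        # The selected fragment must contain at least one annotation.
        if not self.annotated_spans:
            raise ValueError("evidence must contain at least one annotation")


claim = Claim(
    subject="ctext:186523",
    verb="died-date",
    target="date:583078/30/11/37",
    evidence="庚子钦差大臣林则徐道卒",
    annotated_spans=("庚子", "林则徐"),  # assumed annotations in the fragment
)
print(sexagenary_index("庚子"))
```

Keeping the evidence text alongside the claim reflects the point made above: a claim is only as good as the annotated primary source fragment that supports it.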