在Facebook上關注我們,隨時得到最新消息 在Twitter上關注我們,隨時得到最新消息 在新浪微博上關注我們,隨時得到最新消息 在豆瓣上關注我們,隨時得到最新消息
中國哲學書電子化計劃

標記客戶端

標記客戶端是一個ctext插件,其目的為實現有效率的古典文獻語意標記。在閱讀這裡的使用說明前,建議您先參考語意標記的基本原則和目標

插件安裝

使用標記客戶端前,請建立免費的帳戶並登入。登入後,點此處把標記客戶端加入到您的帳戶,點擊後再按「安裝」連結即可。

文本載入

載入文本最簡單的方式是在本站上打開所欲標記的文本,然後點擊右手邊的」標記「連結。(如果找不到」標記「連結,請再次確認插件已經安裝好。)

文本會載入到客戶端,包括文本中已經存在的標記(若有)。

自動標記

標記客戶端可利用數據維基知識庫中的各種資料列出文本中可能需要標記的內容。點擊客戶端中的「自動標記」按鈕。幾秒後,系統會在文本中顯示所找到的候選標記。自動建立的候選標記都以深灰色背景顯示,另外有彩色下劃線表示默認的標記類型。所有灰色背景的標記為」未確定「或「候選」標記:系統自動推薦的,但未必正確,因此在保存或下載資料時,這些標記不會被儲存。

客戶端為每一個候選標記提供相關資訊,用以判斷該標記是否正確,並應該連結到什麼實體。例如,當文本中出現了」乾德「時,這兩個字有可能指稱 宋太祖年號」乾德「,也有可能指稱前蜀後主」乾德「年號,甚至也有可能指李日尊的兒子李乾德或其它的實體。標記的主要工作在於用戶藉由脈絡和對文本的理解明確註明文本在此處所指稱的實體是哪一個。

點擊一個候選標記,客戶端會顯示一個或多數個相關實體或時間標記。每一個行代表不同的標記或實體,在它的名稱旁邊會顯示它的類型(如:年號、人物、地方等)。另外,每一個選擇會提供以下類型的機種選項:

標記過程中,可使用這些連接來辨識其中有沒有哪一項表示正確的實體對象。點擊對應的「Y」連接將會把一個對象確認為該標記的對象。確認了之後,標記背景會改為彩色,表示這是一個已確認標記。如果標錯了對象,可點擊「改變」連接再選擇正確對象,或再點擊「X「刪除標記。

手動標記

用鼠標點擊並畫出未標記的文字內容,就可以為對應字詞建立新的標記。只能對不包括任何標記的文字建立新標記,如果已經有標記,可以先點擊其右上角的「X」連接刪除之。

建立了新標記後,系統會列出有相關名稱的實體。如果列單中有正確的選擇,就可以和自動標記一樣處理;如果沒有恰當的選項,可採取以下方法之一:

  1. 輸入字詞檢索實體 - 把關鍵詞(如:實體的全名、異名、辨識碼等)輸入到「檢索」輸入方塊中。如果找到正確的實體,點擊「Y」確認。
  2. 建立新實體 - Click the link corresponding to the type of entity and annotation you want to create. This will appear as a new confirmed annotation. When you subsequently save the text to ctext, a new entity will be created.
Always try to confirm that a matching entity does not exist before creating a new one.

Annotating dates

Please read the background notes on date annotation before annotating dates. When an annotation is created that is of the "date" type, additional fields will be available in the popup box, labeled Year, Month, and Day. These must be set to correspond to the meaning of the date in its actual context. For example, a date "三年二月甲子" should have Year set to 3, Month set to 2, and Day set to 甲子. The same should be done in cases where the context of the date makes clear what these values should be, even if the literal date does not contain them. For example, if the text containing the previous example date then went on to refer to a date "庚午", the correct annotation for that date would be Year 3, Month 2, Day 庚午. When a date refers only to a month and not a particular day, the Day field should be set to "N/A"; similarly, when a date refers only to a year, both Month and Day should be set to "N/A".

In addition to year, month, and day, every date annotation must be linked to an era entity (or, for rulers whose reign dates are used without eras, a person entity). The annotation client will offer suggestions based on confirmed era references occurring prior to the selected date in the text. So to correctly annotate a date like "大中祥符三年四月十四日", the simplest way to do this is to first confirm an annotation marking "大中祥符" as referring to the era 大中祥符, and then confirming the suggested entity and values for the date, which will be automatically suggested. In some cases, the correct era (and other values) will not be identified automatically, and it will be necessary to supply the correct values. To choose a different era, type the era name into the "Search" box, and confirm the correct selection.

Saving and exporting

If you are confident that the changes you have made are correct and in accordance with these guidelines, you can contribute the changes you have made to the annotations by clicking the "Save to ctext" link (note: this link is only shown after you have made changes to the annotations). Note: only confirmed annotations are saved - it is therefore not necessary to remove unconfirmed annotations suggested by the annotation client, since these will not be saved.

You can also save a local copy of your annotated text in XML, by clicking the "Export as XML" link. This will allow your web browser to download a file containing your annotations. As in the case of saving to ctext, only confirmed annotations are saved.

Annotating using the keyboard

In some materials, certain types of annotation may be repetitive, and the task of moving to the next annotation and approving it can be inefficient using a mouse. To help in these cases, certain keys on the keyboard can be used together with mouse actions to make the task more efficient:

By default, these keys move through all defined annotations. In some cases - particularly annotation of dates in historical texts - it may be useful to set these keys to move over only certain types of annotation, such as eras and dates. This can be done by deselecting tag types from the list at the top right of the annotation client: only those annotations which are of the selected types (or for which the first suggested candidate is of one of the selected types) will be included in the keyboard navigation functions.

Extracting knowledge claims

Texts that have been partially or completely marked up can be used as evidence for knowledge claims. A knowledge claim represents a piece of information about an historical entity - such as a person or place. For instance, a primary source might contain information like this:

1 庚子欽差大臣林則徐道卒,
This annotated fragment - from the 清史稿 - can be used as primary source evidence that the person 林則徐 (ctext:186523) died on a particular date: specifically, 道光三十年十一月庚子 (which corresponds to 15 December 1850 in the Gregorian calendar).

In the annotation client, there are two ways of creating new knowledge claims:

  1. Manual extraction - using the mouse, drag to select the exact sentence or sentence fragment that supports the claim you want to add (note: this fragment must contain at least one annotation). The annotation client will suggest candidate subjects for your claim; click the appropriate subject - for instance in the above example, we would click on "林則徐". The client will then display on the right a form allowing you to add a claim about the selected entity based on your chosen evidence; select the appropriate verb (e.g. 'died-date') and target (e.g. 道光三十年十一月庚子 1850/12/15 [date:583078/30/11/37] ), and click "Add" to save the new claim.
  2. Automatic extraction - click the "Extract" button. Suggested claims will be extracted and highlighted in the text. To review and add a claim, click the "►" icon at the left of the highlighted section of text. If a complete claim has been identified, and you are satisfied that the evidence highlighted supports the claim, click the "Save" link to save it to the data wiki.
In either case, when annotating please make sure to follow the annotation conventions, and in general try to use existing annotations as a guide.