Chapter Proposal

Secrets of the secrets: text mining on Dao Canon

 

Abstract: This chapter performs text mining on Dao Canon utilizing the arulesSequences and tm packages of R.

Key words: text mining, Dao Canon, Chinese characters

Sections:

  1. The necessity of text mining on Daoist literature.
  2. Preprocessing: tricks & techs in dealing with Chinese characters.
  3. Mining with arulesSequences on TOC text.
  4. Mining with tm on Brief Introduction of the Canon.

Note: This chapter contains Chinese characters in manuscript body and tables, such as:

黃帝陰符經註       

   8

一卷  

1036

 

956 

道德真經註         

   6

三卷  

 117

杜光庭編 

 13 

太上老君說常清靜經註

   5

二卷  

  88

杜光庭撰 

  8 

周易參同契註       

   4

四卷  

  41

司馬承禎撰

  5 

北斗七元金玄羽章   

   2

五卷  

  32

王 嚞撰  

  5 

道德真經傳         

   2

十卷  

  29

俞琰註   

  5