Search interfaces

The ONCOJ is associated with two user interfaces for searching. Both were developed by Professor Alastair Butler in collaboration with the project “Development of and linguistic research with a parsed corpus of Japanese” at NINJAL.

Tregex-based search function

A new corpus interface was launched in the summer of 2020.

This new interface has two advantages: it enables search with Tregex, a more powerful search tool than TGrep-lite, and it directly accesses the most up-to-date working files of the ONCOJ database. Additional features of interest include an easily viewed morpheme-by-morpheme correspondence between Old Japanese and English dictionary entries for all of the texts. This interface can be accessed here.

TGrep-lite-based search function

A limited search interface, most useful for browsing the content of the ONCOJ, has been associated with the ONCOJ since its first publication in the spring of 2018. This search interface can be accessed here.

This interface gives access to the static data uploaded to the present website.

The syntax for searching is called TGrep-lite and is fully documented within the search interface. TGrep-lite allows you to search for strings (terminal nodes) or parts of strings, and for tags (higher node labels) or parts of tags. For example, enter a string that corresponds to a full word such as “kapi” (shell, worth, rear, vale, etc.), or (in square brackets) a part-of-speech or phrase-level node label such as “[NP]” (noun phrase), or a complex expression such as “no == ({PHON} > [COP])” (“no” written phonographically, with the part-of-speech of “copula”). Clicking “Submit” triggers the search.
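The difference between searching for strings and searching for tags can be pictured with a small Python sketch. This is only an illustration of the idea, not TGrep-lite itself: the bracketed toy tree, its labels, and the helper names (parse, terminals, labels) are invented for the example, and plain Python comparisons stand in for the actual search syntax.

```python
import re

def parse(text):
    """Parse a bracketed tree into (label, children) tuples; terminals are strings."""
    tokens = re.findall(r"\(|\)|[^\s()]+", text)
    pos = 0

    def read():
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        if tok != "(":
            return tok                # a terminal node (a string)
        label = tokens[pos]           # the node label (a tag)
        pos += 1
        children = []
        while tokens[pos] != ")":
            children.append(read())
        pos += 1                      # skip the closing ")"
        return (label, children)

    return read()

def terminals(node):
    """Yield every terminal string in the tree, left to right."""
    if isinstance(node, str):
        yield node
    else:
        for child in node[1]:
            yield from terminals(child)

def labels(node):
    """Yield every node label (tag) in the tree, top down."""
    if not isinstance(node, str):
        yield node[0]
        for child in node[1]:
            yield from labels(child)

# Hypothetical toy tree; not taken from the ONCOJ data.
tree = parse("(IP (NP (N kapi)) (VB ari))")

full_word = [t for t in terminals(tree) if t == "kapi"]   # a full string
part_word = [t for t in terminals(tree) if "ap" in t]     # part of a string
noun_phrases = [l for l in labels(tree) if l == "NP"]     # a tag

print(full_word, part_word, noun_phrases)
```

In the sketch, the word search only ever inspects the leaves of the tree, while the tag search only inspects the labels above them, which is the distinction the plain and square-bracketed forms draw in the search box.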

Submitting a well-formed search expression opens a results page with attestations matched with text ID numbers that double as links to the corresponding trees. Once inside the Search interface, navigation to all the available functions (Tags, Dictionary, String search, Tree search) can be done through the buttons at the top of the Search interface page. Full documentation for each function is available through the “About” buttons (e.g., “About tree search”). There is also a button for returning to the ONCOJ front page.

Once you arrive at an Analysis view, you can click the lemma ID of any item and trigger a Dictionary search which opens a search result page in the Dictionary with the entry for that item.

TGrep-lite also allows you to specify structural relations between nodes in a search. For example, in order to find genitive-marked subject particle phrases preceding wo-marked object particle phrases, enter the search expression “([SBJ] < [GEN]) $.. ([OB1] < ([P] < wo))” in the search box and click “Submit”.
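To make the structural relations in that expression concrete, here is a minimal Python sketch. It assumes the operators behave as in the Tregex and TGrep2 conventions, where “A < B” means that A immediately dominates B and “A $.. B” means that A is a sister of B and precedes it; the TGrep-lite documentation inside the search interface remains the authority. The toy tree, its labels, and the helper names (has_tag, dominates, sister_precedes) are invented for the example, and treating a bracketed tag as one hyphen-separated part of a label is likewise an assumption.

```python
# Hypothetical toy tree as (label, children) tuples; terminals are strings.
# It is not taken from the ONCOJ data.
TREE = ("IP", [
    ("PP-SBJ", [("NP", [("N", ["kimi"])]), ("P-GEN", ["ga"])]),
    ("PP-OB1", [("NP", [("N", ["pana"])]), ("P", ["wo"])]),
    ("VB", ["miru"]),
])

def subtrees(node):
    """Yield every non-terminal node in the tree, top down."""
    if not isinstance(node, str):
        yield node
        for child in node[1]:
            yield from subtrees(child)

def has_tag(node, name):
    """Assumed reading of [NAME]: NAME is one hyphen-separated part of the label."""
    return not isinstance(node, str) and name in node[0].split("-")

def dominates(node, test):
    """Assumed reading of A < B: node has an immediate child satisfying test."""
    return not isinstance(node, str) and any(test(c) for c in node[1])

def is_gen_subject(node):
    # ([SBJ] < [GEN])
    return has_tag(node, "SBJ") and dominates(node, lambda c: has_tag(c, "GEN"))

def is_wo_object(node):
    # ([OB1] < ([P] < wo))
    return has_tag(node, "OB1") and dominates(
        node,
        lambda c: has_tag(c, "P") and dominates(c, lambda t: t == "wo"),
    )

def sister_precedes(tree, first, second):
    """Assumed reading of A $.. B: A and B share a parent and A comes before B."""
    hits = []
    for parent in subtrees(tree):
        kids = parent[1]
        for i, a in enumerate(kids):
            if first(a) and any(second(b) for b in kids[i + 1:]):
                hits.append(a)
    return hits

hits = sister_precedes(TREE, is_gen_subject, is_wo_object)
print([node[0] for node in hits])
```

Swapping the two tests in the last call returns no hits for this tree, because the relation is sensitive to the order of the two sisters.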
