User:Kiril kovachev/BED Project

From Wiktionary, the free dictionary
Jump to navigation Jump to search

Project abstract[edit]

The intent of this project is to transcribe the contents of the Bulgarian Etymological Dictionary into Wikitext form so that it can be more reliably understood and to be used as a source of verifiable information. The texts are currently only available in scanned form, which naturally makes accessing them much more laborious than it need be, as searching a particular term precisely is currently impossible.

Means[edit]

The project will be undertaken through the use of an OCR (optical character recognition) interface, which is able to transcribe each PNG image approximately into text. The unclear elements can then be corrected by a human editor.

PDFs[edit]

I have compiled PDF forms of the available volumes, which I've uploaded to a Google Drive if anyone would wish to view them. The online reader suffers from significant buffering, at least on my end, and so a native reader would fare much better in assisting productivity.

Tomes[edit]