Esihlokweni esilandelayo sizobheka i-TextSnatcher. Uma ungomunye wabasebenzisi abavame ukusebenza nabo OCR, ungathanda ukubona uhlelo lokusebenza olulula olwakhelwe phezu kohlelo lokusebenza olukhulu oluyinkimbinkimbi njengalolu I-Tesseract. uma ufuna indlela elula nengelula yokukopisha umbhalo ezithombeni ku-Gnu/Linux, ungabheka i-TextSnatcher, ingase ihambisane nalokho okufunayo.
I kungenzeka khipha umbhalo ezithombeni, amafayela e-PDF noma izinto ezifanayo, akukho okusha. Namuhla singathola amathuluzi amaningi ahlukene okwenza lo msebenzi, kodwa okwamanje awekho awenza kalula ngendlela i-TextSnatcher engenza ngayo.
Leli thuluzi lenza ukuqaphela uhlamvu olubonakalayo (OCR) ngemizuzwana, okuzovumela abasebenzisi kopisha ngokushesha umbhalo kusuka kunoma yini ebonakalayo esikrinini ukuya ebhodini lokunamathisela lesistimu, uyenze ilungele ukunamathiselwa kwenye indawo. Ukuqashelwa kohlamvu, okuvame ukwaziwa ngokuthi i-OCR (kusukela ku-English Optical Character Recognition), inqubo ehloselwe ukwenza imibhalo ibe yidijithali, ekhomba ngokuzenzakalelayo esithombeni, izimpawu noma izinhlamvu eziyingxenye yezinhlamvu ezithile, bese izigcina njengedatha. Ngakho-ke singakwazi ukusebenzisana nalokhu ngohlelo lokuhlela umbhalo.
Mayelana nokuxhumana kwalolu hlelo lokusebenza, bekungeke kube lula ukukusebenzisa. Kuzodingeka ukuthi siyiqale kuphela, chofoza inkinobho ethi 'Thatha Manje!'. Ngemva sizobona ithuluzi lokuthwebula isikrini elizenzakalelayo livela ukuze lithathe isithombe-skrini esigcwele, sithwebule iwindi lamanje noma sikhethe indawo ozosithwebula (kunconyiwe) sigxile kuphela embhalweni esifuna ukuwukopisha.
Izici ezijwayelekile ze-TextSnatcher
- Lolu hlelo luzosivumela kopisha umbhalo wezithombe kalula, singenza imisebenzi ye-OCR ngemizuzwana, ngemiphumela emihle impela.
- I-Akhawunti nge ukwesekwa kwezilimi eziningi. Lezi zingakhethwa enkinobheni engakwesokunxele, phezulu efasiteleni.
- Izosivumela kopisha umbhalo wezithombe ukhetha indawo.
- Kuzo uhlelo olusheshayo nolula ukulusebenzisa.
- Ungakwazi bona amavidiyo athile alolu hlelo esebenza in yakhe Indawo yokugcina izinto zeGitHub.
- Lolu hlelo lokusebenza isebenzisa i-Tesseract OCR 4.x ukuze ibone uhlamvu. Uma ungathanda ukwazi okwengeziwe, ungafunda mayelana I-Tesseract y I-Star Tesseract-Project.
Faka i-TextSnatcher ku-Ubuntu
Lolu hlelo singayithola itholakala njengephakethe leFlatpak ku I-Flathub. Uma usebenzisa Ubuntu 20.04 futhi ungenabo lobu buchwepheshe obunikwe amandla kusistimu yakho, ungaqhubeka Umhlahlandlela ukuthi osebenza naye wabhala kule blog esikhathini esedlule.
para faka lolu hlelo ku-Ubuntu, kuzofanele sivule kuphela i-terminal (Ctrl + Alt + T) bese senza umyalo kuyo:
flatpak install flathub com.github.rajsolai.textsnatcher
Lapho ukufakwa kohlelo sekuqediwe, kuzodingeka sibheke kuphela isiqalisi kukhompyutha yethu, noma sisebenze kutheminali ukuze qala uhlelo:
flatpak run com.github.rajsolai.textsnatcher
Uma ngemuva kokuqala le software, ingasebenzi kahle noma ingaqali nhlobo, kungase kudingeke ukuthi uyifake gnome-skrini. Uma kunjalo, okumele ukwenze nje ukuthayipha isiphetho (Ctrl+Alt+T):
sudo apt install gnome-screenshot
Khipha
Uma kwenzeka ufuna susa uhlelo ohlelweni lwakho, kuyodingeka kuphela ukuvula i-terminal (Ctrl+Alt+T) bese wethula umyalo kuyo:
flatpak uninstall com.github.rajsolai.textsnatcher
Leli thuluzi lakhelwe amasistimu wokusebenza ahlukene. Nakuba ngibhala lesi sihloko, ngisihlole kuphela ku-Ubuntu 20.04/21.10, ngemiphumela emihle kuzo zombili izimo. Imoto I-Tesseract OCR inika amandla leli thuluzi futhi lisebenza kahle uma indawo ekhethiwe inokulungiswa okuphezulu, noma umbhalo ongakopishwa mkhulu futhi ucacile..
Ngokulungiswa okuphansi noma amabhlogo amancane kakhulu 'wombhalo', ezinye izinhlamvu ngezinye izikhathi zikopishwa zibe ezinkulu. Futhi uma ukukhethwa kunokuhlobisa okuningi, kungaholela emiphumeleni ethile engaqondakali, njengoba ithuluzi lizama ukunikeza izinhlamvu zombhalo ezingxenyeni zemingcele, izithombe, njll.