pwn2dict 2.2

Prawie równo pół roku od ostatniej aktualizacji udostępniam nową wersję skryptu. Główną zmianą jest tym razem dodanie pełnego wsparcia dla formatu PWN w wersji 2003, używanego w płytach dołączanych kiedyś do papierowych wersji słowników. Konwersja została sprawdzona na Słowniku Poprawnej Polszczyzny i Słowniku Języka Polskiego PWN (podziękowania dla Jakuba Cholewki za informację zwrotną), a także angielsko-polskim słowniku PWN Oxford 2003. Ponadto zostały poprawione usterki kosmetyczne w słowniku Słowniku Języka Polskiego w wersji dołączonej do Gazety Wyborczej. Jeśli ktoś z was dysponuje którymś z tych słowników i wcześniej nie mógł ich przekonwertować, teraz nic nie powinno stanąć na przeszkodzie :)

Nową wersję można pobrać tutaj bądź tam gdzie zwykle.

22 komentarze

  1. Hi,
    just downloaded GoldenDict and there I got the polniem/niempol files working !
    Uuuuff and thank you,
    especially Michałowie for the excellent work on his script !
    Maybe in the future there'll be chance for a reciprocation for me…

  2. Originally posted by mziab:

    Say the script generated files called niempol.dict, niempol.idx and niempol.ifo, then you have to create a folder named gerpol and move those 3 files there. Afterwards, you copy that folder to Stardict's dic subfolder. The procedure is exactly the same for the other dictionary, except for the names.

    …thank you once more,
    now I got it also running in StarDict.
    Keep doing such excellent work,
    and have a nice day, Georg.

  3. Say the script generated files called niempol.dict, niempol.idx and niempol.ifo, then you have to create a folder named gerpol and move those 3 files there. Afterwards, you copy that folder to Stardict's dic subfolder. The procedure is exactly the same for the other dictionary, except for the names.

  4. Originally posted by mziab:

    pwn2dict only converts the text entries stored in the .win files. So the results you speak of are to be expected. Sorry, but you'll have to seek help elsewhere if you want to extract the audio, as I've currently no plans to extended my script in such a manner.

    Hi,

    and thanks also to Przemek,
    there's a little misunderstanding in regard to the "vaveniem" file.
    Since it's the largest file in the folder [about 500.000 KB]I thought it was necessary to run the dictionary.
    I don't necessarely need this file [would be nice to have it…]
    No, my problem is that, thanks to your script, I was able to convert the 2 .win files in 2 .dict files,
    which are now present in the data folder.
    But when I try to open them [StarDict has been installed] nothing happens.
    What needs to be done in order to run the 2 dictionary files ?
    Sorry for that simple question, but as I mentioned before, my knowledge is [it seems very ?]basic.
    I'd appreciate if the both of you would give me further advice on that matter.
    Thank you and best regards, Georg.

  5. pwn2dict only converts the text entries stored in the .win files. So the results you speak of are to be expected. Sorry, but you'll have to seek help elsewhere if you want to extract the audio, as I've currently no plans to extended my script in such a manner.

  6. Originally posted by mziab:

    Michał Ziąbkowski 16. December 2010, 13:37

    Here's the easiest way:

    1) Install Python -> http://www.python.org/ftp/python/2.7.1/python-2.7.1.msi

    2) Download a copy of pwn2dict and put it in a handy place like the desktop.

    2) Find the file with the dictionary base. It has the .win extension.

    3) Drag and drop that file onto pwn2dict. If everything went okay, three new files should appear. Voilla, you have converted the bases to StarDict format.

    Hi,

    unfortunately it did not quite work,
    now I have 2 .win folders converted to .dict
    but the "waveniem.pak" [512 861 KB]
    folder could not be convertet.
    It seems that it is the most important, and encoded,
    and your script doesn't convert it in that form.

    Any suggestions ?
    THX

    ps did Przemek convert from the cd version or the pendrive version [that's the one I've got ?

  7. W przypadku tabfile dałoby się. Pliki StarDict są bardziej złożone i trzeba by przebudowywać indeksy.
    A encje to właśnie to, co wymieniłeś – wpisy typu &symbol; albo &##1234; Duża część mojej roboty to właśnie wyłapywanie nowych encji i tagów i zamiana ich na coś bardziej pożytecznego w skrypcie.

    W razie czego, mogę dostarczyć skrypt, który wyłapie ze słownika wszystkie takie symbole.

  8. Przemek writes:

    Jakich encji?
    Nie da się tego pliku wynikowego jakoś wyedytować i po prostu użyć znajdź i zamień?
    Np. tam gdzie powinno być "ß" jest zawsze "& s z l i g ;" (bez spacji)
    tam gdzie powinna być "|" (pionowa krecha) jest "& b r v b a r ;"
    znaki transkrypcji sprawdzę jutro.

  9. Tak, dostarczyć mi listę nowych encji :) W razie czego mogę dostarczyć skrypt, który wyciąga takowe z danego słownika. Tak czy inaczej, będę potrzebował dalszych informacji, żeby coś z tym zrobić.

  10. Originally posted by mziab:

    Install Python, it should associate itself with Python scripts. Once you download my script, and Python automatically runs scripts when you double-click them, all you need to do is to drag the dictionary file onto pwn2dict. It'll just take the path of the file you dragged-and-dropped and try to convert it. Personally, I think this is easier and more fool-proof for newbies than other methods involving the command line.

    …thank you, for right now I'll have to quit.
    I'll keep you informed on my progress on that matter…

  11. Przemek writes:

    Witam!
    Dzisiaj dokonałem konwersji słownika pwn de-pl i pl-de i ogólnie działa, ale jest problem z wyświetlaniem niemieckich znaków, np, ß (eszet) oraz znaków transkrypcji. Czy jest na to jakiś sposób?

  12. For the record, here it is -> http://mzblog.grajpopolsku.pl/files/pwn2dict.py

    Install Python, it should associate itself with Python scripts. Once you download my script, and Python automatically runs scripts when you double-click them, all you need to do is to drag the dictionary file onto pwn2dict. It'll just take the path of the file you dragged-and-dropped and try to convert it. Personally, I think this is easier and more fool-proof for newbies than other methods involving the command line.

  13. Georg writes:

    …thank you for your fast reply,
    nevertheless I don't understand point #2 "Download a copy of pwn2dict",
    following your link I get a page full of scripts,and don't know what do do with it "Drag and drop that file onto pwn2dict".
    I'm new to these things,so forgive me my naivety.
    Maybe you'll find the time to give me more detailed instructions [for a beginner] via email georgibrom@gazeta.pl.
    Enjoy your evening, Georg.

    ps
    since I bought the printed edition of this dictionary a year ago [for quite some money]it is a shame for PWN that I must hassle you with my problem…

  14. Georg writes:

    Hi,

    I'd appreciate if you could show me how to do this for the PWN Wielki słownik polsko-niemiecki niemiecko-polski.
    A friend of mine bought the pen drive version and I copied the data to my hard disk.
    But obviously it want start from there.
    I'm running Widows XP and my computer knowledge is basic.
    feel free to answer in Polis.
    Regards, Georg.

Dodaj komentarz

Twój adres e-mail nie zostanie opublikowany. Wymagane pola są oznaczone *