๊ด€๋ฆฌ ๋ฉ”๋‰ด

๋ชฉ๋กPython (1)

Wookang makes AI

Extracting text from epub with python - ํŒŒ์ด์ฌ์œผ๋กœ epub์—์„œ ํ…์ŠคํŠธ ๋ฝ‘๊ธฐ

ai ์—๊ฒŒ ๋จน์ผ ๋ฐ์ดํ„ฐ๋ฅผ ์š”๋ฆฌ์ค‘์ด๋‹ค. ๊ฐ€์ง€๊ณ  ์žˆ๋Š” epub๋“ค์ด ์กฐ๊ธˆ ์žˆ๋Š”๋ฐ ์ด๋Œ€๋กœ๋Š” ๋จน์ผ ์ˆ˜ ์—†์œผ๋‹ˆ ๋ชจ๋‘ text๋กœ ๋ฐ”๊ฟ”๋†”์•ผํ•œ๋‹ค. ๊ทธ๋Ÿฐ๋ฐ ์ƒ๊ฐ๋ณด๋‹ค ์ž๋ฃŒ๊ฐ€ ์—†์—ˆ๋‹ค. ํŠนํžˆ ํ•œ๊ธ€๋“ค์ด ๋ชจ๋‘ ๊นจ์ ธ๋‚˜์™”๋‹ค. calibre๋ฅผ ์ถ”์ฒœํ•˜๊ธฐ์— ์„ค์น˜ํ›„ convert ํ•ด๋ดค๋”๋‹ˆ ์ถœ๋ ฅ ํด๋”๋ฅผ ์„ ํƒํ•  ์ˆ˜ ์—†์–ด ์ด๊ฒƒ๋„ ๋งˆ์ฐฌ๊ฐ€์ง€๋กœ ๊ฝค๋‚˜ ๊ท€์ฐฎ์€ ์ž‘์—…์ด์—ˆ๋‹ค - ํ•˜์ง€๋งŒ ํŒŒ์ด์ฌ์œผ๋กœ ๋๋‚ด ์‹คํŒจํ•œ๋‹ค๋ฉด ์ด๋ ‡๊ฒŒ๋ผ๋„ ์ž‘์—…ํ•œ ํ›„ txtํŒŒ์ผ๋“ค์„ ๋ชจ๋‘ ์ฐพ์•„ ํ•œ๋ฒˆ์— ๋ชจ์œผ๋Š” ์ฝ”๋“œ๋ฅผ ๋งŒ๋“ค ์ž‘์ •์ด์—ˆ๋‹ค.. ์ง€๋งŒ ํŒŒ์ด์ฌ์œผ๋กœ ํ•ด๊ฒฐํ–ˆ๋‹ค. ๋จผ์ € EbookLib๋ฅผ ์„ค์น˜ํ•œ๋‹ค 1. pip install EbookLib https://pypi.org/project/EbookLib/ EbookLib Ebook library which can handle EPUB2/EPUB3 and ..