A HTML scraper that uses machine learning frameworks to extract labelled fields from raw HTML. The project also involves the development of a tool to display the semi structured data generated by the scraper component.

Project Activity

See All Activity >

License

GNU General Public License version 2.0 (GPLv2)

Follow Galateia HTML Extractor

Galateia HTML Extractor Web Site

Other Useful Business Software
Tigerpaw One | Business Automation Software for SMBs Icon
Tigerpaw One | Business Automation Software for SMBs

Fed up with not having the time, money and resources to grow your business?

The only software you need to increase cash flow, optimize resource utilization, and take control of your assets and inventory.
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
1
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5

User Reviews

  • Galateia works perfect.
    1 user found this review helpful.
Read more reviews >

Additional Project Details

Intended Audience

Science/Research

Programming Language

Python

Related Categories

Python XML Software, Python HTML XHTML, Python Search Engines, Python Information Analysis Software

Registered

2008-06-27