Apache Tika

Jump to navigation Jump to search
Detects and extracts metadata and text content from documents.
License:Apache License, Version 2.0


Java based tool for detecting and extracting metadata and text content from documents.

User Experiences

Development Activity

Release Feed

Link to any RSS feed that is updated when new releases occur, if any, e.g: Failed to load RSS feed from http://projects.apache.org/feeds/rss/tika.xml: There was a problem during the HTTP request: 404 Not Found

Activity Feed

Link to any RSS feed that is updated when issue or code updates occur, if any, e.g:

2022-07-02 08:49:07
gmnhapkhau176 created

ghế massage Haruko lần trước tiên được giới thiệu ra thị trường dùng vào cuối những năm 1980. Được kiểu dáng để mô hình các chuyển động và công nghệ của một viên chứ...

by gmnhapkhau176https://issues.apache.org/jira/secure/ViewProfile.jspa?name=gmnhapkhau176gmnhapkhau176http://activitystrea.ms/schema/1.0/person
2022-07-02 08:49:04
Tamás Cservenák updated the Description of ASF GitHub Bot changed the Labels to 'pull-request-available'...
by ASF GitHub Bothttps://issues.apache.org/jira/secure/ViewProfile.jspa?name=githubbotgithubbothttp://activitystrea.ms/schema/1.0/person
2022-07-02 08:43:24
ASF GitHub Bot created a link from ASF GitHub Bot updated 2 fields of Julian Reschke commented on Julian Reschke attached one file to