Difference between revisions of "Tika"
Jump to navigation
Jump to search
(Trial import from script.) |
(Trial import from script.) |
||
| Line 1: | Line 1: | ||
| − | == | + | {{Infobox_tool |
| + | |purpose=Detects and extracts metadata and text content from documents. | ||
| + | |image= | ||
| + | |homepage=http://tika.apache.org/ | ||
| + | |license=Apache License, Version 2.0 | ||
| + | |platforms= | ||
| + | }} | ||
| − | < | + | <!-- Delete the Categories that do not apply --> |
| − | + | [[Category:Metadata Extraction]] | |
| − | + | [[Category:File Format Identification]] | |
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| − | |||
| + | = Description = | ||
Java based tool for detecting and extracting metadata and text content from documents. | Java based tool for detecting and extracting metadata and text content from documents. | ||
| − | == | + | == Searching for Tika on OPF Labs == |
| + | |||
| + | {search:query=Tikatype=page} | ||
| − | + | = User Experiences = | |
| − | |||
| + | = Development Activity = | ||
=== Release Feed === | === Release Feed === | ||
| − | Link to any RSS feed that is updated when new releases occur, if any, e.g:< | + | Link to any RSS feed that is updated when new releases occur, if any, e.g: |
| + | <rss max=7>http://projects.apache.org/feeds/rss/tika.xml</rss> | ||
=== Activity Feed === | === Activity Feed === | ||
| − | Link to any RSS feed that is updated when issue or code updates occur, if any, e.g:< | + | Link to any RSS feed that is updated when issue or code updates occur, if any, e.g: |
| − | + | <rss max=7>https://issues.apache.org/jira/activity?maxResults=10&streams=key+IS+TIKA</rss> | |
| − | |||
| − | |||
| − | |||
Revision as of 22:08, 10 November 2013
Description
Java based tool for detecting and extracting metadata and text content from documents.
Searching for Tika on OPF Labs
{search:query=Tikatype=page}
User Experiences
Development Activity
Release Feed
Link to any RSS feed that is updated when new releases occur, if any, e.g: Failed to load RSS feed from http://projects.apache.org/feeds/rss/tika.xml: There was a problem during the HTTP request: 404 Not Found
Activity Feed
Link to any RSS feed that is updated when issue or code updates occur, if any, e.g:
- 2026-03-22 11:46:02
- Richard Zowalla created
{{/*- Edge case: Multi-letter abbreviation at the start of a non-first sentence
- with {@code useTokenEnd = false} (no space between sentences)...
- by Richard Zowallahttps://issues.apache.org/jira/secure/ViewProfile.jspa?name=rzo1rzo1http://activitystrea.ms/schema/1.0/person
- 2026-03-22 11:45:33
- ASF GitHub Bot updated a link from
ASF GitHub Bot logged '10m' on ASF GitHub Bot updated a link from
ASF GitHub Bot logged '10m' on Arpit removed the Link between
Arpit updated the Description of
Currently we are using <tika-core.version>3.2.3</tika-core. Version> , where we are seeing Line Count and Paragraph count attribute are not coming for d...
- by Arpithttps://issues.apache.org/jira/secure/ViewProfile.jspa?name=arvijarvijhttp://activitystrea.ms/schema/1.0/person