Difference between revisions of "Tika"

From COPTR
Jump to navigation Jump to search
(Trial import from script.)
 
(Trial import from script.)
Line 1: Line 1:
== Summary ==
+
{{Infobox_tool
 +
|purpose=Detects and extracts metadata and text content from documents.
 +
|image=
 +
|homepage=http://tika.apache.org/
 +
|license=Apache License, Version 2.0
 +
|platforms=
 +
}}
  
<table>
+
<!-- Delete the Categories that do not apply -->
<tbody>
+
[[Category:Metadata Extraction]]
<tr class="odd">
+
[[Category:File Format Identification]]
<td align="left">Purpose</td>
 
<td align="left">{excerpt}Detects and extracts metadata and text content from documents.{excerpt}</td>
 
</tr>
 
<tr class="even">
 
<td align="left">Homepage </td>
 
<td align="left">[http://tika.apache.org/]</td>
 
</tr>
 
<tr class="odd">
 
<td align="left">Source Code Repository </td>
 
<td align="left">[https://github.com/apache/tika]</td>
 
</tr>
 
<tr class="even">
 
<td align="left">License </td>
 
<td align="left">Apache License, Version 2.0 </td>
 
</tr>
 
<tr class="odd">
 
<td align="left">Debian Package</td>
 
<td align="left"></td>
 
</tr>
 
</tbody>
 
</table>
 
  
== Description ==
 
  
 +
= Description =
 
Java based tool for detecting and extracting metadata and text content from documents.
 
Java based tool for detecting and extracting metadata and text content from documents.
  
== User Experiences ==
+
== Searching for Tika on OPF Labs ==
 +
 
 +
{search:query=Tikatype=page}
  
e.g. links to AQuA/SCAPE/Hackathon issues that use the tool<br />* [SP:IS25 Web Content Characterisation]<br />* [SP:SO11 The Tika characterisation Tool]<br />* [SO17 Web Archive Mime-Type detection workflow based on Droid and Apache Tika|SP:SO17 Web Archive Mime-Type detection workflow based on Droid and Apache Tika]
+
= User Experiences =
  
== News Feeds ==
 
  
 +
= Development Activity =
 
=== Release Feed ===
 
=== Release Feed ===
  
Link to any RSS feed that is updated when new releases occur, if any, e.g:<br />{rss:max=7|url=http://projects.apache.org/feeds/rss/tika.xml}
+
Link to any RSS feed that is updated when new releases occur, if any, e.g:
 +
<rss max=7>http://projects.apache.org/feeds/rss/tika.xml</rss>
  
 
=== Activity Feed ===
 
=== Activity Feed ===
  
Link to any RSS feed that is updated when issue or code updates occur, if any, e.g:<br />{rss:max=7|url=https://issues.apache.org/jira/activity?maxResults=10&amp;streams=key+IS+TIKA}
+
Link to any RSS feed that is updated when issue or code updates occur, if any, e.g:
 
+
<rss max=7>https://issues.apache.org/jira/activity?maxResults=10&amp;streams=key+IS+TIKA</rss>
== Searching for Tika on OPF Labs ==
 
 
 
{search:query=Tikatype=page}
 

Revision as of 22:08, 10 November 2013

Detects and extracts metadata and text content from documents.
Homepage:http://tika.apache.org/
License:Apache License, Version 2.0


Description

Java based tool for detecting and extracting metadata and text content from documents.

Searching for Tika on OPF Labs

{search:query=Tikatype=page}

User Experiences

Development Activity

Release Feed

Link to any RSS feed that is updated when new releases occur, if any, e.g: Failed to load RSS feed from http://projects.apache.org/feeds/rss/tika.xml: There was a problem during the HTTP request: 404 Not Found

Activity Feed

Link to any RSS feed that is updated when issue or code updates occur, if any, e.g:

2024-11-22 02:36:54
Youri created ASF GitHub Bot updated a link from ASF GitHub Bot changed the Labels to 'pull-request-available'...
by ASF GitHub Bothttps://issues.apache.org/jira/secure/ViewProfile.jspa?name=githubbotgithubbothttp://activitystrea.ms/schema/1.0/person
2024-11-22 02:35:53
ASF GitHub Bot created a link from Yang Jie created ASF GitHub Bot updated a link from ASF GitHub Bot updated a link from