Apache Tika

From COPTR
Revision as of 21:05, 13 November 2013 by COPTR Bot (talk | contribs) (Trial import from script.)
Jump to navigation Jump to search
Detects and extracts metadata and text content from documents.
Homepage:http://tika.apache.org/
License:Apache License, Version 2.0


Description

Java based tool for detecting and extracting metadata and text content from documents.

Searching for Tika on OPF Labs

{search:query=Tikatype=page}

User Experiences

Development Activity

Error in widget Ohloh Project: unable to write file /var/www/html/extensions/Widgets/compiled_templates/wrt665c7a0859a7e5_51816980



Release Feed

Link to any RSS feed that is updated when new releases occur, if any, e.g: Failed to load RSS feed from http://projects.apache.org/feeds/rss/tika.xml: There was a problem during the HTTP request: 404 Not Found

Activity Feed

Link to any RSS feed that is updated when issue or code updates occur, if any, e.g:

2024-06-02 13:54:04
Michael Osipov added the Fix Version 'ASF GitHub Bot updated a link from ASF GitHub Bot commented on MOHD KAIF KHAN commented on ableegoldman So far here's what I have done - 
1. Remove the old...
by MOHD KAIF KHANhttps://issues.apache.org/jira/secure/ViewProfile.jspa?name=kkhan1kkhan1http://activitystrea.ms/schema/1.0/person
2024-06-02 13:40:51
Ian Cook updated the Description of

Because of a limitation in PyArrow, when PyArrow Tables containing MapArray columns with nested fields or timestamps are passed to spark.createDataFrame(), null valu...

by Ian Cookhttps://issues.apache.org/jira/secure/ViewProfile.jspa?name=icookicookhttp://activitystrea.ms/schema/1.0/person
2024-06-02 13:39:49
Denys Kuzmenko linked 2 issues
by Denys Kuzmenkohttps://issues.apache.org/jira/secure/ViewProfile.jspa?name=dkuzmenkodkuzmenkohttp://activitystrea.ms/schema/1.0/person
2024-06-02 13:39:20
Denys Kuzmenko resolved