tika-dev mailing list archives

From "Nick Burch (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-851) M4V magic detection invalid
Date Fri, 27 Jan 2012 14:32:11 GMT

    [ https://issues.apache.org/jira/browse/TIKA-851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13194790#comment-13194790 ]

Nick Burch commented on TIKA-851:
---------------------------------

I'm not sure we'll be able to differentiate between .mov, .mp4 and .m4v using mime magic
alone, as I believe they all use the same container format.

We may need a detector that opens the files and checks them in a container-aware manner.
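
As a rough illustration of what a container-aware check might look like, here is a minimal sketch that reads the major brand from a leading `ftyp` box. It is not existing Tika code: the class and method names are hypothetical, and the brand-to-type mapping is an assumption that would need checking against real sample files. It also assumes the `ftyp` box comes first in the file, which need not hold for older QuickTime files.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Tika: inspects the first box of the file.
class FtypBrandSniffer {

    /** Returns the 4-character major brand, or null if the data does not start with an ftyp box. */
    static String majorBrand(byte[] header) {
        // Box layout: 4-byte size, 4-byte type ("ftyp"), then the 4-byte major brand.
        if (header == null || header.length < 12) {
            return null;
        }
        String boxType = new String(header, 4, 4, StandardCharsets.ISO_8859_1);
        if (!"ftyp".equals(boxType)) {
            return null;
        }
        return new String(header, 8, 4, StandardCharsets.ISO_8859_1);
    }

    /** Maps a major brand to a media type; returns null when this sketch has no opinion. */
    static String guessType(byte[] header) {
        String brand = majorBrand(header);
        if (brand == null) {
            return null;
        }
        switch (brand) {
            case "M4V ":               // assumed brand for iTunes-style video
                return "video/x-m4v";
            case "qt  ":               // assumed brand for QuickTime movies
                return "video/quicktime";
            case "isom":
            case "mp41":
            case "mp42":               // assumed common MP4 brands
                return "video/mp4";
            default:
                return null;           // unknown brand: leave it to the existing magic
        }
    }

    public static void main(String[] args) {
        // Synthetic 12-byte header for demonstration, not taken from a real file.
        byte[] sample = {0, 0, 0, 0x20, 'f', 't', 'y', 'p', 'M', '4', 'V', ' '};
        System.out.println(guessType(sample)); // prints "video/x-m4v"
    }
}
```

A real detector would presumably read these bytes from the stream under mark/reset and fall back to the existing magic result whenever this check returns null.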
                
> M4V magic detection invalid
> ---------------------------
>
>                 Key: TIKA-851
>                 URL: https://issues.apache.org/jira/browse/TIKA-851
>             Project: Tika
>          Issue Type: Bug
>          Components: mime
>    Affects Versions: 1.0
>            Reporter: Alexander Chow
>
> When the mime type of an M4V file is detected using its name only, it returns video/x-m4v.
> When it is detected using the InputStream (hence utilising the MagicDetector), it incorrectly
> returns video/quicktime.
> Using the sample M4V file from Apple's [knowledge base|http://support.apple.com/kb/HT1425]:
> {code:title=TikaTest.java}
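> import java.io.File;
> import java.io.InputStream;
>
> import org.apache.tika.detect.DefaultDetector;
> import org.apache.tika.detect.Detector;
> import org.apache.tika.io.TikaInputStream;
> import org.apache.tika.metadata.Metadata;
> import org.apache.tika.mime.MimeTypes;
>
> public class TikaTest {
> 	public static void main(String[] args) throws Exception {
> 		String userHome = System.getProperty("user.home");
> 		File file = new File(userHome + "/Desktop/sample_iPod.m4v");
> 		InputStream is = TikaInputStream.get(file);
> 		Detector detector = new DefaultDetector(
> 			MimeTypes.getDefaultMimeTypes());
> 		Metadata metadata = new Metadata();
> 		metadata.set(Metadata.RESOURCE_NAME_KEY, file.getName());
> 		// Detectors mark and reset the stream, so it can be reused for the second call
> 		System.out.println("File + filename: " + detector.detect(is, metadata));
> 		System.out.println("File only:       " + detector.detect(is, new Metadata()));
> 		System.out.println("Filename only:   " + detector.detect(null, metadata));
> 		is.close();
> 	}
> }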
> {code}
> Renders the output:
> {code}
> File + filename: video/quicktime
> File only:       video/quicktime
> Filename only:   video/x-m4v
> {code}
