Lotus教程、Java教程、Java虚拟机、Java软件综合开发社区

Lotus、Domino、Java、C#、Web、数据库综合开发教程、资料社区

常见文件类型识别



Published by admin on 08月 1, 2011

根据文件的后缀名识别文件类型并不准确,可以使用文件的头信息进行识别:
以下是各类文件的头:
JPEG (jpg),文件头:FFD8FFE1
PNG (png),文件头:89504E47
GIF (gif),文件头:47494638
TIFF (tif),文件头:49492A00
Windows Bitmap (bmp),文件头:424D
CAD (dwg),文件头:41433130
Adobe Photoshop (psd),文件头:38425053
Rich Text Format (rtf),文件头:7B5C727466
XML (xml),文件头:3C3F786D6C
HTML (html),文件头:68746D6C3E
Email [thorough only] (eml),文件头:44656C69766572792D646174653A


Outlook Express (dbx),文件头:CFAD12FEC5FD746F
Outlook (pst),文件头:2142444E
MS Word/Excel (xls.or.doc),文件头:D0CF11E0
MS Access (mdb),文件头:5374616E64617264204A
WordPerfect (wpd),文件头:FF575043
Postscript (eps.or.ps),文件头:252150532D41646F6265
Adobe Acrobat (pdf),文件头:255044462D312E
Quicken (qdf),文件头:AC9EBD8F
Windows Password (pwl),文件头:E3828596
ZIP Archive (zip),文件头:504B0304
RAR Archive (rar),文件头:52617221
Wave (wav),文件头:57415645
AVI (avi),文件头:41564920
Real Audio (ram),文件头:2E7261FD
Real Media (rm),文件头:2E524D46
MPEG (mpg),文件头:000001BA
MPEG (mpg),文件头:000001B3
Quicktime (mov),文件头:6D6F6F76
Windows Media (asf),文件头:3026B2758E66CF11
MIDI (mid),文件头:4D546864
检测文件类型的代码如下:

Java代码 复制代码 收藏代码

  1. import java.io.File;
  2. import java.io.FileInputStream;
  3. import java.io.IOException;
  4. import java.util.HashMap;
  5. import java.util.Map;
  6. public class FileTypeDetector {
  7. private static Map<String,String> head2FileType = new HashMap<String,String>();
  8. static{
  9. head2FileType.put(“FFD8FFE1″, “jpg”);
  10. head2FileType.put(“89504E47″, “png”);
  11. head2FileType.put(“47494638 “, “gif”);
  12. head2FileType.put(“49492A00″, “tif”);
  13. head2FileType.put(“424D”, “bmp”);
  14. head2FileType.put(“41433130″, “dwg”);
  15. head2FileType.put(“38425053 “, “psd”);
  16. head2FileType.put(“7B5C727466″, “rtf”);
  17. head2FileType.put(“3C3F786D6C”, “xml”);
  18. head2FileType.put(“68746D6C3E “, “html”);
  19. head2FileType.put(“44656C69766572792D646174″, “eml”);
  20. head2FileType.put(“CFAD12FEC5FD746F “, “dbx”);
  21. head2FileType.put(“2142444E”, “pst”);
  22. head2FileType.put(“D0CF11E0″, “xls/doc”);
  23. head2FileType.put(“5374616E64617264204A”, “mdb”);
  24. head2FileType.put(“FF575043″, “wpd”);
  25. head2FileType.put(“252150532D41646F6265″, “eps/ps”);
  26. head2FileType.put(“255044462D312E”, “pdf”);
  27. head2FileType.put(“E3828596″, “pwl”);
  28. head2FileType.put(“504B0304″, “zip”);
  29. head2FileType.put(“52617221″, “rar”);
  30. head2FileType.put(“57415645″, “wav”);
  31. head2FileType.put(“41564920″, “avi”);
  32. head2FileType.put(“2E7261FD”, “ram”);
  33. head2FileType.put(“2E524D46″, “rm”);
  34. head2FileType.put(“000001BA”, “mpg”);
  35. head2FileType.put(“000001B3″, “mpg”);
  36. head2FileType.put(“6D6F6F76″, “mov”);
  37. head2FileType.put(“3026B2758E66CF11″, “asf”);
  38. head2FileType.put(“4D546864″, “mid”);
  39. }
  40. private static String bytesToHexString(String fileName) throws IOException{
  41. FileInputStream fis = null;
  42. StringBuilder stringBuilder = new StringBuilder();
  43. try{
  44. fis = new FileInputStream(new File(fileName));
  45. byte[] b = new byte[4];
  46. fis.read(b, 0, b.length);
  47. for (int i = 0; i < b.length; i++) {
  48. int v = b[i] & 0xFF;
  49. String hv = Integer.toHexString(v);
  50. if (hv.length() < 2) {
  51. stringBuilder.append(0);
  52. }
  53. stringBuilder.append(hv);
  54. }
  55. }finally{
  56. if(fis != null)
  57. fis.close();
  58. }
  59. return stringBuilder.toString().toUpperCase();
  60. }
  61. public static String fileType(String fileName) throws IOException{
  62. String head = bytesToHexString(fileName);
  63. return head2FileType.get(head);
  64. }
  65. public static void main(String[] args) throws IOException {
  66. System.out.println(fileType(“d://aaa.png”));
  67. }
  68. }

参考:http://blog.sina.com.cn/s/blog_4c98b9600100jamb.html


下一篇文章:jvm terminated exit code =-1 »

【版权说明】:本网页上有部分内容来源于网上收集,但不能保证资料的完整性和准确性,仅提供参考和学习。如有侵权请立即通知我们,我们将立即删除,谢谢合作!

Add A Comment