Hidden objects in a PDF


I am working on marked content in PDF.

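I came across a PDF file that has marked content, but a few of the objects inside the marked content are hidden, so a single BDC-EMC block contains both visible and hidden objects. I don't see an OCGs array in the document. How does this work, and how can I tell which objects (graphics, text) are visible and which are hidden?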

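I don't see an option to attach the PDF file here, so I am sharing the content stream instead. Only one BT-ET block, the one directly under "/PlacedPDF /MC0 BDC", is visible; all the others are hidden.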

Any help is much appreciated. Thanks, Chetan

PDF content stream

/Span <</Lang (en)/MCID 1597 >>BDC 
    /Span <</ActualText (þÿ    )>>BDC 
    EMC 
EMC 
/Span <</Lang (en)/MCID 1598 >>BDC 
EMC 
/Span <</Lang (en)/MCID 1599 >>BDC 
    /Span <</ActualText (þÿ    )>>BDC 
   EMC 
EMC

q
    /Perceptual ri
    /GS0 gs
    /T1_1 1 Tf
   /Fm0 Do
Q
/Figure <</MCID 1602 >>BDC 
/PlacedPDF /MC0 BDC 

<------------------------------- START -------------------------------->

BT
0 0 0 1 k
/Perceptual ri
/GS0 gs
/T1_0 1 Tf
6.7092 0 0 6.7092 91.8006 408.647 Tm
[(St)-20(andard)]TJ
ET

<------------------------------- END -------------------------------->


q
67.107 261.154 77 188.188 re
W n
BT
-0.12 Tw 6.7092 0 0 6.7092 332.5724 347.7748 Tm
[(Mec)50(hanical T)115(ee)]TJ
0 Tw 17.697 9.073 Td
[(A)40(WW)40(A Ductile Iron Pipe)]TJ
-34.941 -1.057 Td
[(R)20(educing)]TJ
-1.399 -20.545 Td
(Outlet Coupling)Tj
ET
Q
q
67.107 261.154 77 188.188 re
W n
BT
6.7092 0 0 6.7092 339.3285 306.0237 Tm
[(Saddle-L)20(et)]TJ
-18.251 6.751 Td
[(R)20(educing)]TJ
-4.096 -1.2 Td
(\(2" x 1\275", 2\275" x 2", 3" x 2\275"\))Tj
-0.025 Tw 20.744 8.715 Td
[(Flange A)20(dapter)]TJ
0 Tw 2.279 -20.578 Td
[(W)-20(ildcat)]TJ
19.004 0 Td
(HDPE Pipe)Tj
ET
Q
q
67.107 261.154 77 188.188 re
W n
BT
-0.025 Tw 6.7092 0 0 6.7092 467.048 359.0001 Tm
[(IPS )-25(to A)40(WW)40(A)]TJ
ET
EMC 
EMC 
/Figure <</MCID 1603 >>BDC 
/PlacedPDF /MC1 BDC 
Q
q
170.527 255.484 83.892 188.189 re
W n
BT
6.7092 0 0 6.7092 73.8793 402.9777 Tm
[(St)-20(andard)]TJ
0.205 -7.706 Td
(GapSeal)Tj
-0.12 Tw 35.682 -1.367 Td
[(Mec)50(hanical T)115(ee)]TJ
0 Tw 17.697 9.073 Td
[(A)40(WW)40(A Ductile Iron Pipe)]TJ
ET
Q
q
170.527 255.484 83.892 188.189 re
W n
BT
6.7092 0 0 6.7092 65.2513 303.5076 Tm
[(End P)20(rotection)]TJ
38.18 -0.47 Td
[(Saddle-L)20(et)]TJ
ET
Q
q
170.527 255.484 83.892 188.189 re
W n
BT
-0.025 Tw 6.7092 0 0 6.7092 310.6531 396.0676 Tm
[(Flange A)20(dapter)]TJ
0 Tw 2.279 -20.578 Td
(W)Tj
6.7092 0 0 6.7092 171.4775 337.5969 Tm
24.043 -11.863 Td
(ildcat)Tj
17.984 0 Td
(HDPE Pipe)Tj
-56.203 -0.017 Td
[(F)20(astFit)]TJ
4.1287 0 0 4.1287 96.3529 259.9574 Tm
(\256)Tj
-0.025 Tw 6.7092 0 0 6.7092 449.1268 353.3308 Tm
[(IPS )-25(to A)40(WW)40(A)]TJ
ET
EMC 
EMC 
/Figure <</MCID 1604 >>BDC 
/PlacedPDF /MC2 BDC 
Q
q
62.748 59.87 83.953 188.188 re
W n
BT
6.7092 0 0 6.7092 -157.3332 207.3635 Tm
[(St)-20(andard)]TJ
0.205 -7.706 Td
(GapSeal)Tj
ET
Q
q
62.748 59.87 83.953 188.188 re
W n
BT
6.7092 0 0 6.7092 202.1706 207.3635 Tm
[(A)40(WW)40(A Ductile Iron Pipe)]TJ
-34.941 -1.057 Td
[(R)20(educing)]TJ
-1.399 -20.545 Td
(Outlet Coupling)Tj
-18.53 6.776 Td
[(End P)20(rotection)]TJ
ET
Q
q
62.748 59.87 83.953 188.188 re
W n
BT
6.7092 0 0 6.7092 -32.2543 150.0337 Tm
[(R)20(educing)]TJ
-4.096 -1.2 Td
(\(2" x 1\275", 2\275" x 2", 3" x 2\275"\))Tj
ET
Q
q
62.748 59.87 83.953 188.188 re
W n
BT
6.7092 0 0 6.7092 222.231 62.3919 Tm
(HDPE Pipe)Tj
-56.203 -0.017 Td
[(F)20(astFit)]TJ
4.1287 0 0 4.1287 -134.8597 64.3432 Tm
(\256)Tj
-0.025 Tw 6.7092 0 0 6.7092 217.9142 157.7166 Tm
[(IPS )-25(to A)40(WW)40(A)]TJ
ET
EMC 
EMC 
/Figure <</MCID 1605 >>BDC 
/PlacedPDF /MC3 BDC 
Q
q
169.441 59.898 85.291 183.362 re
W n
BT
6.7092 0 0 6.7092 -181.845 207.3911 Tm
[(St)-20(andard)]TJ
0.205 -7.706 Td
(GapSeal)Tj
-0.12 Tw 35.682 -1.367 Td
[(Mec)50(hanical T)115(ee)]TJ
ET
Q
q
169.441 59.898 85.291 183.362 re
W n
BT
6.7092 0 0 6.7092 -56.7661 200.2995 Tm
[(R)20(educing)]TJ
-1.399 -20.545 Td
(Outlet Coupling)Tj
-18.53 6.776 Td
[(End P)20(rotection)]TJ
38.18 -0.47 Td
[(Saddle-L)20(et)]TJ
-18.251 6.751 Td
[(R)20(educing)]TJ
-4.096 -1.2 Td
(\(2" x 1\275", 2\275" x 2", 3" x 2\275"\))Tj
-0.025 Tw 20.744 8.715 Td
[(Flange A)20(dapter)]TJ
0 Tw 2.279 -20.578 Td
[(W)-20(ildcat)]TJ
ET
Q
q
169.441 59.898 85.291 183.362 re
W n
BT
6.7092 0 0 6.7092 -179.356 62.3054 Tm
[(F)20(astFit)]TJ
4.1287 0 0 4.1287 -159.3715 64.3708 Tm
(\256)Tj
ET
EMC 
EMC 
Q
pdf pdf-generation pdf-parsing
1 Answer

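The text objects (except the first one, which draws "Standard") are each preceded by a clip path definition, and their text is drawn outside that clip path. Thus, those text pieces are not visible.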

For example, in this block following your "Standard" text:

q
67.107 261.154 77 188.188 re
W n
BT
-0.12 Tw 6.7092 0 0 6.7092 332.5724 347.7748 Tm
[(Mec)50(hanical T)115(ee)]TJ
0 Tw 17.697 9.073 Td
[(A)40(WW)40(A Ductile Iron Pipe)]TJ
-34.941 -1.057 Td
[(R)20(educing)]TJ
-1.399 -20.545 Td
(Outlet Coupling)Tj
ET
Q 

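At the start of this block, the current clip path is reduced to a rectangle with its lower-left corner at (67.107, 261.154) and a size of 77×188.188, so it spans x from 67.107 to 144.107. The text pieces that follow are drawn rightwards. Their baselines start at approximately the points below (assuming an identity CTM; the Td offsets are in text space, so they are scaled by the 6.7092 factor set by Tm):

  • (333, 348)
  • (451, 409)
  • (217, 402)
  • (207, 264)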

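These baseline starts are clearly to the right of that clip path rectangle, and so are the text pieces drawn rightwards from them. Thus, they are hidden.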
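The check above can be reproduced with a small Python sketch. It is only an illustration under stated assumptions: the CTM is taken to be the identity, the operators of the example block are transcribed by hand into a hypothetical `OPS` list instead of being parsed from the stream, and only the baseline start of each text piece is tested, not its full extent. The `Td` offsets are applied in text space, so they are scaled by the matrix set with `Tm`.

```python
# Clip rectangle from "67.107 261.154 77 188.188 re W n": x, y, width, height.
CLIP = (67.107, 261.154, 77.0, 188.188)

# Text operators of the example block, transcribed by hand (hypothetical
# representation; a real tool would tokenize the content stream instead).
OPS = [
    ("Tm", (6.7092, 0, 0, 6.7092, 332.5724, 347.7748)),
    ("show", "Mechanical Tee"),
    ("Td", (17.697, 9.073)),
    ("show", "AWWA Ductile Iron Pipe"),
    ("Td", (-34.941, -1.057)),
    ("show", "Reducing"),
    ("Td", (-1.399, -20.545)),
    ("show", "Outlet Coupling"),
]


def baseline_starts(ops):
    """Yield (text, x, y) for each show operator, assuming an identity CTM."""
    a, b, c, d, e, f = 1.0, 0.0, 0.0, 1.0, 0.0, 0.0  # text line matrix
    for op, arg in ops:
        if op == "Tm":
            # Tm replaces the text (line) matrix outright.
            a, b, c, d, e, f = arg
        elif op == "Td":
            # Td moves to the next line; the offset is in text space,
            # so it is transformed by the current text line matrix.
            tx, ty = arg
            e, f = tx * a + ty * c + e, tx * b + ty * d + f
        elif op == "show":
            # Tj/TJ start drawing at the origin of the text line matrix.
            yield arg, e, f


def inside(rect, x, y):
    """Return True if the point (x, y) lies within the rectangle."""
    rx, ry, w, h = rect
    return rx <= x <= rx + w and ry <= y <= ry + h


results = [
    (text, round(x), round(y), inside(CLIP, x, y))
    for text, x, y in baseline_starts(OPS)
]

for text, x, y, visible in results:
    print(f"{text!r}: start=({x}, {y}) inside clip: {visible}")
```

Every start point reported by the sketch lies to the right of the clip rectangle's right edge at x = 144.107, which is consistent with the text being clipped away. A more thorough check would also account for the CTM, the glyph widths, and clip paths that are not simple rectangles.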
