PDF: Difference between revisions

[pending revision][accepted revision]
Content deleted Content added
Add convert pdf back to word
Tags: Reverted Visual edit Mobile edit Mobile web edit
Restored revision 1246299291 by Pancho507 (talk)
 
(21 intermediate revisions by 11 users not shown)
Line 42:
 
PDF files may contain a variety of content besides flat text and graphics including logical structuring elements, interactive elements such as annotations and form-fields, layers, [[rich media]] (including video content), three-dimensional objects using [[U3D]] or [[PRC (file format)|PRC]], and various other [[File format|data formats]]. The PDF specification also provides for encryption and [[digital signature]]s, file attachments, and [[metadata]] to enable [[workflow]]s requiring these features.
 
A pdf which create from word document can easy convert back to word at [https://saypdf.com some pdf online tool] and we can convert a scanned pdf to word easy.
 
== History ==
Line 57 ⟶ 55:
PDF 1.7, the sixth edition of the PDF specification that became ISO 32000-1, includes some proprietary technologies defined only by Adobe, such as [[Adobe XML Forms Architecture]] (XFA) and [[JavaScript]] extension for Acrobat, which are referenced by ISO 32000-1 as [[normative]] and indispensable for the full implementation of the ISO 32000-1 specification.<ref>{{cite web|date= June 25, 2013|title= Guide for the procurement of standards-based ICT – Elements of Good Practice, Against lock-in: building open ICT systems by making better use of standards in public procurement|url= https://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=SWD:2013:0224:FIN:EN:PDF|access-date= January 12, 2023|publisher= European Commission|quote= Example: ISO/IEC 29500, ISO/IEC 26300 and ISO 32000 for document formats reference information that is not accessible by all parties (references to proprietary technology and brand names, incomplete scope or dead web links).|archive-date= September 19, 2020|archive-url= https://web.archive.org/web/20200919174545/https://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=SWD:2013:0224:FIN:EN:PDF|url-status= live}}</ref> These proprietary technologies are not standardized, and their specification is published only on Adobe's website.<ref name= "iso-meeting-n603">{{cite web|title=ISO/TC 171/SC 2/WG 8 N 603 – Meeting Report|date=June 27, 2011|url=http://pdf.editme.com/files/pdfREF-meetings/ISO-TC171-SC2-WG8_N0603_SC2WG8_MtgRept_SLC.pdf | work = Edit me | via = Archive |archive-url= https://web.archive.org/web/20121126013025/http://pdf.editme.com/files/pdfREF-meetings/ISO-TC171-SC2-WG8_N0603_SC2WG8_MtgRept_SLC.pdf |url-status=dead|archive-date= November 26, 2012 |quote=XFA is not to be ISO standard just yet. The Committee urges Adobe Systems to submit the XFA Specification, XML Forms Architecture (XFA), to ISO for standardization The Committee is concerned about the stability of the XFA specification Part 2 will reference XFA 3.1}}</ref><ref>{{cite journal|journal=PLOS ONE |date=2013|volume=8|number=9|title=Embedding and publishing interactive, 3-dimensional, scientific figures in Portable Document Format (PDF) files|doi=10.1371/journal.pone.0069446.s001|quote=the implementation of the U3D standard was not complete and proprietary extensions were used.|doi-access=free}}</ref><ref name="rosenthol-adobe-2012">{{cite web |url = http://cdn.parleys.com/p/5148922a0364bc17fc56c6e5/iSUM2012_00_LRO_presentation.pdf |title= PDF and Standards |author=Leonard Rosenthol | publisher = Adobe Systems |year= 2012 |access-date=October 20, 2013 |url-status=dead | via = Parleys |archive-url = https://web.archive.org/web/20130902000323/http://cdn.parleys.com/p/5148922a0364bc17fc56c6e5/iSUM2012_00_LRO_presentation.pdf |archive-date=September 2, 2013 }}</ref> Many of them are not supported by popular third-party implementations of PDF.
 
ISO published version 2.0 of PDF, ISO 32000-2 in 2017, available for purchase, replacing the free specification provided by Adobe.<ref name=nowfree/> In December 2020, the second edition of PDF 2.0, ISO 32000-2:2020, was published, with clarifications, corrections, and critical updates to normative references<ref>{{Cite web |url=https://www.pdfa.org/iso-32000-22020-is-now-available/ |title=ISO 32000-2:2020 is now available |publisher=PDFA |date=December 14, 2020 |access-date=February 3, 2021 |archive-date=December 4, 2022 |archive-url=https://web.archive.org/web/20221204112238/https://www.pdfa.org/iso-32000-22020-is-now-available/ |url-status=live }}</ref> (ISO 32000-2 does not include any proprietary technologies as normative references).<ref name=":0">{{cite web|url=https://www.iso.org/standard/75839.html|title=ISO 32000-2 – Document management — Portable document format — Part 2: PDF 2.0|date=January 5, 2021 |publisher=ISO|access-date=February 3, 2021|archive-date=January 28, 2021|archive-url=https://web.archive.org/web/20210128003836/https://www.iso.org/standard/75839.html|url-status=live}}</ref>
In April 2023 the PDF Association made ISO 32000-2 available for download free of charge.<ref name=nowfree>{{cite press release| title=Announcing no-cost access to the latest PDF standard: ISO 32000-2 (PDF 2.0)| publisher=PDF Association| url=https://pdfa.org/sponsored-standards| date=16 June 2023| orig-date=Updated; originally published 5 April 2023| access-date=October 6, 2023| archive-date=September 23, 2023| archive-url=https://web.archive.org/web/20230923202322/https://pdfa.org/sponsored-standards/| url-status=live}}</ref>
 
Line 82 ⟶ 80:
PostScript was originally designed for a drastically different [[use case]]: transmission of one-way linear print jobs in which the PostScript interpreter would collect a series of commands until it encountered the <code>showpage</code> command, then execute all the commands to render a page as a raster image to a printing device.<ref name="Pfiffner_Page_139">{{cite book |last1=Pfiffner |first1=Pamela |title=Inside the Publishing Revolution: The Adobe Story |date=2003 |publisher=Peachpit Press |location=Berkeley |isbn=0-321-11564-3 |page=139}}</ref> PostScript was not intended for long-term storage and real-time interactive rendering of [[electronic document]]s to [[computer monitor]]s, so there was no need to support anything other than consecutive rendering of pages.<ref name="Pfiffner_Page_139" /> If there was an error in the final printed output, the user would correct it at the application level and send a new print job in the form of an entirely new PostScript file. Thus, any given page in a PostScript file could be accurately rendered only as the cumulative result of executing all preceding commands to draw all previous pages—any of which could affect subsequent pages—plus the commands to draw that particular page, and there was no easy way to bypass that process to skip around to different pages.<ref name="Pfiffner_Page_139" />
 
Traditionally, to go from PostScript to PDF, a source PostScript file (that is, an executable program) is used as the basis for generating PostScript-like PDF code (see, e.g., [[Adobe Distiller]]). This is done by applying standard [[compiler]] techniques like [[loop unrolling]], [[inline expansion|inlining]] and removing unused branches, resulting in code that is purely declarative and static.<ref name="Pfiffner_Page_139" /> The end result is then packaged into a [[container format]], together with all necessary [[Dependency (computer science)|dependencies]] for correct rendering (external files, graphics, or fonts to which the document refers), and [[Data compression|compressed]]. Modern applications write to printer drivers whichthat directly generate PDF rather than going through PostScript first.
 
As a document format, PDF has several advantages over PostScript:
Line 265 ⟶ 263:
=== Forms ===
 
''Interactive Forms'' is a mechanism to add forms to the PDF file format. PDF currently supports two different methods for integrating data and PDF forms. Both formats today coexist in the PDF specification:<ref name="iso32000">{{cite web |url=https://www.adobe.com/devnet/acrobat/pdfs/PDF32000_2008.pdf |title=Document Management – Portable Document Format – Part 1: PDF 1.7, First Edition |author=Adobe Systems Inc.|date=July 1, 2008|access-date=January 12, 2023|url-status=dead|archive-url=https://web.archive.org/web/20081203002256/https://www.adobe.com/devnet/acrobat/pdfs/PDF32000_2008.pdf|archive-date=December 3, 2008}}</ref><ref>{{cite web |url=http://gnupdf.org/Forms_Data_Format |title=Gnu PDF – PDF Knowledge – Forms Data Format |archive-url=https://web.archive.org/web/20130101054615/http://www.gnupdf.org/Forms_Data_Format |archive-date=January 1, 2013 |access-date=January 12, 2023|url-status=deadusurped}}</ref><ref>{{cite web |url=http://livedocs.adobe.com/coldfusion/8/htmldocs/help.html?content=formsPDF_02.html |title=About PDF forms |access-date=February 19, 2010 |url-status=dead |archive-url=https://web.archive.org/web/20110429032948/http://livedocs.adobe.com/coldfusion/8/htmldocs/help.html?content=formsPDF_02.html |archive-date=April 29, 2011 |publisher=Adobe Systems Inc.}}</ref><ref>{{cite web|url=https://community.adobe.com/t5/acrobat-sdk-discussions/convert-xfa-form-to-acroform/td-p/1175365|title=Convert XFA Form to AcroForm?|last=Demling|first=Peter|date=July 1, 2008|access-date=January 12, 2023|archive-date=January 12, 2023|archive-url=https://web.archive.org/web/20230112153241/https://community.adobe.com/t5/acrobat-sdk-discussions/convert-xfa-form-to-acroform/td-p/1175365|url-status=live}}</ref>
 
* AcroForms (also known as Acrobat forms), introduced in the PDF 1.2 format specification and included in all later PDF specifications.
Line 312 ⟶ 310:
There are many software options for creating PDFs, including the PDF printing capabilities built into [[macOS]], [[iOS]],<ref>{{Cite web|url=https://ijunkie.com/how-to-create-pdf-web-page-safari-iphone-ipad-ios-11/|title=How to Create a PDF from Web Page on iPhone and iPad in iOS 11|last=Pathak|first=Khamosh|date=October 7, 2017|website=iJunkie|access-date=January 12, 2023|archive-date=January 12, 2023|archive-url=https://web.archive.org/web/20230112153246/https://ijunkie.com/how-to-create-pdf-web-page-safari-iphone-ipad-ios-11/|url-status=live}}</ref> and most [[Linux]] distributions. Much document processing software including [[LibreOffice]], [[Microsoft Office 2007]] (if updated to [[Office 2007#Service Pack 2|SP2]]) and later,<ref>{{cite web|url=http://support.microsoft.com/kb/953195|title=Description of 2007 Microsoft Office Suite Service Pack 2 (SP2)|publisher=[[Microsoft]]|url-status=dead|archive-url=https://web.archive.org/web/20090429212434/http://support.microsoft.com/kb/953195|archive-date=April 29, 2009|access-date=January 12, 2023}}</ref> [[WordPerfect]] 9, and [[Scribus]] can export documents in PDF format. There are many PDF print drivers for Microsoft Windows, the [[pdfTeX]] typesetting system, the [[DocBook]] PDF tools, applications developed around [[Ghostscript]] and [[Adobe Acrobat]] itself as well as [[Adobe InDesign]], [[Adobe FrameMaker]], Adobe Illustrator, Adobe Photoshop, that allow a "PDF printer" to be set up, which when selected sends output to a PDF file instead of a physical printer. [[Google]]'s online office suite [[Google Docs]] allows uploading and saving to PDF. Some web apps offer free PDF editing and annotation tools.
 
The [[Free Software Foundation]] was "developing a free, high-quality and fully functional set of libraries and programs that implement the PDF file format and associated technologies to the ISO 32000 standard", as one of its [[High priority free software projects|high priority projects]].<ref>On 2014-04-02, a note dated February 10, 2009 referred to [http://www.fsf.org/campaigns/priority.html Current FSF High Priority Free Software Projects] {{Webarchive|url=https://web.archive.org/web/20070810230457/http://www.fsf.org/campaigns/priority.html |date=August 10, 2007 }} as a source. Content of the latter page, however, changes over time.</ref><ref>{{cite web|url=http://gnupdf.org/Goals_and_Motivations|title=Goals and Motivations|publisher=GNUpdf|date=November 28, 2007|website=gnupdf.org|access-date=April 2, 2014|archive-date=July 4, 2014|archive-url=https://web.archive.org/web/20140704114405/http://www.gnupdf.org/Goals_and_Motivations|url-status=liveusurped}}</ref> In 2011, however, the GNU PDF project was removed from the list of "high priority projects" due to the maturation of the [[Poppler (software)|Poppler library]],<ref>{{cite web|title=GNU PDF project leaves FSF High Priority Projects list; mission complete!|url=http://www.fsf.org/blogs/community/gnu-pdf-project-leaves-high-priority-projects-list-mission-complete|date=October 6, 2011|first=Matt|last=Lee|publisher=Free Software Foundation|website=fsf.org|archive-date=December 28, 2014|archive-url=https://web.archive.org/web/20141228050435/http://www.fsf.org/blogs/community/gnu-pdf-project-leaves-high-priority-projects-list-mission-complete|url-status=live}}</ref> which has enjoyed wider use in applications such as [[Evince]] with the [[GNOME]] desktop environment. Poppler is based on [[Xpdf]]<ref>{{cite web|url=http://poppler.freedesktop.org/|title=Poppler Homepage|quote=Poppler is a PDF rendering library based on the xpdf-3.0 code base.|access-date=January 12, 2023|archive-date=January 8, 2015|archive-url=https://web.archive.org/web/20150108235708/http://poppler.freedesktop.org/|url-status=live}}</ref><ref>{{cite web|url=http://cgit.freedesktop.org/poppler/poppler/tree/README-XPDF|title=Xpdf License|quote=Xpdf is licensed under the GNU General Public License (GPL), version 2 or 3.|access-date=January 12, 2023|archive-date=April 14, 2013|archive-url=https://archive.today/20130414194348/http://cgit.freedesktop.org/poppler/poppler/tree/README-XPDF|url-status=live}}</ref> code base. There are also commercial development libraries available as listed in [[List of PDF software]].
 
The [[Apache PDFBox]] project of the [[Apache Software Foundation]] is an open source Java library, licensed under the [[Apache License]], for working with PDF documents.<ref>{{cite web|url=http://pdfbox.apache.org/|url-status=live|title=The Apache PDFBox project- Apache PDFBox 3.0.0 released|date=August 17, 2023|archive-date=January 7, 2023|archive-url=https://web.archive.org/web/20230107234923/https://pdfbox.apache.org/}} Updated for new releases.</ref>
Line 332 ⟶ 330:
=== Native display model ===
{{unreferenced section|date=November 2023}}
PDF was selected as the "native" [[metafile]] format for [[macOS]] (originally called Mac OS X), replacing the [[PICT]] format of the earlier [[classic Mac OS]]. The imaging model of the [[Quartz (graphics layer)|Quartz]] graphics layer is based on the model common to [[Display PostScript]] and PDF, leading to the nickname ''Display PDF''. The [[Preview (macOS)|Preview]] application can display PDF files, as can version 2.0 and later of the [[Safari (web browser)|Safari]] web browser. System-level support for PDF allows Mac OS XmacOS applications to create PDF documents automatically, provided they support the OS-standard printing architecture. The files are then exported in PDF 1.3 format according to the file header. When taking a screenshot under Mac OS X versions 10.0 through 10.3, the image was also captured as a PDF; later versions save screen captures as a PNG file, though this behavior can be set back to PDF if desired.
 
=== Annotation ===
Line 364 ⟶ 362:
== Further reading ==
 
* {{cite bookconference | last1 = Hardy | first1 = M. R. B. | last2 = Brailsford | first2 = D. F. | chaptertitle = Mapping and displaying structural transformations between XML and PDF | book-title = Proceedings of the 2002 ACM symposium on Document engineering – DocEng '02 | pages = 95–102 | year = 2002 | url = https://www.cs.nott.ac.uk/~psadb1/Publications/Download/2002/Hardy02.pdf | doi = 10.1145/585058.585077 | publisher = Proceedings of the 2002 ACM symposium on Document engineering |isbn = 1-58113-594-7 | s2cid = 9371237 |archive-url=https://web.archive.org/web/20170324072906/https://www.cs.nott.ac.uk/~psadb1/Publications/Download/2002/Hardy02.pdf |archive-date=2017-03-24 |url-status=dead}}{{relevance inline|date=May 2022|reason=Why would random conference paper about some particular plugin for Adobe Acrobat be of interest to the reader?}}
* PDF 2.0 {{cite web |url = https://www.iso.org/standard/75839.html |title=ISO 32000-2:2020(en), Document management — Portable document format — Part 2: PDF 2.0 |website = International Organization for Standardization |language = English |access-date = December 16, 2020 }}
* PDF 2.0 {{cite web |url = https://www.iso.org/standard/63534.html |title=ISO 32000-2:2017(en), Document management — Portable document format — Part 2: PDF 2.0 |website = International Organization for Standardization |date=August 3, 2017 |language = English |access-date = January 31, 2019 }}