EP1567942A2 - Procede pour coder un document base sur le langage xml - Google Patents
Procede pour coder un document base sur le langage xmlInfo
- Publication number
- EP1567942A2 EP1567942A2 EP03789106A EP03789106A EP1567942A2 EP 1567942 A2 EP1567942 A2 EP 1567942A2 EP 03789106 A EP03789106 A EP 03789106A EP 03789106 A EP03789106 A EP 03789106A EP 1567942 A2 EP1567942 A2 EP 1567942A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- codes
- xml
- content
- mixed
- document
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
- H04N21/2353—Processing of additional data, e.g. scrambling of additional data or processing content descriptors specifically adapted to content descriptors, e.g. coding, compressing or processing of metadata
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/14—Tree-structured documents
- G06F40/143—Markup, e.g. Standard Generalized Markup Language [SGML] or Document Type Definition [DTD]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
Definitions
- the invention relates to a method for coding an XML-based document that contains content according to an XML schema language definition, a corresponding decoding method and corresponding coding and decoding devices.
- XML extensible markup language
- XML schema A more detailed description of the XML schema as well as the structures, data types and content models used in it can be found in references [1], [2] and [3].
- the object of the invention is therefore to provide a method for coding XML-based documents, which simpler access to coded textual content of the "Complex Type” data type with "mixed” content model enabled.
- a coded binary representation of an XML-based document is generated by assigning binary structure codes to the contents of the document via coding tables, textual contents of a "Complex Type" data type being assigned to the "mixed" content model structure codes.
- the structure codes are the SBC schema branch codes defined in section 7.6.1 of document [4]. The assignment of structure codes to the contents of the document as described in [4] enables the position of these contents in the structure of the XML documents to be signaled or addressed.
- the invention essentially consists in the fact that the textual content of a type “complex type ⁇ modell with content model“ mixed ”is regarded as an element declaration in the type definition in the code assignment. Accordingly, in addition to the declared elements in a type definition, the coding is also used A defined structure code is assigned to the textual content if a content model is defined as mixed, which means that textual content is addressed in the coded data stream so that it can be accessed without having to decode the entire data stream.
- the structure codes are assigned to the textual contents of a "complex type" data type with “mixed” content models exclusively via OperandTBC-
- the position of the textual contents of a data type “complex type” with the content model “mixed” is also assigned. These are the position codes described in section 7.6.5.5 of document [4]. Since in a data type "Complex Type” with the content model "mixed” several textual contents can be contained, the information about the position of the textual content within the data type is thereby transmitted.
- “single element position codes” and / or “multiple element position codes” are used in the assignment of the "position codes”. These position codes are described in more detail in the publication [4], section 7.6.5.5.
- Single Element position codes are used in particular when no "model group” in the type definition of the "complex type” can occur more than once in the XML schema definition. A definition of the "model group” can be found in document [2 ].
- the single element position code determines the position of a content with respect to a particular particle in an instantiation of a data type. A definition for particles can also be found in document [2].
- the single element position code is encoded on the assumption that the textual content is declared a maximum of MPA + 1 times, where MPA is the number of all possible particle types in this data type
- a multiple element position code is used if in the definition of the "complex type""modelgroups" in the XML schema definition can occur more than once.
- the multiple element position code is coded on the assumption that a total of 2 * MPA + 1 positions can be addressed, this code representing the position the content of all particles in an instantiation of a data type.
- the position codes are coded with codes of variable length, in particular with the code vluimsbf5, which is described in document [4], section 4.3.
- the invention also comprises a decoding method with which a binary representation of an XML-based document coded according to the coding method described above is decoded.
- a decoding method with which a binary representation of an XML-based document coded according to the coding method described above is decoded.
- binary representations of textual contents of a "Complex Type” data type with the "mixed” content model, to which structure codes (SBC) were assigned during coding are incorporated into the XML textual contents assigned to the structure codes (SBC). based document converted.
- the assignment is carried out by structure codes (SBC) via operand TBC coding tables.
- SBC structure codes
- binary representations of textual content of a "Complex Type" data type with the "mixed” content model, addressed with “Position Codes”, are converted into textual content at the assigned position.
- the "Position Codes” can in turn be “Single” Element Position Codes "and / or” Multiple Element Position Codes "include.
- These position codes are the same position codes as are defined in relation to the coding method.
- the "position codes” can also be coded with codes of variable length, these codes being decoded when the position codes are converted into textual content.
- the position codes are preferably coded with the code vluimsbf ⁇ .
- the invention further comprises an encoding and decoding method, which comprises the encoding method according to the invention and the decoding method according to the invention.
- the invention relates to a device for coding XML-based documents with which the coding method according to the invention can be carried out, the device comprising a storage means in which at least one assignment of a textual content of a data type "complex type” with the content model "mixed" to a structure code is stored.
- the invention relates to a device for decoding a coded binary representation of an XML-based document, the device being set up in such a way that the decoding method according to the invention can be carried out.
- the device comprises a storage means in which at least one assignment of a structure code to a textual content of a data type "complex type” with the content model "mixed” is stored.
- the invention relates to a device for coding and decoding an XML-based document, comprising the above-described coding device according to the invention and the above-described decoding device according to the invention.
- Figure 1 is a schematic diagram of an encoding and decoding system according to the invention with encoder and decoder;
- FIG. 2 shows an XML schema definition in which, among other things, a data type “complex type” with a content model “mixed” is defined;
- FIG. 3 shows an XML document in which an element “MixedElement” declared in the XML schema definition of FIG. 2 is instantiated;
- FIG. 4 shows a graphical representation of the structure of the element “MixedElement” instantiated in the XML document of FIG. 3;
- FIG. 5 shows an illustration to explain the assignment of structure codes in the case of data types “complex type” with content model “mixed”;
- FIG. 6 shows an illustration to explain the assignment of “position codes” for “complex type” data types
- FIG. 1 shows an example of a coding and decoding system with an encoder ENG and a decoder DEC, with which XML documents DOC are coded or decoded.
- Both the encoder and the decoder both have a so-called XML schema S, in which the elements and types of the XML document used for communication are declared and defined.
- Code tables CT are generated from the scheme S via corresponding scheme compilations SC in the encoder and decoder. If the XML document is DOC encoded, binary codes are assigned to the contents of the XML document via the code tables. This creates a binary representation BDOC of the document DOC, which can be decoded again in the decoder using the code table CT.
- the method according to the invention is characterized in that textual contents of a "complex type” data type are assigned binary structure codes with the "mixed” content model. This enables the textual data to be filtered out of the BDOC binary representation without the entire BDOC binary representation having to be decoded.
- FIG. 2 shows an example of a schema S, an element with the name “Example” being declared in lines 4 to 10 of this schema, which in turn contains an element of the name "MixedElement" of the type "MixedType". In the lines 12 to 17 the type "MixedType” is defined. This is a “complex type” data type with the content model "mixed", which can be seen in particular from line 12.
- the "MixedType” type contains two elements with the names "firstElement” and “secondElement", both of which are of the type "string”.
- FIG. 3 shows an instantiation of the "MixedElement” element in an XML document. Since the content model "mixed" can contain textual content in the form of strings, textual content can occur before, after or between the first and second elements “firstElement” and “secondElement". In the example in FIG. 3, a total of three textual contents occur.
- Any document based on the XML language can be represented by a so-called tree structure, the contents of the XML document forming nodes in the tree structure and so-called context paths leading to these nodes.
- Binary structure codes are assigned to the nodes of the tree structure during coding.
- a structure code for the parent node and for the elements "firstElement” and “secondElement” are assigned for the element node “MixedElement” shown in FIG Node of the element "MixedElement” is connected.
- a structure code is assigned for the parent node and the elements "firstElement and" secondElement ", but also a structure code is assigned for the textual content. This is illustrated in FIG code 00 is assigned to the parent node, code 01 is assigned to the textual content and codes 10 and 11 are assigned to the "first element” and the "second element", respectively.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Theoretical Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Document Processing Apparatus (AREA)
Abstract
La présente invention concerne un procédé pour coder un document basé sur le langage XML (DOC) qui contient des contenus définis par le langage de schéma XML. Selon l'invention, une représentation binaire codée (BDOC) du document est produite, des codes de structure binaire (SBC) étant associés aux contenus du document par l'intermédiaire de tables de codage (CT), les contenus textuels d'un type de données complexe ("Complex Type") étant associés au modèle de contenu codes de structure mixte ("mixed") (SBC).
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DE10256455 | 2002-12-03 | ||
| DE10256455 | 2002-12-03 | ||
| DE10339971 | 2003-08-29 | ||
| DE10339971A DE10339971A1 (de) | 2002-12-03 | 2003-08-29 | Verfahren zur Codierung eines XML-basierten Dokuments |
| PCT/EP2003/013511 WO2004051502A2 (fr) | 2002-12-03 | 2003-12-01 | Procede pour coder un document base sur le langage xml |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| EP1567942A2 true EP1567942A2 (fr) | 2005-08-31 |
Family
ID=32471494
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP03789106A Ceased EP1567942A2 (fr) | 2002-12-03 | 2003-12-01 | Procede pour coder un document base sur le langage xml |
Country Status (3)
| Country | Link |
|---|---|
| EP (1) | EP1567942A2 (fr) |
| AU (1) | AU2003293743A1 (fr) |
| WO (1) | WO2004051502A2 (fr) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7747558B2 (en) | 2007-06-07 | 2010-06-29 | Motorola, Inc. | Method and apparatus to bind media with metadata using standard metadata headers |
-
2003
- 2003-12-01 AU AU2003293743A patent/AU2003293743A1/en not_active Abandoned
- 2003-12-01 EP EP03789106A patent/EP1567942A2/fr not_active Ceased
- 2003-12-01 WO PCT/EP2003/013511 patent/WO2004051502A2/fr not_active Ceased
Non-Patent Citations (2)
| Title |
|---|
| BING L. ET AL: "An Architecture for Multidatabase Systems Based on Corba and XML", 12TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, 3 September 2001 (2001-09-03), PISCATAWAY, NJ, USA, XP010558716 * |
| BOX D. ET AL: "Essential XML - Beyond Markup", September 2000, ADDISON-WESLEY * |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2004051502A3 (fr) | 2005-02-03 |
| AU2003293743A1 (en) | 2004-06-23 |
| WO2004051502A2 (fr) | 2004-06-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP2197213B1 (fr) | Procédé d'amélioration de la fonctionnalité de la représentation binaire de MPEG-7 et autres descriptions de contenu XML | |
| WO2004017225A1 (fr) | Procede et dispositif pour coder/decoder des documents structures, notamment des documents xml | |
| DE60225785T2 (de) | Verfahren zur codierung und decodierung eines pfades in der baumstruktur eines strukturierten dokuments | |
| EP1645133B1 (fr) | Procede de codage de documents structures | |
| WO2004047304A2 (fr) | Procede de production d'un train de bits a partir d'un arbre d'indexation | |
| DE102004034004A1 (de) | Verfahren zum Codieren eines XML-Dokuments, sowie Verfahren zum Decodieren, Verfahren zum Codieren und Decodieren, Codiervorrichtung, Decodiervorrichtung und Vorrichtung zum Codieren und Decodieren | |
| EP1616274B1 (fr) | Procede de codage d'un document structure | |
| EP1400124B1 (fr) | Procede pour ameliorer la fonctionnalite de la representation binaire de descriptions de contenu mpeg-7 et basees sur d'autres langages xml | |
| DE10339971A1 (de) | Verfahren zur Codierung eines XML-basierten Dokuments | |
| WO2004051502A2 (fr) | Procede pour coder un document base sur le langage xml | |
| DE102004009617A1 (de) | Verfahren und Vorrichtung zur Codierung und Decodierung von strukturierten Dokumenten | |
| EP0763920A2 (fr) | Méthode pour le codage ou décodage d'unités de données de protocole | |
| DE10231970B3 (de) | Verfahren zur Codierung von Positionen von Datenelementen in einer Datenstruktur sowie Vorrichtungen zur entsprechenden Codierung und/oder Decodierung | |
| WO2003001404A2 (fr) | Procede permettant une recherche rapide d'elements ou d'attributs, ou un filtrage rapide de fragments dans des representations binaires de documents structures par exemple a base xml | |
| EP0828368B1 (fr) | Procédé et dispositif pour accéder à un document multimedia | |
| DE10351897A1 (de) | Verfahren zur Codierung von strukturierten Dokumenten | |
| WO2025168430A1 (fr) | Procédé mis en œuvre par ordinateur pour fournir des artéfacts pour une simulation d'un dispositif réel | |
| WO2026000006A1 (fr) | Procédé d'analyse fonctionnelle d'un système source | |
| DE102004044164A1 (de) | Verfahren und Vorrichtung zur Kodierung von XML-Dokumenten | |
| WO2006027323A1 (fr) | Procede de codage d'un document xml | |
| DE10248758A1 (de) | Verfahren und Vorrichtungen zum Encodieren/Decodieren von strukturierten Dokumenten, insbesondere von XML-Dokumenten | |
| DE3523247A1 (de) | Einrichtung zur datenreduktion binaerer datenstroeme | |
| DE10351896A1 (de) | Verfahren zur Indizierung von strukturierten Dokumenten |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| 17P | Request for examination filed |
Effective date: 20050422 |
|
| AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
| AX | Request for extension of the european patent |
Extension state: AL LT LV MK |
|
| DAX | Request for extension of the european patent (deleted) | ||
| RIN1 | Information on inventor provided before grant (corrected) |
Inventor name: HUTTER, ANDREAS Inventor name: HEUER, JOERG |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
| 18R | Application refused |
Effective date: 20081007 |