EP1567942A2 - Procede pour coder un document base sur le langage xml - Google Patents

Procede pour coder un document base sur le langage xml

Info

Publication number
EP1567942A2
EP1567942A2 EP03789106A EP03789106A EP1567942A2 EP 1567942 A2 EP1567942 A2 EP 1567942A2 EP 03789106 A EP03789106 A EP 03789106A EP 03789106 A EP03789106 A EP 03789106A EP 1567942 A2 EP1567942 A2 EP 1567942A2
Authority
EP
European Patent Office
Prior art keywords
codes
xml
content
mixed
document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP03789106A
Other languages
German (de)
English (en)
Inventor
Jörg Heuer
Andreas Hutter
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Siemens AG
Siemens Corp
Original Assignee
Siemens AG
Siemens Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from DE10339971A external-priority patent/DE10339971A1/de
Application filed by Siemens AG, Siemens Corp filed Critical Siemens AG
Publication of EP1567942A2 publication Critical patent/EP1567942A2/fr
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H04N21/2353Processing of additional data, e.g. scrambling of additional data or processing content descriptors specifically adapted to content descriptors, e.g. coding, compressing or processing of metadata
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/14Tree-structured documents
    • G06F40/143Markup, e.g. Standard Generalized Markup Language [SGML] or Document Type Definition [DTD]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream

Definitions

  • the invention relates to a method for coding an XML-based document that contains content according to an XML schema language definition, a corresponding decoding method and corresponding coding and decoding devices.
  • XML extensible markup language
  • XML schema A more detailed description of the XML schema as well as the structures, data types and content models used in it can be found in references [1], [2] and [3].
  • the object of the invention is therefore to provide a method for coding XML-based documents, which simpler access to coded textual content of the "Complex Type” data type with "mixed” content model enabled.
  • a coded binary representation of an XML-based document is generated by assigning binary structure codes to the contents of the document via coding tables, textual contents of a "Complex Type" data type being assigned to the "mixed" content model structure codes.
  • the structure codes are the SBC schema branch codes defined in section 7.6.1 of document [4]. The assignment of structure codes to the contents of the document as described in [4] enables the position of these contents in the structure of the XML documents to be signaled or addressed.
  • the invention essentially consists in the fact that the textual content of a type “complex type ⁇ modell with content model“ mixed ”is regarded as an element declaration in the type definition in the code assignment. Accordingly, in addition to the declared elements in a type definition, the coding is also used A defined structure code is assigned to the textual content if a content model is defined as mixed, which means that textual content is addressed in the coded data stream so that it can be accessed without having to decode the entire data stream.
  • the structure codes are assigned to the textual contents of a "complex type" data type with “mixed” content models exclusively via OperandTBC-
  • the position of the textual contents of a data type “complex type” with the content model “mixed” is also assigned. These are the position codes described in section 7.6.5.5 of document [4]. Since in a data type "Complex Type” with the content model "mixed” several textual contents can be contained, the information about the position of the textual content within the data type is thereby transmitted.
  • “single element position codes” and / or “multiple element position codes” are used in the assignment of the "position codes”. These position codes are described in more detail in the publication [4], section 7.6.5.5.
  • Single Element position codes are used in particular when no "model group” in the type definition of the "complex type” can occur more than once in the XML schema definition. A definition of the "model group” can be found in document [2 ].
  • the single element position code determines the position of a content with respect to a particular particle in an instantiation of a data type. A definition for particles can also be found in document [2].
  • the single element position code is encoded on the assumption that the textual content is declared a maximum of MPA + 1 times, where MPA is the number of all possible particle types in this data type
  • a multiple element position code is used if in the definition of the "complex type""modelgroups" in the XML schema definition can occur more than once.
  • the multiple element position code is coded on the assumption that a total of 2 * MPA + 1 positions can be addressed, this code representing the position the content of all particles in an instantiation of a data type.
  • the position codes are coded with codes of variable length, in particular with the code vluimsbf5, which is described in document [4], section 4.3.
  • the invention also comprises a decoding method with which a binary representation of an XML-based document coded according to the coding method described above is decoded.
  • a decoding method with which a binary representation of an XML-based document coded according to the coding method described above is decoded.
  • binary representations of textual contents of a "Complex Type” data type with the "mixed” content model, to which structure codes (SBC) were assigned during coding are incorporated into the XML textual contents assigned to the structure codes (SBC). based document converted.
  • the assignment is carried out by structure codes (SBC) via operand TBC coding tables.
  • SBC structure codes
  • binary representations of textual content of a "Complex Type" data type with the "mixed” content model, addressed with “Position Codes”, are converted into textual content at the assigned position.
  • the "Position Codes” can in turn be “Single” Element Position Codes "and / or” Multiple Element Position Codes "include.
  • These position codes are the same position codes as are defined in relation to the coding method.
  • the "position codes” can also be coded with codes of variable length, these codes being decoded when the position codes are converted into textual content.
  • the position codes are preferably coded with the code vluimsbf ⁇ .
  • the invention further comprises an encoding and decoding method, which comprises the encoding method according to the invention and the decoding method according to the invention.
  • the invention relates to a device for coding XML-based documents with which the coding method according to the invention can be carried out, the device comprising a storage means in which at least one assignment of a textual content of a data type "complex type” with the content model "mixed" to a structure code is stored.
  • the invention relates to a device for decoding a coded binary representation of an XML-based document, the device being set up in such a way that the decoding method according to the invention can be carried out.
  • the device comprises a storage means in which at least one assignment of a structure code to a textual content of a data type "complex type” with the content model "mixed” is stored.
  • the invention relates to a device for coding and decoding an XML-based document, comprising the above-described coding device according to the invention and the above-described decoding device according to the invention.
  • Figure 1 is a schematic diagram of an encoding and decoding system according to the invention with encoder and decoder;
  • FIG. 2 shows an XML schema definition in which, among other things, a data type “complex type” with a content model “mixed” is defined;
  • FIG. 3 shows an XML document in which an element “MixedElement” declared in the XML schema definition of FIG. 2 is instantiated;
  • FIG. 4 shows a graphical representation of the structure of the element “MixedElement” instantiated in the XML document of FIG. 3;
  • FIG. 5 shows an illustration to explain the assignment of structure codes in the case of data types “complex type” with content model “mixed”;
  • FIG. 6 shows an illustration to explain the assignment of “position codes” for “complex type” data types
  • FIG. 1 shows an example of a coding and decoding system with an encoder ENG and a decoder DEC, with which XML documents DOC are coded or decoded.
  • Both the encoder and the decoder both have a so-called XML schema S, in which the elements and types of the XML document used for communication are declared and defined.
  • Code tables CT are generated from the scheme S via corresponding scheme compilations SC in the encoder and decoder. If the XML document is DOC encoded, binary codes are assigned to the contents of the XML document via the code tables. This creates a binary representation BDOC of the document DOC, which can be decoded again in the decoder using the code table CT.
  • the method according to the invention is characterized in that textual contents of a "complex type” data type are assigned binary structure codes with the "mixed” content model. This enables the textual data to be filtered out of the BDOC binary representation without the entire BDOC binary representation having to be decoded.
  • FIG. 2 shows an example of a schema S, an element with the name “Example” being declared in lines 4 to 10 of this schema, which in turn contains an element of the name "MixedElement" of the type "MixedType". In the lines 12 to 17 the type "MixedType” is defined. This is a “complex type” data type with the content model "mixed", which can be seen in particular from line 12.
  • the "MixedType” type contains two elements with the names "firstElement” and “secondElement", both of which are of the type "string”.
  • FIG. 3 shows an instantiation of the "MixedElement” element in an XML document. Since the content model "mixed" can contain textual content in the form of strings, textual content can occur before, after or between the first and second elements “firstElement” and “secondElement". In the example in FIG. 3, a total of three textual contents occur.
  • Any document based on the XML language can be represented by a so-called tree structure, the contents of the XML document forming nodes in the tree structure and so-called context paths leading to these nodes.
  • Binary structure codes are assigned to the nodes of the tree structure during coding.
  • a structure code for the parent node and for the elements "firstElement” and “secondElement” are assigned for the element node “MixedElement” shown in FIG Node of the element "MixedElement” is connected.
  • a structure code is assigned for the parent node and the elements "firstElement and" secondElement ", but also a structure code is assigned for the textual content. This is illustrated in FIG code 00 is assigned to the parent node, code 01 is assigned to the textual content and codes 10 and 11 are assigned to the "first element” and the "second element", respectively.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Document Processing Apparatus (AREA)

Abstract

La présente invention concerne un procédé pour coder un document basé sur le langage XML (DOC) qui contient des contenus définis par le langage de schéma XML. Selon l'invention, une représentation binaire codée (BDOC) du document est produite, des codes de structure binaire (SBC) étant associés aux contenus du document par l'intermédiaire de tables de codage (CT), les contenus textuels d'un type de données complexe ("Complex Type") étant associés au modèle de contenu codes de structure mixte ("mixed") (SBC).
EP03789106A 2002-12-03 2003-12-01 Procede pour coder un document base sur le langage xml Ceased EP1567942A2 (fr)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
DE10256455 2002-12-03
DE10256455 2002-12-03
DE10339971 2003-08-29
DE10339971A DE10339971A1 (de) 2002-12-03 2003-08-29 Verfahren zur Codierung eines XML-basierten Dokuments
PCT/EP2003/013511 WO2004051502A2 (fr) 2002-12-03 2003-12-01 Procede pour coder un document base sur le langage xml

Publications (1)

Publication Number Publication Date
EP1567942A2 true EP1567942A2 (fr) 2005-08-31

Family

ID=32471494

Family Applications (1)

Application Number Title Priority Date Filing Date
EP03789106A Ceased EP1567942A2 (fr) 2002-12-03 2003-12-01 Procede pour coder un document base sur le langage xml

Country Status (3)

Country Link
EP (1) EP1567942A2 (fr)
AU (1) AU2003293743A1 (fr)
WO (1) WO2004051502A2 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7747558B2 (en) 2007-06-07 2010-06-29 Motorola, Inc. Method and apparatus to bind media with metadata using standard metadata headers

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
BING L. ET AL: "An Architecture for Multidatabase Systems Based on Corba and XML", 12TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, 3 September 2001 (2001-09-03), PISCATAWAY, NJ, USA, XP010558716 *
BOX D. ET AL: "Essential XML - Beyond Markup", September 2000, ADDISON-WESLEY *

Also Published As

Publication number Publication date
WO2004051502A3 (fr) 2005-02-03
AU2003293743A1 (en) 2004-06-23
WO2004051502A2 (fr) 2004-06-17

Similar Documents

Publication Publication Date Title
EP2197213B1 (fr) Procédé d'amélioration de la fonctionnalité de la représentation binaire de MPEG-7 et autres descriptions de contenu XML
WO2004017225A1 (fr) Procede et dispositif pour coder/decoder des documents structures, notamment des documents xml
DE60225785T2 (de) Verfahren zur codierung und decodierung eines pfades in der baumstruktur eines strukturierten dokuments
EP1645133B1 (fr) Procede de codage de documents structures
WO2004047304A2 (fr) Procede de production d'un train de bits a partir d'un arbre d'indexation
DE102004034004A1 (de) Verfahren zum Codieren eines XML-Dokuments, sowie Verfahren zum Decodieren, Verfahren zum Codieren und Decodieren, Codiervorrichtung, Decodiervorrichtung und Vorrichtung zum Codieren und Decodieren
EP1616274B1 (fr) Procede de codage d'un document structure
EP1400124B1 (fr) Procede pour ameliorer la fonctionnalite de la representation binaire de descriptions de contenu mpeg-7 et basees sur d'autres langages xml
DE10339971A1 (de) Verfahren zur Codierung eines XML-basierten Dokuments
WO2004051502A2 (fr) Procede pour coder un document base sur le langage xml
DE102004009617A1 (de) Verfahren und Vorrichtung zur Codierung und Decodierung von strukturierten Dokumenten
EP0763920A2 (fr) Méthode pour le codage ou décodage d'unités de données de protocole
DE10231970B3 (de) Verfahren zur Codierung von Positionen von Datenelementen in einer Datenstruktur sowie Vorrichtungen zur entsprechenden Codierung und/oder Decodierung
WO2003001404A2 (fr) Procede permettant une recherche rapide d'elements ou d'attributs, ou un filtrage rapide de fragments dans des representations binaires de documents structures par exemple a base xml
EP0828368B1 (fr) Procédé et dispositif pour accéder à un document multimedia
DE10351897A1 (de) Verfahren zur Codierung von strukturierten Dokumenten
WO2025168430A1 (fr) Procédé mis en œuvre par ordinateur pour fournir des artéfacts pour une simulation d'un dispositif réel
WO2026000006A1 (fr) Procédé d'analyse fonctionnelle d'un système source
DE102004044164A1 (de) Verfahren und Vorrichtung zur Kodierung von XML-Dokumenten
WO2006027323A1 (fr) Procede de codage d'un document xml
DE10248758A1 (de) Verfahren und Vorrichtungen zum Encodieren/Decodieren von strukturierten Dokumenten, insbesondere von XML-Dokumenten
DE3523247A1 (de) Einrichtung zur datenreduktion binaerer datenstroeme
DE10351896A1 (de) Verfahren zur Indizierung von strukturierten Dokumenten

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20050422

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL LT LV MK

DAX Request for extension of the european patent (deleted)
RIN1 Information on inventor provided before grant (corrected)

Inventor name: HUTTER, ANDREAS

Inventor name: HEUER, JOERG

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20081007