WO2015183060A1 - Procédé, appareil, et support d'enregistrement lisible par ordinateur pour fournir un contenu audio en utilisant un objet audio - Google Patents

Procédé, appareil, et support d'enregistrement lisible par ordinateur pour fournir un contenu audio en utilisant un objet audio Download PDF

Info

Publication number
WO2015183060A1
WO2015183060A1 PCT/KR2015/005488 KR2015005488W WO2015183060A1 WO 2015183060 A1 WO2015183060 A1 WO 2015183060A1 KR 2015005488 W KR2015005488 W KR 2015005488W WO 2015183060 A1 WO2015183060 A1 WO 2015183060A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
audio content
audio
content
providing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/KR2015/005488
Other languages
English (en)
Korean (ko)
Inventor
전상배
김선민
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics Co Ltd
Original Assignee
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Samsung Electronics Co Ltd filed Critical Samsung Electronics Co Ltd
Publication of WO2015183060A1 publication Critical patent/WO2015183060A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies

Definitions

  • the present invention relates to a method and an apparatus for providing audio content using an audio object. More particularly, the present invention provides a method for providing audio content using audio objects, and more specifically, configures audio content as an audio object and provides the audio content with local information to be provided. The present invention relates to a method and an apparatus for providing content specialized in a region where a user who is provided with the user is located.
  • Audio content is often provided to the user in the form of radio broadcasting or music streaming.
  • the contents of audio contents are changed and distinguished only by the carrier frequency.
  • the audio contents are provided based on the approximate location information of the user which can be identified by the Internet address band.
  • Audio content is often provided to the user in the form of radio broadcasting or music streaming.
  • the contents of audio contents are changed and distinguished only by the carrier frequency.
  • the audio contents are provided based on the approximate location information of the user which can be identified by the Internet address band.
  • the present invention solves the problems of the prior art described above, and aims to provide more effectively the audio content by setting the advertising area more finely and providing the audio content specialized for the segmented area.
  • a method of providing audio content comprising: encoding at least one audio object for each of at least one audio content; Encoding object region information including region information for at least one audio object; And transmitting at least one bitstream generated as a result of encoding at least one audio object and object region information.
  • the audio content includes additional information related to the advertisement information.
  • the object region information includes information about a region where each of the at least one audio content is provided.
  • the information on the region where each of the at least one audio content is provided includes latitude information and longitude information.
  • a method of providing audio content the object including at least one audio object for each of the at least one audio content and local information about the at least one audio object.
  • the selectively reproducing may include obtaining reception location information on a location where at least one bitstream has been received; And selecting an audio object to be reproduced from at least one decoded audio object based on a result of comparing the received location information with the decoded object region information.
  • the audio content includes additional information related to the advertisement information.
  • the object region information includes information about a region where each of the at least one audio content is provided.
  • the information on the region where each of the at least one audio content is provided includes latitude information and longitude information.
  • the reception location information is obtained based on at least one of Global Positioning System (GPS) positioning, cell positioning and Wi-Fi positioning, and Internet Protocol (IP) positioning. do.
  • GPS Global Positioning System
  • IP Internet Protocol
  • the reception location information is obtained based on a user input.
  • the step of selecting an audio object to be reproduced selects the audio object based on the distance between the obtained reception position information and the reference position included in the object region information.
  • the reception location information includes information on a plurality of reception locations obtained at a plurality of time points.
  • an apparatus for providing audio content includes encoding at least one audio content into at least one audio object, and including local information on at least one audio content.
  • a transmitter configured to transmit at least one bitstream generated as a result of encoding at least one audio object and object region information.
  • the audio content includes additional information related to advertisement information.
  • the object region information includes information about a region where each of the at least one audio content is provided.
  • the information on the region where each of the at least one audio content is provided includes latitude information and longitude information.
  • An apparatus for providing audio content for solving the technical problem, an object including at least one audio object for each of the at least one audio content and local information about the at least one audio object
  • a receiving unit for receiving at least one bitstream generated as a result of encoding local information
  • a decoder for decoding at least one audio object and object region information from the received bitstream
  • a playback unit for selectively playing back audio content of at least one decoded audio object based on the decoded object region information.
  • the playback unit includes: an acquisition unit for acquiring reception location information on a location where at least one bitstream is received; And a selector configured to select an audio object to be reproduced among at least one decoded audio object based on a result of comparing the received location information with the decoded object region information.
  • the audio content includes additional information related to advertisement information.
  • the object region information includes information about a region where each of the at least one audio content is provided.
  • Information about a region where each of the at least one audio content is provided includes latitude information and longitude information.
  • the reception location information is obtained based on at least one of Global Positioning System (GPS) positioning, cell positioning and Wi-Fi positioning, and Internet Protocol (IP) positioning. do.
  • GPS Global Positioning System
  • IP Internet Protocol
  • the reception location information is obtained based on a user input.
  • the selector selects the audio object based on the distance between the obtained reception position information and the reference position included in the object region information.
  • the reception location information includes information on a plurality of reception locations obtained at a plurality of time points.
  • a computer-readable recording medium recording a program for executing the above-described method.
  • a computer readable recording medium for recording another method for implementing the present invention, another system, and a computer program for executing the method.
  • the audio content in the case of region-specific audio content, if the area in which the audio content is provided is set in more detail, the audio content can be provided to the user more efficiently, and various business models can be presented to the content provider.
  • 1 is a diagram illustrating an embodiment of a method of providing audio content.
  • FIG. 2 is a diagram illustrating an embodiment in which a frequency channel and transmission power are determined according to a transmission station and a relay station within a transmission zone.
  • FIG. 3 is a diagram illustrating an embodiment of providing content using an audio object.
  • FIG. 4 is a view for explaining a method for providing content according to an embodiment of the present invention.
  • FIG. 5 is a flowchart of a method for providing content according to an embodiment of the present invention.
  • FIG. 6 is a detailed block diagram of an apparatus for providing audio content according to an embodiment of the present invention.
  • FIG. 7 is a flowchart of a method for providing content according to an embodiment of the present invention.
  • FIG. 8 is a detailed configuration diagram of an apparatus for providing content according to an embodiment of the present invention.
  • FIG. 9 is a detailed configuration diagram of an apparatus for providing content according to another embodiment of the present invention.
  • FIG. 10 is content region information according to an embodiment of the present invention and indicates syntax of mae_ContentData when content region information is included in an MPEG-H MAE.
  • FIG. 11 is content region information according to an embodiment of the present invention and indicates a mae_ContentKind field when content region information is included in an MPEG-H MAE.
  • FIG. 12 is a diagram for describing a method of providing content based on a location of a user, according to an embodiment of the present invention.
  • FIG. 13 is a diagram for describing a method of providing content based on a location of a user, according to an exemplary embodiment.
  • a method of providing audio content comprising: encoding at least one audio object for each of at least one audio content into a bitstream; Encoding the object region information including the region information of the at least one audio object into a bitstream; And transmitting a bitstream in which at least one audio object and object region information are encoded.
  • 1 is a diagram illustrating an embodiment of a method of providing audio content.
  • FIG. 1 is an embodiment in which a broadcast station provides radio broadcast over airwaves. Radio broadcasting over the air waves transmits a radio signal at a transmitting station or a relay station of a broadcasting station, and a user receives and reproduces a radio signal through a radio receiver or the like so that the user can hear the content, that is, audio content.
  • the station or relay station of the broadcasting station is located as high as possible, and the broadcasting area is divided according to the range that the radio waves of the station or relay station can reach based on the output size of the transmission signal and the layout of the feature. Provide service.
  • the transmission power is determined according to the frequency channels arranged for each region, and the broadcasting zone is set so that signals of each broadcasting zone do not interfere with each other.
  • each carrier frequency and transmission power are determined according to the position of each transmitting station or relay station.
  • the transmission zone 120 of the second frequency transmission station 120 transmits the broadcast signal of the second frequency.
  • the frequency 1 and the frequency 2 means a carrier frequency.
  • the user (receiver) Since the user (receiver) is in the transmission area of the first transmission station, in order to receive the radio broadcast, the user (receiver) must set the carrier frequency of the receiver to the first frequency and receive the radio broadcast signal. If moving from the transmission area of the first transmission station to the transmission area of the second transmission station during broadcast reception, the carrier frequency of the receiver must be reset to the second frequency to correctly receive the same radio broadcast signal.
  • the first receiver 112 the second receiver 113, and the third receiver 114 must all set the same reception frequency and all receive the same audio content.
  • FIG. 2 is a diagram illustrating an embodiment in which a frequency channel and transmission power are determined according to a transmission station and a relay station within a transmission zone.
  • the carrier frequency 230 and the signal output 240 used to transmit KBS (Korea Broadcasting System) first radio broadcast signal at each transmitting station and relay station 220 in the broadcasting region of the metropolitan area are shown in FIG. Appear.
  • the KBS first radio broadcasting channel outputs corresponding broadcasting signals through 11 transmitting stations and repeaters according to each carrier frequency and output.
  • the FM broadcast signal is transmitted through a channel having a high carrier frequency in an urban area with a lot of users and obstacles such as a building, and the AM broadcast is performed through a channel having a low carrier frequency in a rural area where there are few users and a barrier is low. You can confirm that you are sending a signal.
  • the transmission / reception characteristic of a broadcast signal is a carousel method, that is, the transmitting end periodically transmits the same signal, and the receiving end receives and reproduces such a signal.
  • the broadcast signal has a feature of unidirectional communication for an unspecified number.
  • advertisements over the air radio broadcasting are different for each transmission station or relay station, and the same advertisements are transmitted to the same carrier use area, and all users in the transmission area or relay area of each transmission station or relay station are provided with the same advertisement.
  • the radius of the transmission zone at each station or the relay zone at each relay station can range from a few kilometers to a few hundred kilometers. As such, because a single area is considerably wider, there is a need to provide a more suitable advertisement according to the user's location by subdividing a single area.
  • the advertisement provided in the radio broadcast through the Internet may be composed of a separate advertisement from the advertisement provided in the radio broadcast through the airwaves.
  • a spotify service is representative.
  • the user may access the Spotify service page by using the Spotify application on a terminal such as a smartphone or by using an Internet browser on a terminal such as a PC.
  • Spotify offers streaming music licensed by record labels, while users of the free service can display ads in Spotify applications or insert audio ads between songs.
  • an audio service through the Internet network may basically have a bidirectional communication feature, but such a feature is limited to a general communication protocol.
  • IP Internet Protocol
  • the same advertisement is provided to all the listeners connected from the specific IP band by analyzing the Internet access address of the user connected to the corresponding server.
  • the IP band only provides approximate information about the location of the user (receiver) and it is difficult to know the exact location. Therefore, similarly to audio services such as radio broadcasting using airwaves, users existing in a wide range of regions are all provided with the same advertisement.
  • audio content specialized for the user's current location is provided. can do.
  • FIG. 3 is a diagram illustrating an embodiment of providing content using an audio object.
  • An audio object is a specific object that composes an audio signal. For example, dialogue, effects, and background music (BGM) that constitute the audio sound of each instrument or movie constituting music. ), Etc.
  • BGM background music
  • Etc By applying object-based signal processing and encoding techniques, each audio object constituting the scene of the audio content can be extracted and reconstructed as needed.
  • the general music content records each instrument constituting the music individually and mixes the tracks of each instrument appropriately through mixing. If the track of each instrument is composed of objects, the user can control each object independently, so that the sound volume of a specific object (instrument) can be adjusted and the spatial position of the object can be changed.
  • the audio sound of a movie may be reproduced in various countries, and the effect sound and the background music may be independent of the country, but in the case of dialogue, it may be reproduced in a language desired by a user. Therefore, the dialogue audio sound dubbed in various languages such as Korean, Japanese, and English can be processed as an audio object and included in the audio signal.
  • the user selects a desired language as Korean, an audio object corresponding to Korean is selected to be included in the audio signal so that the Korean dialogue is played.
  • MPEG SAOC is an object-based audio encoding technique proposed by MPEG to improve encoding efficiency or spatial rendering efficiency.
  • SAOC a plurality of audio objects are downmixed and transmitted as mono or stereo signals, so that encoding efficiency is good, flexible, and user-controllable rendering features.
  • the MEPG SAOC codec is taken as an example to describe an object-based audio encoding technique, but the present invention is not limited to a technique using an audio object using the MPEG SAOC codec.
  • the MPEG SAOC codec is largely composed of an object encoder 310, an object decoder 320, and a mixer / renderer 330.
  • the SAOC decoder is separated from the object decoder 320 and the mixer / renderer 330.
  • the SAOC decoder may be composed of one block in which the object decoder and the mixer / renderer are integrated.
  • the object encoder 310 generates the downmix signal 302 by inputting N object signals 301, Obj. # 1, Obj. # 2, ..., Obj. #N.
  • the output signal of the object encoder 310 is a bitstream consisting of the downmix signal 302 of the audio object and the metadata 303 of the audio object.
  • the bitstream of the encoded audio signal is transmitted to the object decoder 320 through a communication network.
  • the object decoder 320 decodes the bitstream 302 of the input downmix signal and restores the N object signals 304, Obj. # 1, Obj. # 2,..., Obj. # N.
  • the metadata 303 of the object signal is used to restore the object signal.
  • the mixer / renderer 330 mixes / renders the N audio objects based on the rendering information 305 to generate an output channel signal 306 to be reproduced with the M output channels.
  • FIG. 4 is a view for explaining a method for providing content according to an embodiment of the present invention.
  • content specific to each region may be provided according to the region where the user is located.
  • all users in one transmission area using the same carrier frequency may be provided with the same content.
  • the first user, the second user, and the third user all receive and play the same content.
  • the user-specific content may be provided according to the region where each user is located.
  • different (independent) content may be provided to the first user, the second user, and the third user at different locations.
  • the user's location that can analyze and understand the IP band as in the current range is quite wide.
  • the user configures an audio object that includes audio content to be provided specifically for the user's location, and configures metadata about the audio object including information about a region where the content included in the audio object is provided. If provided to the user, not only the area where specific audio content is provided can be managed more finely, but also the content provider or the content provider can set the range of the area where the specific audio content is provided.
  • both the person providing the content and the person receiving the content can be more convenient and the satisfaction can be improved.
  • Advertisements are paid by advertisers and targeted through a variety of audiences, either through human or non-human media, where advertising to a specific target audience is more effective. Therefore, the use of direct advertising or database marketing is increasing recently. Thus, if such content is advertising content, such utility becomes even higher.
  • the advertiser can more accurately target potential customers and provide specialized advertising. As a result, more effective marketing and customer induction effects can be expected.
  • the user From the perspective of the user (user) who is provided with the content, the user can be provided with more accurate and useful advertisements associated with the local information.
  • the revenue model can be newly configured by diversifying the sales strategy of the advertising slot (slot).
  • the advertising slot determines the selling price according to the effects of the advertising slot and is the main source of revenue for commercial broadcasters.
  • the advertisement including the video content or the graphic content may be variously configured by all of the screen configurations as well as the time slots.
  • the advertisement including only the audio content is limited in configuration.
  • all the same advertisements must be transmitted in the same carrier transmission area, so that the advertisement slots can be distributed only for each time.
  • ad slots can be constructed by dividing each region into uniform sizes, but depending on the characteristics of each region, it can be classified more granularly in areas with a large flow population and more widely in areas with a small flow population. Can be.
  • broadcasters can create more diverse profit models, and advertisers can expect higher advertising effects.
  • Positioning technology may be used to obtain a location of a user terminal providing audio content to a user.
  • Positioning technology is a positioning method using proximity, a positioning method using triangulation and a positioning method using scene analysis according to the basic principle of location recognition, and the terminal according to the location signal receiving subject MS based positioning method, network based positioning method, and hybrid positioning method can be distinguished.
  • it can be classified into indoor positioning and outdoor positioning according to the position of the positioning target, and can be classified into global position positioning and near position positioning according to the position positioning range.
  • Wide-area positioning technology typically includes cell-based positioning method and global positioning system (GPS), and short-range positioning technology includes wireless LAN, Bluetooth, ZigBee, and UWB (Ultra WideBand). ) How to use techniques.
  • GPS global positioning system
  • short-range positioning technology includes wireless LAN, Bluetooth, ZigBee, and UWB (Ultra WideBand).
  • the cell-based positioning method is a positioning technology based on a base station signal of a mobile communication company, a cell ID method using an ID (ID, IDentification) of the base station, and an enhanced cell ID method using additional distance information of the terminal in addition to the cell ID method.
  • a triangulation method using a plurality of base station signals there are an Angle of Arrival (AOA), a Time of Arrival (TOA), and a Time Difference of Arrival (TDOA) method.
  • AOA Angle of Arrival
  • TOA Time of Arrival
  • TDOA Time Difference of Arrival
  • Such a cell-based positioning method has the advantage that it is possible to receive positioning signals even in urban environments or indoors where GPS and Line of Sight (GOS) are not guaranteed, but a GPS range is not guaranteed. Is large.
  • the GPS positioning method is a satellite signal-based positioning method, and measures the position of a receiving terminal using a time difference of arrival of signals received from four or more GPS satellites among 24 GPS satellites. It is known to have an error range of about 10 meters to 100 meters.
  • GPS Globalstar Satellite Navigation
  • DGPS Different GPS
  • DGPS technology corrects the position by receiving satellite position measurement errors from a reference station within 100 kilometers of which the exact position is known, and can maintain an error range of approximately 5 meters to 10 meters.
  • a mobile terminal such as a smart phone is equipped with a GPS module, which is used as the most common positioning method, but in order to receive satellite signals, the LOS between the satellite and the transmitting terminal must be guaranteed.
  • WLAN-based positioning techniques In the case of wireless LAN (WLAN) signal-based positioning techniques, WLAN-based positioning techniques generally determine and position propagation delays between wireless nodes to triangulate and position relative positions.
  • WLAN-based location technology is an indoor positioning technology, and has an advantage of being cost effective since it can use an existing WLAN infrastructure.
  • the positioning technology using near field communication such as Bluetooth, Zigbee, RFID, WLAN, UWB, etc. has much smaller coverage than the mobile communication network or GPS, thereby improving the accuracy of indoor positioning.
  • Positioning technology using WLAN can obtain the position of the receiver using algorithms such as TOA, AOA, TDOA or Received Signal Strength (RSS).
  • WLAN Wireless Local Area Network
  • RSS Received Signal Strength
  • RADAR is a radio frequency (RF) based system for recognizing and tracking a user's location in a building based on 802.11 wireless networking technology.
  • RADAR applies a scene analysis method for estimating the location by comparing the characteristics of the currently received signal with a database storing the properties of the WLAN signal for each location.
  • the result of the triangulation method for the signal strength of the WLAN has an average distance error of 4.3 meters, and when using the scene analysis method has an average distance error of about 3 meters.
  • Radio Frequency IDentification RFID
  • RFID Radio Frequency IDentification
  • FIG. 5 is a flowchart of a method for providing content according to an embodiment of the present invention.
  • FIG. 5 discloses a method in which an audio content provider generates and transmits audio content as an audio object.
  • the audio content providing apparatus When audio content to be provided differently is determined according to each region, the audio content providing apparatus generates an audio object based on the audio content (510).
  • the audio object is a specific object constituting the audio signal, for example, dialogue, effect sound and background music constituting the audio sound of each instrument or movie constituting the music. Or, it may mean each advertisement content.
  • object-based signal processing and encoding techniques each audio object constituting the scene of the audio content can be extracted and reconstructed as needed.
  • the content area information includes information about the area where each audio object is to be provided.
  • information about 'region A' to be provided with 'audio object A' includes information such as reference point and radius of 'region A'.
  • the content area information may be composed of each content area information for each audio object.
  • the content area information may be composed of one content area information that collects and manages a plurality of audio objects.
  • the content area information may be included in metadata of the audio object.
  • the audio object and the content region information generated through steps 510 and 520 are encoded to generate a bitstream (530). Generating the audio object into the bitstream is performed at the encoder end of the audio codec.
  • the N audio objects are downmixed into M signals satisfying M ⁇ N, and a bitstream in which such downmix signals are encoded is generated.
  • the downmix signal is a mono or stereo signal, but is not limited thereto.
  • the spatial parameter is also encoded into the bitstream. Such spatial information is included in the audio object metadata or parameter.
  • the user terminal or the transmitting terminal may encode system data related to a system engine such as MPEG Media Transport (MMT) and / or Dynamic Adaptive Streaming over HTTP (DASH).
  • system data may be included in the audio content or provided as additional information separate from the audio content.
  • the system data may be used to provide an interface for interaction with the system engine at the receiving terminal or decoder.
  • the transmitting terminal encodes the system data
  • the location information where the system data is located for example, the URL address, may be encoded and included in the bitstream.
  • the encoded bitstream is transmitted 540 over the network.
  • the network includes a dedicated line, LAN, VAN, intranet, private telephone network, public telephone network, PSTN network, and a combination thereof, and is a comprehensive data communication network that enables each network component to communicate smoothly with each other.
  • Wired internet Wired internet
  • wireless internet wireless internet
  • mobile wireless networks Wired internet
  • the transmission method follows a transmission method unique to each network.
  • FIG. 6 is a detailed block diagram of an apparatus for providing audio content according to an embodiment of the present invention.
  • the apparatus 600 for providing audio content according to an embodiment of the present invention shown in FIG. 6 operates according to the method for providing audio content according to an embodiment of the present invention shown in FIG.
  • the channel audio object generator 610, the content location information generator 620, the bitstream generator 630, and the transmitter 640 are included.
  • the audio object generator 610 generates an audio object based on content to be provided using an audio object among contents provided by the content providing device.
  • An audio object is a specific object constituting an audio signal, and means, for example, dialogue, effect sounds, and background music constituting an audio sound of each instrument or movie constituting music.
  • object-based signal processing and encoding techniques By applying object-based signal processing and encoding techniques, each audio object constituting the scene of the audio content can be extracted and reconstructed as needed.
  • advertisement content may be provided through an audio object.
  • the content location information generator 620 generates information about a location where a content generated as an audio object is to be provided.
  • the content area information includes information about the area where each audio object is to be provided.
  • information about 'region A' to be provided with 'audio object A' includes information such as reference point and radius of 'region A'.
  • the content area information may be composed of each content area information for each audio object.
  • the content area information may be composed of one content area information that integrates and manages a plurality of audio objects.
  • the content area information may be included in metadata of the audio object.
  • the bitstream generator 630 generates a bitstream by encoding the audio object and the content region information. Generating the audio object into the bitstream is performed at the encoder end of the audio codec.
  • the N audio objects are downmixed into M signals satisfying M ⁇ N, and a bitstream in which such downmix signals are encoded is generated.
  • the downmix signal is a mono or stereo signal, but is not limited thereto.
  • the user terminal or the transmitting terminal may encode system data related to a system engine such as MPEG Media Transport (MMT) and / or Dynamic Adaptive Streaming over HTTP (DASH).
  • system data may be included in the audio content or provided as additional information separate from the audio content.
  • the system data may be used to provide an interface for interaction with the system engine at the receiving terminal or decoder.
  • the transmitting terminal encodes the system data
  • the location information where the system data is located for example, the URL address, may be encoded and included in the bitstream.
  • the transmitter 640 transmits the bitstream generated by the bitstream generator through the network.
  • a network is a comprehensive data communication network that includes dedicated lines, LANs, VANs, intranets, private telephone networks, public telephone networks, PSTN networks, and a combination of these, and enables each network component to communicate with each other smoothly.
  • Internet wireless internet and mobile wireless communications networks.
  • the transmission method follows a transmission method unique to each network.
  • audio content according to an embodiment of the present invention is provided through a broadcast medium such as a radio broadcast
  • the content provider broadcasts a broadcast signal for an unspecified number
  • the user terminal transmits a specific carrier frequency band signal according to the content provider.
  • Receives a corresponding broadcast signal by connecting to a specific server according to the content provider.
  • the apparatus 600 for providing audio content may further include a channel audio signal generator, a storage unit, and a controller in addition to the above configuration.
  • the channel audio signal is distinguished from the audio object and means a general audio signal rather than an object signal.
  • FIG. 7 is a flowchart of a method for providing content according to an embodiment of the present invention.
  • FIG. 7 discloses a method for providing audio content to a user by an apparatus for providing audio content. More specifically, a method for receiving an encoded audio object and providing audio content to a user in a terminal capable of playing audio content is disclosed.
  • the apparatus for providing audio content receives 710 a bitstream of an encoded audio signal transmitted through a network.
  • audio content according to an embodiment of the present invention is provided through a broadcast medium such as a radio broadcast
  • the content provider broadcasts a broadcast signal for an unspecified number
  • the user terminal transmits a specific carrier frequency band signal according to the content provider.
  • Receives a corresponding broadcast signal by connecting to a specific server according to the content provider.
  • the user terminal Upon receiving (710) a bitstream of the encoded audio signal, the user terminal decodes the received bitstream to obtain an audio object (720).
  • Acquiring an audio object by decoding the received bitstream is a process of restoring the original plurality of audio objects as the received bitstream is a signal in which the plurality of audio objects are downmixed. At this time, metadata of the object signal is used to restore the object signal.
  • the user terminal also obtains 730 content area information from the received bitstream.
  • the content area information includes information about the area where each audio object is to be provided.
  • information about 'region A' to be provided with 'audio object A' includes information such as reference point and radius of 'region A'.
  • the content area information may be composed of each content area information for each audio object.
  • the content area information may be composed of one content area information that collects and manages a plurality of audio objects.
  • the content area information may be included in metadata of the audio object.
  • the user terminal or the receiving terminal may receive a bitstream including encoded system data and location information where the system data is located.
  • System data may be included in the encoded audio content, or may be included in the bitstream as additional information separate from the encoded audio content.
  • the receiving terminal may obtain location information related to system data by decoding the bitstream.
  • the receiving terminal can transmit the system data present in the URL address to the corresponding system engine by providing an interface capable of fetching the corresponding system data based on the location information, for example, the URL address.
  • the user terminal also obtains information about the location where the user terminal received the corresponding content (740).
  • Positioning technology is largely based on the basic principle of location recognition, positioning method using proximity, positioning method using triangulation and positioning method using scene analysis, terminal-based positioning method, network-based positioning method according to the location signal receiver and It can be distinguished by a hybrid positioning method.
  • it can be classified into indoor positioning and outdoor positioning according to the position of the positioning target, and can be classified into global position positioning and near position positioning according to the position positioning range.
  • Representative technologies for wide area positioning include cell-based positioning and GPS, and short-range positioning includes wireless LAN, Bluetooth, Zigbee, and UWB.
  • a mobile terminal such as a smartphone is equipped with a GPS module, which is used as the most common positioning method, and when the DGPS technology using additional information is applied, an error range of about 5 meters to 10 meters can be maintained.
  • the content receiving location may be obtained by using the location of the user terminal using a positioning technique, but according to another embodiment of the present invention, the content receiving location may be set by the user. For example, if the user is present in the 'B-zone' but wants to receive audio content provided in the 'A-zone', the user terminal sets the content reception position to the 'A-zone'.
  • the method of providing the audio content may include a step of receiving a user input in addition to the step of FIG. 7.
  • the user terminal selects 750 an audio object to play based on the content area information and the reception location.
  • the criteria for selecting an audio object may be determined by a user's setting or a content provider's setting.
  • each area can be divided into uniform sizes to form an ad slot, but according to the characteristics of each area, it can be classified more finely in areas with a large floating population. Areas with a small floating population can be further classified.
  • the advertising area can be set wider for the advertiser who paid more.
  • an advertisement of a company that is closest to the user's location among the same companies may be played.
  • the first and second companies may be played in the order based on the distance starting from the company located at the closest distance.
  • the user terminal plays 760 the selected audio object.
  • the mixer / renderer of the audio object decoder mixes / renders the audio object based on the rendering information to generate an output channel signal to be reproduced as an output channel.
  • FIG. 8 is a detailed configuration diagram of an apparatus for providing content according to an embodiment of the present invention.
  • the apparatus 800 for providing audio content according to an embodiment of the present invention shown in FIG. 8 operates according to the method for providing audio content according to an embodiment of the present invention illustrated in FIG. Includes a receiving unit 810, an audio object obtaining unit 820, a content area information obtaining unit 830, a receiving location obtaining unit 840, a selecting unit 850, and a reproducing unit 860.
  • An apparatus for providing audio content according to an embodiment of the present invention illustrated in FIG. 8 typically includes a user terminal capable of playing and providing audio content to a user, based on which one embodiment of the present invention illustrated in FIG. Explain the example.
  • the receiver 810 receives a bitstream encoded with an audio object transmitted through a network.
  • a network is a comprehensive data communication network that includes dedicated lines, LANs, VANs, intranets, private telephone networks, public telephone networks, PSTN networks, and a combination of these, and enables each network component to communicate with each other smoothly.
  • Internet wireless internet and mobile wireless communications networks.
  • the transmission method follows a transmission method unique to each network.
  • audio content according to an embodiment of the present invention is provided through a broadcast medium such as a radio broadcast
  • the content provider broadcasts a broadcast signal for an unspecified number
  • the user terminal transmits a specific carrier frequency band signal according to the content provider.
  • Receives a corresponding broadcast signal by connecting to a specific server according to the content provider.
  • the audio object obtainer 820 extracts data corresponding to the audio object from the bitstream received through the receiver 810, decodes the extracted data, and mixes / renders the downmixed signal again to obtain the audio object. .
  • An audio object is a specific object constituting an audio signal, and means, for example, dialogue, effect sounds, and background music constituting an audio sound of each instrument or movie constituting music.
  • object-based signal processing and encoding techniques By applying object-based signal processing and encoding techniques, each audio object constituting the scene of the audio content can be extracted and reconstructed as needed.
  • advertisement content may be provided through an audio object.
  • the content area information acquisition unit 830 extracts data corresponding to the content area information from the bitstream received through the reception unit 810, and obtains the content area information by decoding the extracted data.
  • the content area information includes information about the area where each audio object is to be provided.
  • information about 'region A' to be provided with 'audio object A' includes information such as reference point and radius of 'region A'.
  • the content area information may be composed of each content area information for each audio object.
  • the content area information may be composed of one content area information that integrates and manages a plurality of audio objects.
  • the content area information may be included in metadata of the audio object.
  • the user terminal or the receiving terminal may receive a bitstream including encoded system data and location information where the system data is located.
  • System data may be included in the encoded audio content, or may be included in the bitstream as additional information separate from the encoded audio content.
  • the receiving terminal may obtain location information related to system data by decoding the bitstream.
  • the receiving terminal can transmit the system data present in the URL address to the corresponding system engine by providing an interface capable of fetching the corresponding system data based on the location information, for example, the URL address.
  • the receiving location obtaining unit 840 obtains the location of the user terminal providing the audio content to the user by using the positioning technology.
  • Positioning technology is a positioning method using proximity, a positioning method using triangulation and a positioning method using scene analysis according to the basic principle of location recognition, and the terminal according to the location signal receiving subject MS based positioning method, network based positioning method, and hybrid positioning method can be distinguished.
  • it can be classified into indoor positioning and outdoor positioning according to the position of the positioning target, and can be classified into global position positioning and near position positioning according to the position positioning range.
  • Representative technologies for wide area positioning include cell-based positioning and GPS, and short-range positioning includes wireless LAN, Bluetooth, Zigbee, and UWB.
  • a mobile terminal such as a smartphone is equipped with a GPS module, which is used as the most common positioning method, and when the DGPS technology using additional information is applied, an error range of about 5 meters to 10 meters can be maintained.
  • the content receiving location may be obtained by using the location of the user terminal using a positioning technique, but according to another embodiment of the present invention, the content receiving location may be set by the user. For example, if the user is present in the 'B-zone' but wants to receive audio content provided in the 'A-zone', the user terminal sets the content reception position to the 'A-zone'.
  • the apparatus for providing audio content may further include a user input unit for receiving user input in addition to the configuration of FIG. 8.
  • the selector 850 selects an audio object to be provided to the user among the received audio objects based on the acquired content region information and the acquired location of the user terminal.
  • the criteria for selecting an audio object may be determined by a user's setting or a content provider's setting.
  • each area can be divided into uniform sizes to form an ad slot, but according to the characteristics of each area, it can be classified more finely in areas with a large floating population. Areas with a small floating population can be further classified.
  • the advertising area can be set wider for the advertiser who paid more.
  • an advertisement of a company that is closest to the user's location among the same companies may be played.
  • the first and second companies may be played in the order based on the distance starting from the company located at the closest distance.
  • the selector 850 may select an audio object corresponding to the advertisement domain having the closest distance by comparing the distance between the reference point of each advertisement domain and the user terminal.
  • the reference point of each advertising domain may be the center of each advertising domain. According to another embodiment of the present invention, the reference point of each advertising domain may be the largest floating population in each advertising domain. According to another embodiment of the present invention, the reference point of each advertising area may be a terrain, a feature that is a landmark in each advertising area.
  • the playback unit 860 plays the audio object selected by the selection unit 850.
  • the mixer / renderer of the audio object decoder mixes / renders the audio object based on the rendering information to generate an output channel signal to be reproduced as an output channel.
  • the apparatus 800 for providing audio content may further include a channel audio signal acquisition unit, a storage unit, and a controller in addition to the above configuration.
  • the channel audio signal is distinguished from the audio object and means a general audio signal rather than an object signal.
  • FIG. 9 is a detailed configuration diagram of an apparatus for providing content according to another embodiment of the present invention.
  • An apparatus for providing audio content according to an embodiment of the present invention illustrated in FIG. 9 may be a partial configuration of an apparatus for providing audio content illustrated in FIG. 8, and in FIG. 9, operations and transmission / reception data of each component are illustrated. It is.
  • the audio object decoder 821 of the apparatus for providing audio content transmits the content location information included in the metadata of the audio object to the audio object selector 841.
  • the audio object selecting unit 841 selects an audio object based on the content position information delivered from the audio object decoder 821 and the content receiving position delivered from the receiving position obtaining unit 831, and outputs the audio object selection result again. Pass it to the object decoder.
  • the audio object decoder 821 activates the selected audio object based on the audio object selection result delivered from the audio object selecting unit 841, and deactivates the remaining unselected audio objects.
  • the audio object decoder 821 transfers the selected audio object to the playback unit 851, and the playback unit 851 plays the selected audio object and provides the same to the user.
  • FIG. 10 is content region information according to an embodiment of the present invention and indicates syntax of mae_ContentData when content region information is included in an MPEG-H MAE.
  • MPEG-H is the next generation codec for efficient digital broadcasting in broadband networks. Part 3 of MPEG-H addresses the next generation of audio compression standards. In MPEG-H, technology for processing audio sources constituting each audio scene as audio objects for 3D audio is included.
  • Metadata for processing such an audio object is managed as metadata for each group of elements, which corresponds to a MetE Audio Element (MAE).
  • MAE MetE Audio Element
  • FIG. 10 illustrates a syntax 1000 of MAE mae_ContentData () for implementing a method of providing audio content according to an embodiment of the present invention, and mae_hasContentLocation 1010 is added to implement an embodiment of the present invention.
  • MAE mae_ContentData for implementing a method of providing audio content according to an embodiment of the present invention
  • mae_hasContentLocation 1010 is added to implement an embodiment of the present invention.
  • the mae_hasContentLocation 1010 may include latitude information (mae_contentLatitude) and longitude information (mae_contentLongitude) as local information of the content data for each content data block.
  • FIG. 11 is content region information according to an embodiment of the present invention and indicates a mae_ContentKind field when content region information is included in an MPEG-H MAE.
  • the mae_ContentKind field 1100 indicates a content type of a metadata element group. If the value of the mae_ContentKind field is 2, it indicates that the corresponding content is dialogue. If 3, the content is music.
  • the advertisement content specialized for a specific region may be provided.
  • the mae_hasContentLocation 1010 field may be defined for the corresponding kind of content to use the region information.
  • FIG. 12 is a diagram for describing a method of providing content based on a location of a user, according to an embodiment of the present invention.
  • FIG. 12 is a map of the Las Vegas region of the United States, where the regions shown in the map of FIG. 12 include the Wynn Las Vegas Hotel, The Venetian Hotel and The Palazzo Hotel. If the Wynn Las Vegas hotel and The Venetian hotel both advertise on the radio, let us call the advertising area of each hotel area A advertising area (1210) and B advertising area (1220).
  • a listener receiving a radio broadcast in the area is traveling by car along South Las Vegas Boulevard and only one advertisement for one hotel is to be provided in one time slot, the hotel closer to the listener To provide advertising.
  • the advertisement domain may be set wider for the advertiser who has paid more. For example, if the Wynn Las Vegas hotel paid more advertising, set the advertising area (510, A advertising area) of the Wynn Las Vegas hotel wider.
  • the advertisements of the two hotels may be provided in chronological order according to the business model of the broadcasting company, but the advertisements of the closer hotels may be provided first.
  • the user first listens to the advertisement of the Wynn Las Vegas hotel when in the advertising area 1210 and the user listens to the advertisement of the The Venetian hotel when the user is in the advertising area B1220.
  • FIG. 13 is a diagram for describing a method of providing content based on a location of a user, according to an exemplary embodiment.
  • FIG. 13 is a map of Seoul Gangnam Station, in which a plurality of stores of the same A coffee shop chain exist in the area indicated on the map. In the region shown in the map of FIG. 13, there are four coffee shop stores, Coffee Shop 1 (1310), Coffee Shop 2 (1320), Coffee Shop 3 (1330), and Coffee Shop 4 (1340).
  • the area 1300 displayed on the map of FIG. 13 is the same advertisement area, and the advertisement of A coffee shop is provided in the corresponding advertisement area.
  • a user in the area shown on the map of FIG. 13 receives and is provided with an advertisement of A coffee shop.
  • the user who received the advertisements has a menu which the store is close to or wants to drink coffee of A coffee shop. You don't know where a store is.
  • the user-specific content may be provided even within one advertisement domain. More specifically, according to an embodiment of the present invention, even if it is in one advertisement area 1300 where an advertisement of A coffee shop is provided, since information about the user's location is known, the nearest A to the user's location is known. Ads including information such as the location of the coffee shop store, a phone number and a menu may be provided.
  • an advertisement including information about the A coffee shop store closest to the user's location may be provided after the advertisement of the A coffee shop provided by the head office.
  • Embodiments according to the present invention described above can be implemented in the form of program instructions that can be executed by various computer components and recorded in a computer-readable recording medium.
  • the computer-readable recording medium may include program instructions, data files, data structures, etc. alone or in combination.
  • Program instructions recorded on the computer-readable recording medium may be specially designed and configured for the present invention, or may be known and available to those skilled in the computer software arts.
  • Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks and magnetic tape, optical recording media such as CD-ROMs and DVDs, and magneto-optical media such as floptical disks. medium) and hardware devices specifically configured to store and execute program instructions, such as ROM, RAM, flash memory, and the like.
  • Examples of program instructions include not only machine code generated by a compiler, but also high-level language code that can be executed by a computer using an interpreter or the like.
  • the hardware device may be modified with one or more software modules to perform the processing according to the present invention, and vice versa.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

La présente invention concerne, dans un mode de réalisation, un procédé destiné à résoudre le problème technique de la fourniture d'un contenu audio et comportant les étapes consistant à: coder un ou plusieurs objets audio pour chaque contenu parmi un ou plusieurs contenus audio; coder des informations de régions d'objets comprenant des informations de régions sur un ou plusieurs objets audio; et émettre un ou plusieurs flux binaires générés du fait du codage du ou des objets audio et des informations de régions d'objets.
PCT/KR2015/005488 2014-05-30 2015-06-01 Procédé, appareil, et support d'enregistrement lisible par ordinateur pour fournir un contenu audio en utilisant un objet audio Ceased WO2015183060A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201462005050P 2014-05-30 2014-05-30
US62/005,050 2014-05-30

Publications (1)

Publication Number Publication Date
WO2015183060A1 true WO2015183060A1 (fr) 2015-12-03

Family

ID=54699317

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2015/005488 Ceased WO2015183060A1 (fr) 2014-05-30 2015-06-01 Procédé, appareil, et support d'enregistrement lisible par ordinateur pour fournir un contenu audio en utilisant un objet audio

Country Status (1)

Country Link
WO (1) WO2015183060A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240114202A1 (en) * 2014-09-30 2024-04-04 Sony Group Corporation Transmission apparatus, transmission method, reception apparatus and reception method for transmitting a plurality of types of audio data items

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110100544A (ko) * 2010-03-04 2011-09-14 오큐브 주식회사 위치연동 컨텐츠 제공 시스템, 위치연동 컨텐츠 이용 단말기 및 위치연동 컨텐츠의 이용방법
KR20120084314A (ko) * 2009-11-20 2012-07-27 돌비 인터네셔널 에이비 다운믹스 신호 표현에 기초하여 업믹스 신호 표현을 제공하기 위한 장치, 다중 채널 오디오 신호를 표현하는 비트스트림을 제공하기 위한 장치, 선형 결합 파라미터를 이용하여 다중 채널 오디오 신호를 표현하는 방법, 컴퓨터 프로그램 및 비트스트림
KR20130117868A (ko) * 2011-04-13 2013-10-28 엠파이어 테크놀로지 디벨롭먼트 엘엘씨 동적 광고 컨텐츠 선택
KR20130137367A (ko) * 2012-06-07 2013-12-17 (주)엔써즈 이미지 기반 도서 관련 서비스 제공 시스템 및 방법
US20140023196A1 (en) * 2012-07-20 2014-01-23 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20120084314A (ko) * 2009-11-20 2012-07-27 돌비 인터네셔널 에이비 다운믹스 신호 표현에 기초하여 업믹스 신호 표현을 제공하기 위한 장치, 다중 채널 오디오 신호를 표현하는 비트스트림을 제공하기 위한 장치, 선형 결합 파라미터를 이용하여 다중 채널 오디오 신호를 표현하는 방법, 컴퓨터 프로그램 및 비트스트림
KR20110100544A (ko) * 2010-03-04 2011-09-14 오큐브 주식회사 위치연동 컨텐츠 제공 시스템, 위치연동 컨텐츠 이용 단말기 및 위치연동 컨텐츠의 이용방법
KR20130117868A (ko) * 2011-04-13 2013-10-28 엠파이어 테크놀로지 디벨롭먼트 엘엘씨 동적 광고 컨텐츠 선택
KR20130137367A (ko) * 2012-06-07 2013-12-17 (주)엔써즈 이미지 기반 도서 관련 서비스 제공 시스템 및 방법
US20140023196A1 (en) * 2012-07-20 2014-01-23 Qualcomm Incorporated Scalable downmix design with feedback for object-based surround codec

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240114202A1 (en) * 2014-09-30 2024-04-04 Sony Group Corporation Transmission apparatus, transmission method, reception apparatus and reception method for transmitting a plurality of types of audio data items

Similar Documents

Publication Publication Date Title
KR101512354B1 (ko) 정보 제공 시스템, 휴대용 단말 장치, 서버 및 프로그램
US6577716B1 (en) Internet radio system with selective replacement capability
KR102179321B1 (ko) 두 개의 전자 디바이스에서 재생을 동기화하는 방법 및 장치
WO2010114321A1 (fr) Procédé de mesure d'audience d'un service et d'un contenu de diffusion au niveau d'un terminal
US20070143788A1 (en) Method, apparatus, and program product for providing local information in a digital video stream
JP2009027722A (ja) 通信システム
KR20190077599A (ko) 멀티 매칭 검출 및 위치 기반의 명확화를 통한 미디어 채널 식별
CN103024601B (zh) 信息采集系统
JP2012227921A (ja) 情報提供システム、識別情報解決サーバおよび携帯端末装置
WO2014171759A1 (fr) Système pour fournir une publicité personnalisée pour l'utilisateur sur la base d'un signal son délivré par un téléviseur, procédé pour fournir une publicité personnalisée pour l'utilisateur, et support d'enregistrement lisible par ordinateur pour enregistrer un programme de service mim
EP2497264A2 (fr) Appareil d'affichage pour supporter un service de recherche, terminal utilisateur pour réaliser la recherche d'objet, et procédés connexes
JP4883046B2 (ja) 視聴確認システム
WO2021118107A1 (fr) Appareil de sortie audio et procédé de commande de celui-ci
WO2012020905A1 (fr) Système et procédé pour la fourniture de contenu dans un environnement cpns
WO2015183060A1 (fr) Procédé, appareil, et support d'enregistrement lisible par ordinateur pour fournir un contenu audio en utilisant un objet audio
USRE47819E1 (en) Wired internet network system for the internet video streams of radio stations
CN104349181A (zh) 音频标签应用系统和方法
WO2014109472A1 (fr) Système de localisation et procédé associé
US20050076219A1 (en) Transmitting device and method
KR100562997B1 (ko) 멀티미디어 방송 서비스의 수신 품질 측정 방법 및 장치
JP6744090B2 (ja) 情報表示システム
CN105574746A (zh) 一种基于图像分析方法的广告监控方法
WO2013089487A1 (fr) Appareil de fourniture de publicités et procédé de fourniture de publicités
US12457051B2 (en) Radio head unit with dynamically updated tunable channel listing
JP5584506B2 (ja) 情報処理装置、情報処理方法及びプログラム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15799345

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15799345

Country of ref document: EP

Kind code of ref document: A1