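WO2023127422A1 - Information processing device, information processing method, program, and information processing system - Google Patents
Information processing device, information processing method, program, and information processing system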
- Publication number
- WO2023127422A1 (PCT/JP2022/044839)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- content
- music
- information processing
- trigger
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10K—SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
- G10K15/00—Acoustics not otherwise provided for
- G10K15/04—Sound-producing devices
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/165—Management of the audio stream, e.g. setting of volume, audio stream path
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/102—Programmed access in sequence to addressed parts of tracks of operating record carriers
- G11B27/105—Programmed access in sequence to addressed parts of tracks of operating record carriers of operating discs
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/11—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information not detectable on the record carrier
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/19—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
- G11B27/28—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
- G11B27/32—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording on separate auxiliary tracks of the same or an auxiliary record carrier
- G11B27/322—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording on separate auxiliary tracks of the same or an auxiliary record carrier used signal is digitally coded
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/34—Indicating arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W4/00—Services specially adapted for wireless communication networks; Facilities therefor
- H04W4/02—Services making use of location information
- H04W4/029—Location-based management or tracking services
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2240/00—Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
- G10H2240/121—Musical libraries, i.e. musical databases indexed by musical parameters, wavetables, indexing schemes using musical parameters, musical rule bases or knowledge bases, e.g. for automatic composing methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2240/00—Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
- G10H2240/121—Musical libraries, i.e. musical databases indexed by musical parameters, wavetables, indexing schemes using musical parameters, musical rule bases or knowledge bases, e.g. for automatic composing methods
- G10H2240/131—Library retrieval, i.e. searching a database or selecting a specific musical piece, segment, pattern, rule or parameter set
Definitions
- the present technology relates to an information processing device, an information processing method, a program, and an information processing system, and in particular to an information processing device, an information processing method, a program, and an information processing system capable of realizing a content viewing experience that maintains the worldview intended by an artist without boring the user.
- Patent Document 1 describes a technique for generating parameters for creating music based on information from each part in the automobile and generating musical score data based on the generated parameters.
- the experience of listening to music using information obtained from the vehicle body, as disclosed in Patent Document 1, is a passive experience. The user easily tires of the music because unexpected changes occur in it. Also, the musical experience intended by the artist cannot be fully realized.
- This technology has been developed in view of this situation, and it is intended to realize a content viewing experience that maintains the world view intended by the artist without boring the user.
- An information processing apparatus includes a content information setting unit that sets content information including content parameter information that defines details of content reproduction control, a trigger information setting unit that sets trigger information that defines a condition for executing the content reproduction control, and a content generation parameter setting unit that sets a content generation parameter by linking the content information and the trigger information.
- An information processing method or program includes the steps of setting content information including content parameter information that defines details of content reproduction control, setting trigger information that defines conditions for executing the content reproduction control, and setting content generation parameters by linking the content information and the trigger information.
- In this configuration, content information including content parameter information that defines details of content reproduction control is set, trigger information that defines conditions for executing the content reproduction control is set, and a content generation parameter is set by linking the content information and the trigger information.
- An information processing apparatus includes an event acquisition unit that acquires an event when a user moves; an acquisition unit that acquires content generation parameters set by linking content information, including content parameter information that defines details of content playback control, with trigger information that defines conditions for executing the content playback control; and a content reproduction control unit that, when the event satisfies the conditions defined by the trigger information, controls reproduction of the content based on the content information linked with the trigger information.
- An information processing method or program acquires an event when a user moves; acquires the content generation parameters set by linking content information, including content parameter information that defines details of content playback control, with trigger information that defines conditions for executing the content playback control; and, when the event satisfies the conditions defined by the trigger information, controls reproduction of the content based on the content information linked with the trigger information.
- In this configuration, an event when the user moves is acquired, the content generation parameters linking the content information and the trigger information are acquired, and reproduction of the content is controlled based on the content information linked with the trigger information whose conditions the event satisfies.
- FIG. 1 is a diagram illustrating an example of content reproduction control on the user side.
- FIG. 2 is a diagram showing an example of trigger information set in advance by a producer and content reproduction control.
- FIG. 3 is a diagram showing an example of playback control using an additional track.
- FIG. 4 is a diagram showing an example of components of content.
- FIG. 5 is a diagram showing a configuration example of an information processing system according to an embodiment of the present technology.
- FIG. 6 is a block diagram showing a hardware configuration example of a producer terminal.
- FIG. 3 is a block diagram showing a hardware configuration example of a user terminal;
- FIG. 1 is a block diagram showing a functional configuration example of an information processing system;
- FIG. FIG. 4 is a diagram showing the data structure of content generation parameters;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a setting screen displayed on the creator's terminal;
- FIG. 10 is a diagram showing an example of a product in a series of operations performed by the creator using the setting screen;
- 4 is a flow chart showing the operation of the creator's terminal;
- FIG. 3 is a diagram showing an example of a reproduction space in which content is output;
- FIG. 4 is a diagram showing a specific example in which content reproduction control is performed on the user side;
- FIG. 4 is a diagram showing a specific example in which content reproduction control is performed on the user side;
- FIG. 4 is a diagram showing a specific example in which content reproduction control is performed on the user side;
- FIG. 4 is a diagram showing a specific example in which content reproduction control is performed on the user side;
- FIG. 4 is a diagram showing a specific example in which content reproduction control is performed on the user side;
- FIG. 4 is a diagram showing a specific example in which content reproduction control is performed on the user side;
- FIG. 4 is a diagram showing a specific example in which content reproduction control is performed on the user side;
- 4 is a flowchart when content reproduction control is performed on the user side.
- FIG. 4 is a diagram showing an example of tracks that are subject to playback control;
- FIG. 4 is a diagram showing an example of playback control for tracks generated by sound source separation;
- FIG. 4 is a diagram showing types of sound sources; It is a block diagram which shows the structural example of a music control part. It is a figure which shows the example of each information.
- FIG. 10 is a diagram showing a specific example of remixing an existing song;
- FIG. 10 is a diagram showing an example of a process of connecting existing songs;
- FIG. 3 is a diagram showing an example of tracks that constitute existing music;
- FIG. 10 is a diagram showing another example of the process of connecting existing songs;
- FIG. 10 is a diagram showing an example of a production workflow when sound source separation is used;
- FIG. 4 is a diagram showing a configuration example of an information processing server when sound source separation is performed on the cloud side; It is a figure which shows the example of object recognition.
- FIG. 10 is a diagram showing a specific example of remixing an existing song
- FIG. 10 is a diagram showing an example of a process of connecting existing songs
- FIG. 3 is a diagram showing an example of tracks that constitute existing music
- FIG. 4 is a diagram showing an example of music generation
- FIG. 10 is a diagram showing another example of object recognition
- FIG. 10 is a diagram showing another example of music generation
- FIG. 4 is a diagram showing an image of music generation according to the position of an object
- FIG. 10 is a diagram showing a display example of a user terminal during music reproduction
- FIG. 5 is a diagram showing an example of luminance detection positions
- FIG. 10 is a diagram showing an example of state changes of effect images
- FIG. 4 is a diagram showing an example of music generation
- FIG. 10 is a diagram showing a display example of a map used for area setting;
- the information processing system of this technology is mainly realized by the configuration on the side of creators such as artists and the configuration on the side of users who drive cars.
- the content generation parameter is information that associates an algorithm that defines details related to content reproduction control with trigger information that defines trigger conditions for executing content reproduction control.
- the content generation parameters may be stored in a server on the network and provided to the user via the server, or may be stored in the local terminal.
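The linkage described above, in which a content generation parameter associates trigger information with the details of playback control, can be sketched as a small data structure. This is only an illustrative sketch: the class names, field names, and action strings are hypothetical and do not come from the patent text, and JSON is assumed here simply as one possible way to store the parameters on a server or on a local terminal.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class TriggerInfo:
    """Condition for executing playback control (hypothetical fields)."""
    kind: str         # e.g. "landmark_near", "turn", "stop"
    target: str = ""  # e.g. a landmark or intersection name; empty if unused


@dataclass
class ContentInfo:
    """Details of the playback control to run (hypothetical fields)."""
    action: str                                 # e.g. "add_reverb"
    params: dict = field(default_factory=dict)  # free-form control parameters


@dataclass
class ContentGenerationParameter:
    """One trigger linked to one piece of content information."""
    trigger: TriggerInfo
    content: ContentInfo


# Three example links, loosely following the playback controls in the text.
parameters = [
    ContentGenerationParameter(
        TriggerInfo("landmark_near", "Tokyo Tower"),
        ContentInfo("add_reverb", {"target": "vocal"}),
    ),
    ContentGenerationParameter(
        TriggerInfo("turn", "specific intersection"),
        ContentInfo("add_track", {"track": "synthesizer"}),
    ),
    ContentGenerationParameter(
        TriggerInfo("stop"),
        ContentInfo("duck_volume", {"ramp_up_after_start": True}),
    ),
]

# Assumed storage format: plain JSON, usable on a server or a local terminal.
serialized = json.dumps([asdict(p) for p in parameters])
```

Keeping the trigger and the content information as separate records means the same playback control can be linked to several triggers without duplication.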
- the content whose reproduction is controlled based on the above-mentioned content generation parameters and the position information and vehicle body information of the car the user is driving is output to the in-vehicle space and presented to the user.
- the content is reproduced so that the output content dynamically changes according to the user's driving conditions and the surrounding environment.
- the following playback control is performed based on content generation parameters, vehicle position information, and vehicle body information: (1) apply reverb to the vocals when approaching Tokyo Tower; (2) add a synthesizer sound to the content being played when turning right or left at a specific intersection; (3) decrease the volume of the content being played when stopped, and gradually increase the volume after starting.
- trigger information includes, for example, area information indicating the start and end of an area set in advance by the creator, landmark information indicating landmarks such as Tokyo Tower and Rainbow Bridge, and POI (Point of Interest) information indicating specific points such as particular intersections, tunnels, and bridges.
- POI Point of Interest
- These pieces of information are set in advance by the creator as conditions that trigger the execution of content playback control. For example, conditions such as when the area is switched, when approaching the vicinity of a landmark, when turning left or right at an intersection designated as a specific point, and the like are set as triggers.
- the location information and body information of the vehicle driven by the user are used as described above.
- the vehicle body information is, for example, information indicating start/stop, acceleration/deceleration, right/left turn, and speed of the vehicle. If the user is not on a moving object, the user's own position information, walking speed, or the like may be used.
- This information is actually measured on the user side using various sensors, such as sensors installed in the vehicle driven by the user and the acceleration and gyro sensors of a smartphone placed in the vehicle, and is hereinafter referred to as an event.
- Content whose playback has been controlled based on the content generation parameters preset by the creator is output on the user side.
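To make the notion of an event concrete, the sketch below derives "stop" and "start" events from a series of speed samples and looks up the playback control linked to each one. The function names, the speed threshold, and the action strings are assumptions made for illustration; the patent text does not specify how events are derived from sensor readings.

```python
def motion_events(speeds_kmh, stop_threshold=1.0):
    """Return (sample_index, kind) pairs for stop/start transitions.

    Assumption: the vehicle counts as stopped when its speed is at or
    below stop_threshold (km/h).
    """
    events = []
    if not speeds_kmh:
        return events
    moving = speeds_kmh[0] > stop_threshold
    for index, speed in enumerate(speeds_kmh[1:], start=1):
        now_moving = speed > stop_threshold
        if moving and not now_moving:
            events.append((index, "stop"))
        elif not moving and now_moving:
            events.append((index, "start"))
        moving = now_moving
    return events


def controls_for(events, trigger_to_action):
    """Look up the playback control linked to each event kind, if any."""
    return [
        (index, trigger_to_action[kind])
        for index, kind in events
        if kind in trigger_to_action
    ]


# Hypothetical links between trigger kinds and playback controls.
trigger_to_action = {"stop": "decrease_volume", "start": "ramp_volume_up"}
events = motion_events([30, 10, 0, 0, 5, 20])
actions = controls_for(events, trigger_to_action)
```

Events with no linked trigger are simply ignored, so playback is only changed where the creator has set a trigger.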
- the user may control the content generation parameters as long as they do not deviate from the creator's settings and intentions.
- the control of the content generation parameter on the user side is, for example, a process of changing the content generation parameter set in advance by the creator side.
- Video data (moving image data) may be used as content without being limited to music data, or data including music data and video data may be used as content.
- the content may be, for example, audio data such as environmental sounds or radio, or may be data other than those described above.
- the audio data includes data of sounds different from music (natural sounds such as the sound of waves, the sound of rain, and the sound of a stream, sound effects, human voices, guidance sounds, mechanical sounds, and so on).
- Music data consists of a time series of one or more parts characterized by melody, rhythm, harmony, key, etc.
- a piece of music data can contain a plurality of the same parts.
- a part can include repetition of a predetermined pattern or phrase by some or all of the sounds (elements) that make up the part.
- music data playback control using this technology is performed in the interior space of the car driven by the user.
- the present technology can also be applied when enjoying a music listening experience on a moving object other than a car, such as a ship, an airplane, a train, or a bicycle. Note that the present technology can be applied not only when the user is on a moving object, but also when the user is walking.
- a smartphone is installed in the vehicle.
- Events such as the vehicle decelerating or stopping, starting or accelerating, or turning right or left, as well as the current position, specific points, and landmarks, are detected; playback control is performed based on these events and the content generation parameters; and the playback sound (music) of the playback-controlled music data is output to the vehicle interior space.
- GPS Global Positioning System
- if the same playback control is performed both the first time and the second time the user passes near a landmark, the user is considered highly likely to become bored.
- different reproduction control may be performed for the first time and the second time.
- Such settings can also be set in advance by the producer.
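One simple way to vary playback control between the first and second pass is to count passes per landmark and pick a variant by count. The sketch below assumes this counting approach and uses hypothetical variant names; the patent text only states that different control may be set for each pass.

```python
from collections import Counter


def control_for_pass(landmark, variants, pass_counts):
    """Pick the playback control for the current pass of a landmark.

    variants must be a non-empty list ordered by pass number. Assumption:
    once every variant has been used, the last one is repeated.
    """
    pass_counts[landmark] += 1
    index = min(pass_counts[landmark], len(variants)) - 1
    return variants[index]


pass_counts = Counter()
variants = ["reverb_on_vocal", "add_synth_pad"]  # hypothetical control names
first = control_for_pass("Tokyo Tower", variants, pass_counts)
second = control_for_pass("Tokyo Tower", variants, pass_counts)
third = control_for_pass("Tokyo Tower", variants, pass_counts)
```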
- the smart phone installed in the car is used as an example of the device used for controlling the playback of music data, but a PC installed in the car, a car navigation system, etc., may also be used.
- the information obtained from the various traffic databases described above may be information obtained in real time from a cloud server via a network, or may be information obtained in advance and stored in a storage medium such as a memory.
- FIG. 1 is a diagram showing an example of music data reproduction control on the user side.
- the musical notes shown on the left side of FIG. 1 indicate that the reproduced sound of music data A is being output inside the vehicle while driving in Area 1.
- Other musical notes shown in FIG. 1 also indicate that the reproduced sound of the music data subject to reproduction control is being output while traveling in each area.
- when the driving area switches from area 1 to area 2, as indicated by the broken line in FIG. 1, playback switches to music data B, which is assigned to area 2. That is, when an event indicating the switch from area 1 to area 2 is acquired on the user side and the acquired event is determined to satisfy the trigger information preset by the producer, playback control that switches from music data A to music data B is performed.
- the music data B is switched to the music data B+α.
- the music data B+α is music data obtained by adding an additional track α to the music data B, which is the main track.
- music data B+β is then played. That is, when an event indicating that the user is driving in the vicinity of a landmark in area 2 is acquired on the user side, and it is determined that the acquired event satisfies the trigger information set in advance by the producer, playback is switched from music data B+α to music data B+β.
- the music data B+β is music data obtained by adding an additional track β, different from the additional track α, to the music data B, which is the main track.
- processing that imparts arbitrary musical effects, such as applying reverb to the vocals of the main track, may be applied to the reproduced sound of music data B.
- Music data playback control is performed based on the music generation parameters preset by the producer.
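The area-by-area switching described for FIG. 1 can be pictured as a small state holder that reacts to events through a rule table. This is a sketch under assumptions: the rule format, the event names, and the choice to clear the additional track on an area switch are all hypothetical and are not taken from the patent text.

```python
class Player:
    """Tracks which main track and additional track are currently playing."""

    def __init__(self, rules, initial_main):
        self.rules = rules        # {(event kind, target): (action, value)}
        self.main = initial_main
        self.additional = None

    def on_event(self, kind, target):
        rule = self.rules.get((kind, target))
        if rule is None:
            return  # no trigger is set for this event; keep playing as-is
        action, value = rule
        if action == "switch_main":
            # Assumption: switching areas also drops any additional track.
            self.main, self.additional = value, None
        elif action == "set_additional":
            self.additional = value

    def now_playing(self):
        if self.additional is None:
            return self.main
        return f"{self.main}+{self.additional}"


# Hypothetical rule table set in advance on the producer side.
rules = {
    ("area_enter", "area2"): ("switch_main", "B"),
    ("landmark_near", "landmark in area2"): ("set_additional", "beta"),
}
player = Player(rules, initial_main="A")
history = [player.now_playing()]
for event in [("area_enter", "area2"), ("landmark_near", "landmark in area2")]:
    player.on_event(*event)
    history.append(player.now_playing())
```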
- the creator can arbitrarily set what kind of playback control is performed on the user side when a given event is detected on the user side. For example, the creator can set Tokyo Tower as a landmark, increase the volume as the distance to Tokyo Tower decreases, and apply localization and panning processing so that the sound source is localized in the direction of Tokyo Tower.
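The landmark example combines two calculations: a gain that rises as the vehicle gets closer, and a pan that places the sound toward the landmark. The sketch below uses the standard haversine distance and initial-bearing formulas, a linear gain ramp, and an equal-power stereo pan. The ramp shape, the 1000 m radius, the minimum gain, and the two-channel layout are assumptions chosen for illustration, not values from the patent text.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius


def distance_and_bearing(lat1, lon1, lat2, lon2):
    """Great-circle distance (m) and initial bearing (deg, 0 = north)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    distance = 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))
    y = math.sin(dlmb) * math.cos(phi2)
    x = (math.cos(phi1) * math.sin(phi2)
         - math.sin(phi1) * math.cos(phi2) * math.cos(dlmb))
    bearing = math.degrees(math.atan2(y, x)) % 360
    return distance, bearing


def landmark_gain(distance_m, radius_m=1000.0, min_gain=0.2):
    """Linear ramp: min_gain at radius_m or farther, 1.0 at the landmark."""
    closeness = max(0.0, 1.0 - distance_m / radius_m)
    return min_gain + (1.0 - min_gain) * closeness


def stereo_pan(bearing_deg, heading_deg):
    """Equal-power (left, right) gains for a source at the given bearing."""
    relative = math.radians((bearing_deg - heading_deg + 180) % 360 - 180)
    pan = math.sin(relative)           # -1 = full left, +1 = full right
    angle = (pan + 1) * math.pi / 4    # 0 .. pi/2
    return math.cos(angle), math.sin(angle)
```

For example, a landmark due east of a vehicle heading north gives `stereo_pan(90, 0)`, which places the sound fully in the right channel. A real in-vehicle system would likely use more than two speakers.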
- FIG. 2 is a diagram showing an example of trigger information preset by the producer and playback control of music data.
- FIG. 2A is a diagram showing an example of trigger information.
- Trigger information includes POI information indicating the start and end of specific points (intersections, tunnels, bridges, etc.), landmark information indicating the start and end of landmarks (Tokyo Tower, Rainbow Bridge, etc.), area information indicating the start and end of each area, start/stop time, stop time, information indicating the start and end of right/left turns, speed information, and the like.
- FIG. 2B is a diagram showing an example of music data playback control.
- Types of music data playback control include effect processing (low-pass filter, reverb, echo, etc.), output of sound effects and event detection sounds, volume control and crossfade in/out, addition of additional tracks, content switching (environmental sound, guidance voice, etc.), localization/panning processing, musical score generation, and the like. The producer can also set in advance playback control of music data other than the above.
- music data playback control may be realized by music generation parameters that link trigger information and music information set in advance by the producer, or may be realized on the user side at the time of output.
- FIG. 3 is a diagram showing an example of playback control using additional tracks.
- the main track and additional tracks are music data of a predetermined duration, such as 10 seconds, 15 seconds, or 4 or 8 bars.
- the additional tracks Add11 and Add12 are added to the main track M1 and played from time t2, which is the next reproduction start timing of the main track M1.
- the additional track Add11 is a track that adds sequence changes to the main track M1.
- the additional track Add12 is a track that adds rhythm changes to the main track M1.
- the additional tracks are not limited to the above example, and can be arbitrarily set by the producer.
- a track having the same playback time as the main track M1 is set in advance as an additional track. While the car is stopped, the user listens to the reproduced sound of the main track M1, which gives a different impression than when the car is running. Note that the additional track may have a playback time different from that of the main track M1.
- the main track M1 to which the additional tracks Add11 and Add12 are added is reproduced for the first time between times t2 and t3, and for the second time between times t3 and t4. Further, it is reproduced for the third time between times t4 and t5.
- the additional track Add13 is added to the main track M1 instead of the additional tracks Add11 and Add12.
- the main track M1 to which the additional track Add13 is added is reproduced so that the reproduced sounds of the additional tracks Add11 and Add12 fade out after time T2, and the reproduced sound of the additional track Add13 fades in instead.
- the main track M1 to which the additional track Add13 is added is reproduced for the first time. Immediately after starting the car, the user hears the reproduced sound of the main track M1, which has a different impression from that when the car is running or stopped.
- the second reproduction of the main track M1 to which the additional track Add13 is added is performed between times t7 and t8.
- the additional track Add13 fades out, and only the main track M1 is reproduced during the time from t8 to t9.
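The fade-out of one set of additional tracks and the fade-in of another can be sketched as a crossfade between two sample sequences. An equal-power curve is assumed here because it keeps the perceived loudness roughly constant; the patent text only says that the tracks fade out and fade in, so the curve and the function names are illustrative choices.

```python
import math


def crossfade_gains(progress):
    """Equal-power gains (outgoing, incoming) for progress in [0, 1]."""
    p = min(1.0, max(0.0, progress))
    return math.cos(p * math.pi / 2), math.sin(p * math.pi / 2)


def mix_crossfade(outgoing, incoming):
    """Mix two equal-length sample lists, fading from outgoing to incoming."""
    count = min(len(outgoing), len(incoming))
    mixed = []
    for i in range(count):
        progress = i / (count - 1) if count > 1 else 1.0
        gain_out, gain_in = crossfade_gains(progress)
        mixed.append(outgoing[i] * gain_out + incoming[i] * gain_in)
    return mixed


# Toy signals: a constant outgoing track fading into silence.
faded = mix_crossfade([1.0] * 5, [0.0] * 5)
```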
- the main track M1 to which the additional tracks Add14 and Add15 are added is reproduced.
- when an event such as a stop is detected at a timing more than a predetermined percentage of the way through the time required for one loop playback of the main track M1, the playback with the additional track added will be performed before the start of the next loop playback of the main track M1.
- in this example, the time T3 at which the stop is detected is a timing at which one reproduction of the main track M1 during the time t7 to t8 has advanced more than a predetermined ratio, such as 80%, so the reproduction with the additional tracks Add14 and Add15 is started at time t9, which is the start timing of the next loop of the main track M1.
- the additional track Add14 is a track that adds a sequence change to the main track M1, like the additional track Add11, and the additional track Add15 is a track that adds a rhythm change to the main track M1, like the additional track Add12.
- the additional track Add11 and the additional track Add14, and the additional track Add12 and the additional track Add15 may be the same music data or different music data.
- loop playback of the main track M1 is continued in the same way, with additional tracks being added according to the conditions of the car.
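The timeline describes additional tracks starting at a loop boundary of the main track (for example, time t2), and a stop detected at T3, more than 80% of the way through the t7 to t8 loop, whose additional tracks start at t9. The sketch below implements one possible reading of that example: an addition normally starts at the next loop boundary, and is deferred by one further loop when the event arrives late in the current loop. The deferral rule and all names are assumptions; the source text is ambiguous on this point.

```python
import math


def loop_progress(event_time, loop_start, loop_len):
    """Fraction (0 <= p < 1) of the current loop elapsed at event_time."""
    return ((event_time - loop_start) % loop_len) / loop_len


def next_loop_start(event_time, loop_start, loop_len):
    """Start time of the first loop that begins after event_time."""
    loops_done = math.floor((event_time - loop_start) / loop_len)
    return loop_start + (loops_done + 1) * loop_len


def schedule_additional_track(event_time, loop_start, loop_len, late_ratio=0.8):
    """Return the time at which the additional track starts playing.

    Assumption: an event detected after late_ratio of the current loop
    is deferred by one more loop, so that the change can be prepared.
    """
    start = next_loop_start(event_time, loop_start, loop_len)
    if loop_progress(event_time, loop_start, loop_len) > late_ratio:
        start += loop_len
    return start


# With 10-second loops starting at 0, the loop starts are 0, 10, 20, ...
early = schedule_additional_track(3, loop_start=0, loop_len=10)
late = schedule_additional_track(19, loop_start=0, loop_len=10)
```

Aligning additions to loop boundaries keeps the additional tracks in phase with the main track, regardless of when the event is detected.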
- the reproduced sound of the main track assigned to the area is presented to the user while sequentially changing the impression.
- the user can obtain a new listening experience of music by listening to the reproduced sound of the main track that changes according to his or her driving.
- the additional tracks Add11 and Add12 are simultaneously added to the main track M1.
- the present technology is not limited to this. Only one of the additional tracks Add11 and Add12 (or the additional tracks Add14 and Add15) may be added to the main track M1, or playback control may be performed such that both or either one of the additional tracks Add11 and Add12 (or the additional tracks Add14 and Add15) is added to the main track M1 with a delay of a predetermined number of bars.
- the additional track may consist of one material, or may consist of multiple different materials.
- the aforementioned main track and additional track are loop material with a predetermined number of bars and playback time.
- Both the main track and the additional track may not be loop material, or only one of them may be loop material.
- FIG. 4 is a diagram showing an example of constituent elements of music data.
- files for the tracks in rows 1 to 45 are prepared by the producer when setting the music generation parameters.
- four types of track files of Rhythm, Bass, Harmony, and Vocal are shown as an example.
- the second column from the left shows the name of each file.
- the producer side selects the tracks that make up the main track from among these tracks that they have uploaded in advance.
- tracks selected by the producer side are indicated by black circles.
- a main track M1, which is one main track, is constructed using the tracks selected from the 45 rows as material.
- a main track M1 constituted by each track shown in FIG. 4 is the same track as the main track M1 described with reference to FIG.
- the main tracks M2 and M3 are also composed of the tracks indicated by the black circles in the 4th and 5th columns selected in the same manner. Each main track set in this manner is assigned to an arbitrary area.
- the additional tracks described in FIG. 3 are also set in advance in the same manner.
- the additional track may be set by selecting the same track as a track that constitutes the main track, or by selecting a track that is different from the tracks that constitute the main track.
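The selection table of FIG. 4, in which black circles mark the track files that make up each main track, can be modeled as a mapping from main track names to selected row numbers. The file names, the row numbers, and the area assignment below are hypothetical examples, and only a handful of rows is shown.

```python
# Uploaded track files, one per table row (hypothetical names).
stems = [
    {"row": 1, "kind": "Rhythm", "file": "rhythm_01.wav"},
    {"row": 2, "kind": "Rhythm", "file": "rhythm_02.wav"},
    {"row": 3, "kind": "Bass", "file": "bass_01.wav"},
    {"row": 4, "kind": "Harmony", "file": "harmony_01.wav"},
    {"row": 5, "kind": "Vocal", "file": "vocal_01.wav"},
]

# Rows marked with a black circle for each main track (hypothetical).
selection = {
    "M1": {1, 3, 5},
    "M2": {2, 3, 4},
}


def build_main_track(name, stems, selection):
    """Return the files of the selected rows, in table order."""
    return [s["file"] for s in stems if s["row"] in selection[name]]


# Each main track is then assigned to an area (hypothetical assignment).
area_to_main = {"area1": "M1", "area2": "M2"}
area_files = {
    area: build_main_track(main, stems, selection)
    for area, main in area_to_main.items()
}
```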
- the main track may be set by specifying a specific section of an existing song that has been uploaded in advance, instead of selecting from pre-registered components (materials).
- when the main track is set using an existing song, information such as the number of measures of the main track, the playback time of the main track, and the start position, end position, and playback time of the section that can be used as the main track (loop material) among the sections that make up the existing song is set in advance by the producer.
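The number of bars, the playback time, and the start and end positions of a loop section are related through tempo: playback time in seconds equals bars times beats per bar times 60, divided by beats per minute. The sketch below assumes a fixed tempo and a default of four beats per bar, neither of which is stated in the patent text, and the class and function names are hypothetical.

```python
from dataclasses import dataclass


def loop_duration_s(bars, bpm, beats_per_bar=4):
    """Playback time in seconds of a loop at a fixed tempo."""
    return bars * beats_per_bar * 60.0 / bpm


@dataclass
class LoopSection:
    """Section of an existing song used as loop material (hypothetical)."""
    start_s: float
    end_s: float
    bars: int
    beats_per_bar: int = 4  # assumption: 4/4 time

    @property
    def playback_time_s(self):
        return self.end_s - self.start_s

    @property
    def implied_bpm(self):
        """Tempo implied by the section length and its number of bars."""
        return self.bars * self.beats_per_bar * 60.0 / self.playback_time_s


# A 4-bar section running from 30 s to 40 s of an existing song.
section = LoopSection(start_s=30.0, end_s=40.0, bars=4)
```

A check such as `section.implied_bpm` can help confirm that a chosen section really spans a whole number of bars at the song's tempo.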
- FIG. 5 is a diagram illustrating a configuration example of an information processing system 1 according to an embodiment of the present technology.
- the information processing system 1 includes a producer terminal 11, which is a structure on the producer side, an information processing server 12, and a music data reproduction system 13, which is a structure on the user side.
- the music data reproduction system 13 is composed of a car 21 driven by a user and a user terminal 22 installed inside the car 21 .
- the user terminal 22 is, for example, a smart phone.
- the producer terminal 11, the information processing server 12, and the music data reproduction system 13 are connected via a network 31 so as to be able to communicate with each other.
- the network 31 is constructed by, for example, the Internet or a wide area communication network.
- any WAN (Wide Area Network) or LAN (Local Area Network) may be used as the network 31, and the protocol for constructing the network 31 is not limited.
- FIG. 6 is a block diagram showing an example of the hardware configuration of the creator terminal 11.
- the creator terminal 11 is configured by a computer such as a PC.
- the producer terminal 11 includes a CPU (Central Processing Unit) 51, a ROM (Read Only Memory) 52, a RAM (Random Access Memory) 53, an input/output interface 55, and a bus 54 connecting them together.
- An input unit 56, an output unit 57, a storage unit 58, a communication unit 59, a drive unit 60, and the like are connected to the input/output interface 55.
- the input unit 56 is, for example, a keyboard, pointing device, touch panel, or other operating device. If the input unit 56 includes a touch panel, the touch panel may be integrated with the output unit 57.
- the output unit 57 is composed of a display device and a speaker.
- the storage unit 58 is a non-volatile storage device, such as an HDD, flash memory, or other solid-state memory.
- the communication unit 59 is a communication module for communicating with other devices via the network 31.
- the communication unit 59 may use either wired or wireless communication.
- the drive unit 60 is a device capable of driving a removable recording medium 61 such as an optical recording medium or magnetic recording tape.
- Information processing by the creator terminal 11 having the hardware configuration described above is realized by cooperation between software stored in the storage unit 58 or the ROM 52 and the hardware resources of the creator terminal 11 .
- the information processing method according to the present technology is realized by loading a program constituting software stored in the ROM 52 or the like into the RAM 53 and executing the program.
- the program is installed on the creator's terminal 11 via the removable recording medium 61, for example.
- the program may be installed on the creator terminal 11 via a global network or the like.
- any non-transitory storage medium readable by the producer terminal 11 may be used.
- the information processing server 12 which is a server on the cloud, has the same configuration as the creator terminal 11 as shown in FIG.
- the information processing server 12 may be configured by a plurality of computers.
- FIG. 7 is a block diagram showing an example hardware configuration of the user terminal 22 that constitutes the music data reproduction system 13 .
- the user terminal 22 is configured by connecting a photographing unit 72 , a microphone 73 , a sensor 74 , a display 75 , an operation unit 76 , a speaker 77 , a storage unit 78 and a communication unit 79 to the control unit 71 .
- the control unit 71 is composed of a CPU, a ROM, a RAM, and the like.
- the control unit 71 executes a predetermined program and controls the overall operation of the user terminal 22 according to user's operation.
- the user terminal 22 having the control unit 71 functions as a sensing device that senses the current position of the car 21 driven by the user, acceleration of the vehicle body, steering operation, and the like. Further, since the user terminal 22 has the speaker 77, it may function as an output device for outputting the reproduced sound of music data.
- the photographing unit 72 is configured by a lens, an imaging device, etc., and performs photographing according to control by the control unit 71 .
- the imaging unit 72 outputs image data obtained by imaging to the control unit 71 .
- the current position can be obtained based on image data obtained by the imaging unit 72 instead of obtaining the current position using GPS.
- the microphone 73 supplies the collected sound data to the control unit 71 .
- the sensor 74 is composed of a GPS sensor (positioning sensor), an acceleration sensor, a gyro sensor, and the like.
- the sensor 74 measures the current position, speed, acceleration, etc. of the car 21 driven by the user, and outputs sensor data indicating the measurement results to the control unit 71 .
- the control unit 71 detects that a steering operation has been performed based on the current position, acceleration, and the like, and acquires events such as the car 21 turning left or right or turning at a curve.
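- One way such a steering event could be derived from sensor data is sketched below: the yaw rate from a gyro sensor is integrated over a short window and the resulting heading change is compared with a threshold. The function name `classify_turn`, the 60-degree threshold, and the sign convention are assumptions made for illustration; the embodiment does not specify the detection method.

```python
def classify_turn(yaw_rates_dps, dt, threshold_deg=60.0):
    """Classify a manoeuvre from gyro yaw-rate samples (hypothetical helper).

    yaw_rates_dps: yaw rate samples in degrees per second.
    dt: sampling interval in seconds.
    Positive yaw rate is assumed to mean counter-clockwise (left) rotation.
    """
    # Integrate the yaw rate to obtain the total heading change in degrees.
    heading_change = sum(rate * dt for rate in yaw_rates_dps)
    if heading_change >= threshold_deg:
        return "left_turn"
    if heading_change <= -threshold_deg:
        return "right_turn"
    return "straight"
```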
- the display 75 is configured by an LCD (Liquid Crystal Display) or the like, and displays various information such as menu screens and images being captured according to control by the control unit 71 .
- the area in which the vehicle is currently traveling and the music data linked to that area may be displayed in a playlist format.
- the operation unit 76 is composed of operation buttons, a touch panel, and the like provided on the surface of the housing of the user terminal 22 .
- the operation unit 76 outputs information indicating the details of the user's operation to the control unit 71 .
- the speaker 77 outputs sound based on the reproduction signal of the music data supplied from the control section 71 .
- the storage unit 78 is composed of a flash memory or a memory card inserted into a card slot provided in the housing.
- the storage unit 78 stores various data such as image data supplied from the control unit 71 .
- In addition to image data, it is also possible to record, for example, how many times the user has circled a landmark, so that the reproduction control of the music data differs depending on the number of laps.
- the communication unit 79 performs wireless or wired communication with an external device such as the information processing server 12 .
- FIG. 8 is a block diagram showing an example of the functional configuration of the information processing system 1.
- the producer terminal 11 of the information processing system 1 implements a music generation parameter setting section 101 .
- the music generation parameter setting section 101 is composed of a music information setting section 111 and a trigger information setting section 112 .
- the music control section 301 is composed of a position information acquisition section 311 , a vehicle body information acquisition section 312 , a music generation parameter control section 313 and a music reproduction control section 314 .
- the position information acquisition unit 311 and the vehicle body information acquisition unit 312 function as an event acquisition unit that acquires events that occur on the user side. Note that their functions are described below separately for the position information acquisition unit 311 and the vehicle body information acquisition unit 312, rather than for the event acquisition unit as a whole.
- the events include events that occur when the user does not board a mobile object (walking, etc.) in addition to events when the user boards a mobile object such as a car.
- the music control unit 301 may be realized in either one of the car 21 and the user terminal 22 that constitute the music data reproduction system 13, or may be realized by the car 21 and the user terminal 22 operating in cooperation.
- An example of joint operation between the car 21 and the user terminal 22 is, for example, that the car 21 implements the vehicle body information acquisition unit 312 and the user terminal 22 implements the location information acquisition unit 311 .
- the music information setting section 111 of the music generation parameter setting section 101 which is a component of the producer terminal 11, sets, for example, music information related to music data used for each area according to an operation by the producer. Details of the music information will be described later.
- In the trigger information setting unit 112, trigger information defining trigger conditions for executing playback control of music data is set according to an operation by the producer. Details of the trigger information will be described later.
- the music generation parameter setting section 101 sets music generation parameters based on the music information set by the music information setting section 111 and the trigger information set by the trigger information setting section 112 . For example, information linking music information and trigger information is set as a music generation parameter.
- the music generation parameter setting unit 101 transmits the music generation parameters to the information processing server 12 via the network 31 .
- the music generation parameters transmitted from the music generation parameter setting section 101 are stored in a database and managed.
- the information processing server 12 manages various information such as track files uploaded from the producer terminal 11 .
- Information such as music generation parameters managed by the information processing server 12 is supplied to the music data reproduction system 13 at a predetermined timing. For example, a plurality of music generation parameters that link various pieces of music information and trigger information are supplied to the music data reproduction system 13 .
- Various external databases such as a map server 211 and a traffic information server 212 may be connected to the network 31 in addition to the information processing server 12 that stores and supplies music generation parameters.
- the position information acquisition unit 311 of the music control unit 301, which is a component of the music data reproduction system 13, acquires the position information of the car 21 driven by the user based on, for example, sensor data (such as GPS data) acquired from the sensor 74 built into the user terminal 22.
- the vehicle body information acquisition unit 312 acquires vehicle body information of the car 21 driven by the user based on sensor data (for example, speed data, acceleration data, etc.) acquired from the sensor 74 .
- the vehicle body information is, for example, information indicating start/stop, acceleration/deceleration, right/left turn, steering at a curve, etc. when the car 21 is actually running.
- the music generation parameter control unit 313 controls the music generation parameters sent from the music generation parameter setting unit 101 or from the information processing server 12 via the network 31, according to the position information acquired by the position information acquisition unit 311 and the vehicle body information acquired by the vehicle body information acquisition unit 312.
- Consider, for example, a case where a music generation parameter indicating the content of the playback control is set in advance to "add an additional track to the main track in the form of an arpeggio when turning right or left at a specific intersection I".
- In this case, the trigger information "specific intersection I" and "turn left or right" and the algorithm "add additional tracks to the main track in arpeggio format" are linked and set as music generation parameters.
- Position information for determining whether or not the car 21 driven by the user has entered the "specific intersection I", and vehicle body information for determining whether a "right or left turn" has been made, are required.
- position information is obtained based on sensor data measured by a GPS sensor (positioning sensor) or the like.
- vehicle body information is obtained based on sensor data measured by an acceleration sensor or a gyro sensor.
- the vehicle information may be acquired using a sensor mounted on the car 21 instead of the sensor mounted on the user terminal 22 .
- Playback control is then performed based on the algorithm that adds additional tracks to the main track in arpeggio form.
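- The linkage between trigger information and an algorithm in the intersection example can be sketched as a simple lookup. The names `Trigger`, `RULES`, and `select_algorithm`, and the string identifiers used for places, events, and algorithms, are hypothetical; the embodiment does not prescribe a concrete representation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Trigger:
    """Hypothetical trigger key: where the car is and what it is doing."""
    place: str  # e.g. "intersection_I", determined from position information
    event: str  # e.g. "turn", determined from vehicle body information


# Music generation parameters: trigger information linked to an algorithm.
RULES = {
    Trigger("intersection_I", "turn"): "add_arpeggio_track",
}


def select_algorithm(current_place, vehicle_event):
    """Return the algorithm linked to the current trigger, or None if none applies."""
    # Left and right turns are treated as the same "turn" trigger in this sketch.
    event = "turn" if vehicle_event in ("left_turn", "right_turn") else vehicle_event
    return RULES.get(Trigger(current_place, event))
```

- For example, `select_algorithm("intersection_I", "left_turn")` selects the arpeggio algorithm, while the same turn at any other place selects nothing.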
- the parameter control section 313 may appropriately control the music generation parameters, or such settings may be made in advance by the producer.
- the control of music generation parameters is, for example, a process of changing content generation parameters preset by the producer.
- the music reproduction control unit 314 performs reproduction control on music data based on the music generation parameters controlled by the music generation parameter control unit 313 .
- When the music generation parameter control unit 313 controls the music generation parameters preset by the producer, the music reproduction control unit 314 performs reproduction control based on the music generation parameters after control. On the other hand, if the music generation parameter control unit 313 does not control the music generation parameters preset by the producer, the music reproduction control unit 314 performs reproduction control based on the music generation parameters preset by the producer.
- the output unit 401 outputs music data whose reproduction is controlled by the music reproduction control unit 314 .
- the output unit 401 is, for example, the in-vehicle audio system of the vehicle 21 driven by the user or the speaker 77 built in the user terminal 22 set in the vehicle, but is not limited to this and may be an external speaker.
- the music reproduction control section 314 may be configured to include the output section 401 .
- the music reproduction control unit 314 and the output unit 401 are connected by wire or wirelessly, and in the case of wireless connection, various protocols such as Bluetooth (registered trademark) and Wi-Fi (registered trademark) may be used.
- FIG. 9 is a diagram showing the data structure of the music generation parameters 501 set by the music generation parameter setting section 101 of FIG.
- the music generation parameters 501 are configured by associating music information 511 and trigger information 512 .
- the music information 511 is music information set by the music information setting unit 111 .
- the music information 511 is composed of music parameter information 511A, music metadata information 511B, and tag information 511C.
- the music parameter information 511A is, for example, information about an algorithm that indicates what kind of playback control is to be performed for music data linked to each area.
- the music parameter information 511A indicates an algorithm such as "Add an additional track to the main track in arpeggio form” or "Increase the volume of the main track in proportion to the speed” as described above.
- the music metadata information 511B is information indicating attributes of music data, such as key, beat, BPM (Beats Per Minute), chord, and playback time of music data.
- the music metadata information 511B may also include, for example, tonality information and instrument type information. Also, if the music data is object audio data, the position data of each object forming the music data may be included as the music metadata information 511B.
- the type of data to be specified in the music metadata information 511B can be set in advance by the producer.
- the tag information 511C is, for example, information specifying which main track and additional track are assigned to each set area. Here, if there are multiple additional tracks assigned to each area, they may be linked as one group. Information different from the information specifying the track may be set as the tag information.
- the trigger information 512 is trigger information set by the trigger information setting unit 112 .
- the trigger information 512 is composed of area information 512A, landmark information 512B, POI information 512C, and moving object information 512D.
- the area information 512A is information about the area set on the map by the creator. For example, information on the start position and end position of the area (A in FIG. 2) is set as the area information 512A. A large area (corresponding to a scene described later) composed of a plurality of areas may be set as the area information 512A. Although the details will be described later, the creator can set an area of arbitrary shape on a display screen such as a GUI (Graphical User Interface), for example.
- the landmark information 512B is information indicating landmarks such as Tokyo Tower and Rainbow Bridge.
- the creator can also set arbitrary buildings such as shrines, temples, and schools as landmarks.
- the POI information 512C is information indicating specific points other than the aforementioned areas and landmarks. For example, intersections, tunnels, bridges (A in FIG. 2), etc. are set as specific points indicated by the POI information 512C. Which intersections, tunnels, bridges, etc. are set as specific points may be arbitrarily set by the creator, or may be set based on information published on the database of an external service such as Google Map (registered trademark).
- the mobile object information 512D is information associated with a specific event based on location information. For example, as shown in A of FIG. 2, it is information about a moving body such as a car 21 driven by a user, such as start/stop, stop time, right/left turn, and speed.
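- The data structure of FIG. 9 described above can be sketched as nested records, as follows. The class and field names are hypothetical, and the use of plain dictionaries for each piece of information is an assumption; only the grouping into music information 511 (511A to 511C) and trigger information 512 (512A to 512D) comes from the description.

```python
from dataclasses import dataclass, field, asdict


@dataclass
class MusicInformation:  # corresponds to music information 511
    parameter_info: dict  # 511A: algorithm used for playback control
    metadata: dict        # 511B: key, beat, BPM, chord, playback time, etc.
    tag_info: dict        # 511C: main track and additional tracks per area


@dataclass
class TriggerInformation:  # corresponds to trigger information 512
    area: dict = field(default_factory=dict)           # 512A
    landmark: dict = field(default_factory=dict)       # 512B
    poi: dict = field(default_factory=dict)            # 512C
    moving_object: dict = field(default_factory=dict)  # 512D


@dataclass
class MusicGenerationParameter:  # corresponds to music generation parameters 501
    music: MusicInformation
    trigger: TriggerInformation


param = MusicGenerationParameter(
    music=MusicInformation(
        parameter_info={"algorithm": "Arpeggio"},
        metadata={"key": "C", "bpm": 120},
        tag_info={"area_music_id": "Area01", "additional_tracks": "Group2"},
    ),
    trigger=TriggerInformation(
        poi={"name": "intersection I"},
        moving_object={"event": "turn"},
    ),
)

# A plain-dict form that could be stored or sent over the network.
as_dict = asdict(param)
```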
- A series of operations performed by the creator on the setting screen will be described with reference to FIGS. 10 to 22.
- The series of operations performed by the producer on the setting screen is, as described above, an operation for setting music generation parameters, using, for example, a GUI, from trigger information that defines the conditions that trigger the execution of music data playback control and algorithms that define various contents related to playback control.
- the setting screen may be displayed on the display that constitutes the output unit 57 of the creator terminal 11 or may be displayed on an external display connected to the creator terminal 11 .
- FIGS. 10 to 20 are diagrams showing display examples of setting screens displayed on the creator terminal 11.
- the setting screen is composed of an area setting section 601 used to set an area and a detailed setting section 602 used to make detailed settings for each area set using the area setting section 601 .
- the area setting portion 601 displays a map of the range selected by the creator.
- the map displayed in the area setting unit 601 may be pre-stored in the creator terminal 11, or may be acquired via the network 31 from the database of an external service such as the aforementioned Google Map (registered trademark).
- Fig. 10 is also a screen for adding (newly creating) a scene.
- the scene is the aforementioned large area. Since a large area includes a plurality of areas, it is more efficient and effective to achieve scalability by setting each scene (that is, large area) rather than setting individual areas individually.
- the setting screen shifts to the screen of FIG. 11, which is a screen for making detailed settings for the additional scene.
- the detailed setting portion 602 in FIG. 10 displays an INTRO SCENE tab 621, which is a tab operated when INTRO SCENE, which is a scene that has already been set, is selected.
- FIG. 11 is a diagram showing the screen after the additional scene creation tab 622 is selected.
- the setting of the scene to be added and the setting of the main track and additional tracks assigned to the scene to be added are performed.
- the producer inputs information about the scene to be added to SCENE INFO 702, and inputs information about the main track assigned to the scene to be added to MUSIC INFO 703.
- the information input to MUSIC INFO 703 is included in the aforementioned music metadata information 511B.
- the name and ID of the scene are entered as information about the scene to be added.
- “Creative Scene” is input as the scene name and "01" is input as the ID.
- Scene names can be set and changed arbitrarily.
- the track list 701 shows information about additional tracks to be added to the main track during playback control of music data.
- the additional track is a component of the music data described with reference to FIG. In the example of FIG. 11, three material files of kick, snare, and key are displayed in the track list 701 .
- the upload tab 704 is a tab operated when uploading music data files.
- FIG. 12 is a diagram showing an example of a screen used for setting grouping when there are multiple additional tracks.
- three material files of kick, bass, and vox are displayed in the track list 701 in a grouped state.
- kick and vox are set as materials for Group1.
- multiple additional tracks are generally used to achieve various playback controls for the music data, and in terms of production it is more efficient to group the additional tracks than to handle each additional track individually. Producers can set and check combinations of the multiple additional tracks that they have set themselves.
- the upload button 705 may be selected to upload.
- FIG. 13 is a diagram showing the screen after adding one scene.
- the detailed setting section 602 additionally includes a Creative Scene tab 623 that is operated when selecting the added "Creative Scene" scene.
- Fig. 14 is a diagram showing a screen used for setting each area that constitutes the "Creative Scene" scene and for setting TAG Information.
- TAG Information for example, information about main tracks and additional tracks assigned to each area is set.
- an area A1 of arbitrary size and shape is drawn on the map within the area setting section 601.
- the creator can select the range of area A1 on the map.
- the display of the detailed setting section 602 is used to set TAG Information indicating the main track and additional tracks allocated to area A1.
- the main track assigned to area A1 is set using the display of Area Music ID 801, and the additional track is set using the display of Additional Tracks 802.
- Area Music ID indicating the main track assigned to area A1 and information indicating additional tracks are set as TAG Information.
- an Area Music ID of "Area01" is set, and Group2 (FIG. 12) is selected from Group1 to Group3 set as groups for additional tracks.
- TAG Information is the aforementioned tag information 511C (FIG. 9).
- the setting of the area on the map by the area setting unit 601 is performed, for example, by arbitrarily setting it on the producer side using a device such as a mouse.
- the area may be set by selecting an area with an arbitrary shape from among a plurality of preset areas.
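- Determining whether the current position lies inside an area of arbitrary shape can be sketched with a standard ray-casting test, assuming the area is stored as a polygon of vertices. The function name `point_in_area` and the use of planar coordinates are assumptions; the embodiment does not state how an area is represented or tested.

```python
def point_in_area(point, polygon):
    """Ray-casting test: is the point (x, y) inside the polygon?

    polygon is a list of (x, y) vertices in order. Coordinates are treated as
    planar, which is an approximation for small areas on a map.
    """
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Does this edge straddle the horizontal line through the point?
        if (y1 > y) != (y2 > y):
            # x coordinate at which the edge crosses that horizontal line.
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


# Hypothetical rectangular area A1 in arbitrary planar units.
area_a1 = [(0.0, 0.0), (4.0, 0.0), (4.0, 3.0), (0.0, 3.0)]
```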
- the setting screen shifts to the screen of FIG. 15 showing the screen used for setting the algorithm defining various contents regarding the reproduction control in the area A1.
- FIG. 15 is a diagram showing a screen used for algorithm setting in area A1.
- the creator selects an algorithm to be used in area A1 from the algorithm list 901 of the detail setting section 602.
- Algorithm list 901 displays the names of selectable algorithms, for example, in a pull-down format.
- three types of algorithms are displayed: "Arpeggio" (arpeggio), "Synth” (synthesizer), and "Rhythm” (rhythm).
- the type of algorithm is not limited to this example. For example, two types may be used, or four or more types may be used.
- the trigger information related to scenes and areas is arbitrarily set by the creator using a screen (not shown).
- the trigger information setting and algorithm setting for the scene and each area that constitutes the scene are completed.
- Music generation parameters are set by associating the trigger information with the algorithm.
- FIG. 16 is a diagram showing an example of a screen for setting POI information.
- the POI information is information regarding a specific point, such as a start point and an end point.
- a point P1 is displayed in addition to the area A1.
- By selecting the button 613, it is possible to set the point P1 at an arbitrary position on the map within the area setting section 601. For example, with the button 613 selected, the point P1 is set by clicking an arbitrary point on the map in the area setting section 601 with the cursor.
- an algorithm list 902 is displayed in the detail setting section 602, like the screen of FIG. Algorithm list 902 is used by the producer to set an algorithm that defines various details regarding playback control at point P1.
- Since point P1 is a special point set by the creator, a unique parameter may be set for the point P1.
- FIG. 18 is a diagram showing a state in which the "Arpeggio" algorithm is selected from the algorithm list for point P1.
- parameters such as the length of the notes of the additional track to be played back in arpeggiated form are set as parameters specific to point P1.
- the trigger information related to point P1 is arbitrarily set by the creator using a screen (not shown), as in the case of scenes and areas.
- FIG. 19 is a diagram showing an example of a screen regarding landmark setting.
- the landmark is information such as a start point and an end point related to landmarks such as Tokyo Tower and Rainbow Bridge.
- the landmark L1 is displayed.
- By selecting the button 611, it is possible to set the landmark L1 at an arbitrary position on the map in the area setting section 601. For example, with the button 611 selected, the landmark L1 is set by clicking an arbitrary point on the map in the area setting section 601 with the cursor.
- the setting screen shifts to the screen in FIG.
- the algorithm list 905 for the landmark L1 is displayed in the detail setting section 602.
- Algorithm list 905 is used by the creator to set algorithms that define various details regarding playback control for landmark L1.
- a transition may be made to a screen for setting an algorithm defining various contents regarding the reproduction control in the landmark L1.
- Screen transition means may be appropriately changed by the vendor or creator of the application of the setting screen.
- Arbitrary parameters may be added or changed by the creator.
- 1. Type of additional track that is additionally played when the car driven by the user travels around the landmark L1 (select Guidance in CONTENTS TYPE)
- 2. Processing based on the distance between the car 21 driven by the user and the landmark L1 (select Distance in PLAYBACK SETTING)
- 3. Volume change set as effect processing (select Volume Change in EFFECT SETTING)
- the main track is changed from the music data to the guidance voice regarding the landmark L1.
- Since the landmark L1 is a special point set by the creator, the creator can express various variations, which makes it possible to provide a music experience that users will not tire of. For example, the producer can provide a musical experience such as "apply reverb processing only to the vocal of the main track as the landmark L1 is approached" or "change the type of guidance voice according to the number of times the landmark L1 is passed".
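- Distance-based processing with a volume change, as selected in the landmark settings above, could be sketched as follows. The great-circle distance uses the standard haversine formula; the function names, the 500 m radius, and the linear ramp are assumptions chosen for illustration only.

```python
import math


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two latitude/longitude points."""
    r = 6371000.0  # mean Earth radius in metres
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def guidance_volume(distance_m, radius_m=500.0):
    """Linear ramp (hypothetical): 0.0 at or beyond radius_m, 1.0 at the landmark."""
    return max(0.0, min(1.0, 1.0 - distance_m / radius_m))
```

- Passing the output of `haversine_m` for the car position and the landmark position to `guidance_volume` makes the guidance voice louder as the car approaches the landmark.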
- each algorithm set using the display of the algorithm lists 901, 902, and 905 described above is the music parameter information 511A (FIG. 9) described above.
- the unique parameter 903 at point P1 and the unique parameter at landmark L1 may be included in the music parameter information 511A.
- the area setting section 601 is displayed on the left side of the display screen, and the detail setting section 602 is displayed on the right side of the display screen.
- the left and right sides may be reversed, or the layout may be changed by the creator's preference.
- FIG. 21 is a diagram showing an example of a product in a series of operations by the creator using the setting screen.
- areas numbered 1 to 22 are set on the map. Each area number corresponds to the Area Music ID, and the main track identified by the Area Music ID is displayed as an icon. As an example, Area 1 to Area 22 may be collectively displayed as one scene (large area).
- the main track may be played by clicking the speaker icon in each area so that the producer can check the preview.
- the shape of the icon is not limited to this.
- Each area may be displayed in a way that the creator can identify it, such as by color-coding. Also, if there are particularly important areas or favorite areas, they may be displayed with an icon such as a star mark.
- each producer will set music generation parameters for many regions, both in Japan and abroad (for example, Tokyo and Osaka in Japan, and New York and Paris abroad).
- a product set for a specific region may be diverted to another region, or the product may be shared among producers. This makes it possible to increase production efficiency and achieve scalability.
- Layout proposals in which a product produced for a specific region is diverted to another region may be displayed, and layout proposals produced in the past by experienced producers may be displayed in a pull-down format so that novice producers can select them.
- FIG. 22 is a flowchart relating to a series of operations performed by the creator using the setting screen. The processing of each step shown in FIG. 22 is performed by each section of the music generation parameter setting section 101 according to the operation of the producer (FIG. 8).
- In step S1, the music generation parameter setting section 101 accepts selection of the scene addition tab 622 by the producer, and displays a screen used for setting the main track and the additional tracks to be assigned to the scene (large area) to be added (newly created).
- In step S2, the music information setting unit 111 sets information about the main track to be played back in the additional scene according to the producer's operation on the screen for detailed setting of the main track to be assigned to the additional scene.
- In step S3, the music information setting unit 111 sets information regarding additional tracks to be added to the main track set in step S2.
- In step S4, the music information setting unit 111 sets information about the main track for each area that constitutes a scene.
- In step S5, the music information setting unit 111 sets information regarding additional tracks to be added to the main track set in step S4.
- a single additional track may be set, or a group of multiple additional tracks may be set.
- In step S6, the trigger information setting unit 112 sets trigger information.
- In step S7, the music information setting unit 111 sets music parameter information according to the algorithm set by the producer.
- In step S8, the trigger information setting unit 112 sets POI information and landmark information indicating the setting contents according to the POI and landmark settings made by the creator. Similar to steps S6 and S7, trigger information setting and algorithm setting are performed for the POIs and landmarks set in step S8.
- In step S9, the music generation parameter setting unit 101 sets music generation parameters by associating trigger information set for scenes, areas, POIs, and landmarks with music information including algorithm settings.
- In step S10, the music generation parameter setting unit 101 transmits the music generation parameters set in step S9 to the information processing server 12 via the network 31. After that, the series of processes related to the setting of music generation parameters is completed.
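- Steps S9 and S10 can be sketched as pairing each target's trigger information with its algorithm setting and serializing the result for transmission. The function name `build_parameters`, the target identifiers, and the choice of JSON as the transfer format are assumptions; the embodiment does not specify a format.

```python
import json


def build_parameters(triggers, algorithms):
    """Step S9 sketch: associate trigger info with music info for each target.

    triggers and algorithms are dicts keyed by a hypothetical target identifier
    such as "area:A1" or "poi:P1".
    """
    missing = set(triggers) - set(algorithms)
    if missing:
        raise ValueError(f"no algorithm set for: {sorted(missing)}")
    return [
        {"target": t, "trigger": triggers[t], "music": {"algorithm": algorithms[t]}}
        for t in sorted(triggers)
    ]


triggers = {"area:A1": {"event": "enter"}, "poi:P1": {"event": "turn"}}
algorithms = {"area:A1": "Synth", "poi:P1": "Arpeggio"}

# Step S10 sketch: this string would be the body sent to the server.
payload = json.dumps(build_parameters(triggers, algorithms))
```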
- A series of operations performed by the creator on the screen and an example of the screen displayed on the creator side have been described above using FIGS. 10 to 22.
- FIG. 23 is a diagram showing a reproduction space in the music data reproduction system 13.
- the reproduction space is the interior space of the car 21 driven by the user, as described above.
- a user terminal 22 such as a smart phone is installed on a dashboard in the vehicle interior.
- the music data whose reproduction is controlled may be output from the speaker 77 of the user terminal 22 or may be output from the audio system of the car 21.
- FIGS. 24 to 30 are diagrams showing specific examples of music data reproduction control in the music data reproduction system 13.
- The musical notes in FIGS. 24 to 30 indicate that the reproduced sound (music) of the music data is being output inside the car 21 while it drives through each area or point.
- FIG. 24 is a diagram showing an example of reproduction control when the travel area is switched.
- the scenery on the left side of FIG. 24 is the scenery in front of the car 21 traveling in the area A11 shown on the map on the right side. While traveling in the area A11, as indicated by the musical notes under the scenery, the reproduced sound of the music data (main track) assigned to the area A11 is output inside the vehicle.
- FIG. 25A is a diagram showing an example of reproduction control on a curve.
- playback control is performed by dynamically changing parameters such as volume according to the amount of steering operation. For example, the steeper the curve, the louder the additional track added to the main track.
- FIG. 25B is a diagram showing an example of reproduction control based on stopping and starting at an intersection or the like.
- the reproduction control is performed by changing the types of notes forming the additional tracks added to the main track according to the acceleration at the time of starting. For example, the greater the acceleration, the greater the proportion of 16th notes and 32nd notes that make up the additional track, which can give the user a sense of exhilaration.
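The two mappings described for FIGS. 25A and 25B can be sketched as simple clamped linear functions. This is only an illustration: the linear shape, the function names, and the default maxima (90 degrees of steering, 4.0 m/s² of acceleration) are assumptions, since the text states only the direction of each relationship.

```python
def _clamp01(value):
    """Limit a value to the range 0.0 to 1.0."""
    return max(0.0, min(1.0, value))


def additional_track_gain(steering_angle_deg, max_angle_deg=90.0):
    """Sharper curve (larger steering amount) gives a louder additional track."""
    return _clamp01(abs(steering_angle_deg) / max_angle_deg)


def short_note_ratio(acceleration_ms2, max_acceleration_ms2=4.0):
    """Stronger start-up acceleration gives a larger share of 16th and 32nd notes."""
    return _clamp01(acceleration_ms2 / max_acceleration_ms2)
```

For example, `additional_track_gain(45)` yields half of the maximum gain, and a negative acceleration (braking) yields a short-note ratio of zero.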
- FIG. 26 is a diagram showing an example of playback control based on passage of landmarks.
- Fig. 26 shows the scenery when approaching Tokyo Tower, which is set as the landmark L1.
- the reproduced sound of the music data set for the landmark L1 is output with the sound source localized in the direction of the landmark L1.
- the reproduction control based on passage of the landmark is performed such that the vocal of the main track is subjected to reverb processing as it approaches the landmark L1.
- reproduction control such as changing the type of additional track to be added to the main track is performed between the first and second passages.
- FIG. 27 is a diagram showing an example of reproduction control when traveling at a specific point indicated by POI information.
- FIG. 27A shows reproduction control when the vehicle enters a specific intersection P11 indicated by the POI information and then proceeds straight.
- FIG. 27B shows reproduction control when the vehicle enters the intersection P11 and then turns right (or left).
- FIG. 28 is a diagram showing another example of reproduction control when traveling at a specific point indicated by POI information.
- points such as tunnels are points where location information from the GPS sensor is easily interrupted.
- point p1 which is the starting point of point P12 in FIG.
- a hatched point P12 indicates the range within the tunnel.
- Before the vehicle enters point P12, the time required to travel from the current travel position to point P12 is measured in advance, based on the position information acquired by the position information acquisition unit 311 and the vehicle body information acquired by the vehicle body information acquisition unit 312. Synchronized playback is performed according to the time remaining until point P12 is entered, so that even if the position information from the GPS sensor is interrupted at point p1, the synchronized sound can be output inside the car without skipping or the like.
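The time-based synchronization described above can be sketched as follows. The class name, method names, and the constant-speed estimate are assumptions made for illustration; the text says only that the time to the point is obtained in advance from position and vehicle body information.

```python
def seconds_until_entry(distance_to_entry_m, speed_mps):
    """Estimate travel time to the tunnel entrance from the last valid position fix."""
    if speed_mps <= 0:
        raise ValueError("speed must be positive to estimate the arrival time")
    return distance_to_entry_m / speed_mps


class TunnelSync:
    """Schedules the tunnel cue by clock time instead of by position (hypothetical)."""

    def __init__(self, last_fix_time_s, distance_to_entry_m, speed_mps):
        self.entry_time_s = last_fix_time_s + seconds_until_entry(
            distance_to_entry_m, speed_mps)

    def should_play_tunnel_cue(self, now_s):
        # No position lookup happens here, so a GPS dropout does not delay the cue.
        return now_s >= self.entry_time_s


# Last fix at t = 100 s, 200 m before the entrance, travelling at 20 m/s.
sync = TunnelSync(last_fix_time_s=100.0, distance_to_entry_m=200.0, speed_mps=20.0)
```

Because the decision depends only on the clock, playback stays in sync once the schedule is fixed, at the cost of ignoring speed changes after the last fix.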
- FIG. 29 is a diagram showing an example of reproduction control based on traffic conditions in the area in which the vehicle is running.
- FIG. 29 shows the traffic situation while driving in area A11.
- A of FIG. 29 shows a situation where no traffic jam occurs in area A11.
- B of FIG. 29 shows a situation where a traffic jam occurs in area A11.
- Traffic congestion is stressful for the user, and the user may feel bored if music data is simply output under the same playback control as in normal times even while congestion is occurring.
- Playback control may be performed such that information on areas where traffic congestion is occurring is acquired in advance from the traffic information server 212 or the like, and an alert sound is output when the distance to such an area falls within a predetermined distance. As a result, the user can know in advance that he or she will encounter traffic congestion, and can consider alternative routes.
- The content of music data playback control may be changed dynamically depending on road conditions and weather (maintained roads and bad roads, road conditions on sunny days and on rainy or snowy days, etc.). When detecting road conditions, various information about the road surface (dry, wet, snow, ice, gravel, coefficient of friction, etc.) may be acquired by the user terminal 22 or by sensors of the car 21.
- FIG. 30 is a diagram showing an example of reproduction control based on information such as speed information of a travel area.
- Area A11 shown in FIG. 30 is the same area as area A11 in FIG. For example, it is assumed that the area A11 in FIG. 30 and the adjacent area A12 have different speed limits.
- area A11 is an area including general roads with a speed limit of 40 km/h.
- Area A12 is an area including a highway with a speed limit of 80 km/h. Consider the case where, in both area A11 and area A12, the music generation parameter indicating the content of the playback control is set to "add an alert sound to the main track and play it when the speed of the car 21 reaches 40 km/h".
- The speed limit in area A12 is 80 km/h, so it is extremely likely that the vehicle will run faster than 40 km/h during normal driving. Therefore, if the above-mentioned music generation parameter indicating that "when the speed of the car 21 reaches 40 km/h, the alert sound is added to the main track and played" remains unchanged, the alert sound will be output to the user continuously.
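One way to avoid the continuous alert is to adapt the threshold to the area in which the vehicle is traveling. The sketch below assumes, purely for illustration, that the authored threshold is raised to the area's speed limit whenever the limit is higher; the text does not specify the adjustment rule, and the function names are hypothetical.

```python
def alert_threshold_kmh(authored_threshold_kmh, area_speed_limit_kmh):
    """Use the higher of the authored threshold and the area's speed limit (assumed rule)."""
    return max(authored_threshold_kmh, area_speed_limit_kmh)


def should_add_alert(speed_kmh, authored_threshold_kmh, area_speed_limit_kmh):
    """Decide whether the alert sound is added to the main track."""
    threshold = alert_threshold_kmh(authored_threshold_kmh, area_speed_limit_kmh)
    return speed_kmh >= threshold
```

With this rule, a car driving at 60 km/h triggers the alert in area A11 (limit 40 km/h) but not in area A12 (limit 80 km/h), even though the authored threshold is 40 km/h in both.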
- Specific examples of music data reproduction control in the music data reproduction system 13 have been described above. Specific examples of music data reproduction control are not limited to the cases of FIGS. 24 to 30.
- FIG. 31 is a flow chart when music data playback control is performed in the music data playback system 13, which is the configuration on the user side.
- the processing of each step is started after the music generation parameters sent from the information processing server 12 are acquired by the music control unit 301 .
- a plurality of music production parameters created by different producers are obtained.
- Acquisition of the music generation parameters is performed by the communication unit 79 of the user terminal 22, but instead, the acquisition may be performed by the communication unit mounted on the car 21 driven by the user.
- the music generation parameters are set in association with the trigger information and the music information.
- the trigger information is information that defines a trigger condition for executing playback control of music data, and is indicated by A in FIG. 2, for example.
- the music information includes information such as music parameter information 511A that indicates what kind of reproduction control is to be performed on music data.
- the music parameter information 511A is, for example, information regarding an algorithm indicating what kind of reproduction control is to be performed for music data linked to each area.
- In step S11, the position information acquisition unit 311 of the music control unit 301 acquires position information of the car 21 driven by the user based on various sensor data supplied from the sensor 74.
- In step S12, the vehicle body information acquisition unit 312 acquires vehicle body information of the car 21 based on various sensor data supplied from the sensor 74.
- The aforementioned event acquisition is performed through steps S11 and S12. That is, the position information acquisition unit 311 and the vehicle body information acquisition unit 312 function as an event acquisition unit that acquires events that occur on the user side.
- In step S13, the music generation parameter control unit 313 determines whether the position information of the car 21 acquired by the position information acquisition unit 311 and the vehicle body information of the car 21 acquired by the vehicle body information acquisition unit 312 satisfy the trigger information set in the music generation parameters acquired before the start of processing. That is, it is determined whether the acquired event satisfies the trigger information.
- If it is determined in step S13 that the acquired event satisfies the trigger information set in the music generation parameters acquired before the start of processing, the process proceeds to step S14.
- In step S14, the music generation parameter control unit 313 determines, based on the acquired event and the trigger information, whether the music generation parameters acquired before the start of processing need to be changed.
- If it is determined in step S14 that the music generation parameters acquired before the start of processing need to be changed, the process proceeds to step S15.
- Whether the music data reproduction system 13 changes the music generation parameters may be set in advance when the music generation parameters are set on the producer side.
- In step S15, the music generation parameter control unit 313 changes (controls) the music generation parameters acquired before the start of processing.
- An example in which the music generation parameters acquired before the start of processing are changed is the case described with reference to FIG. 30.
- In step S16, the music reproduction control unit 314 performs reproduction control of the music data based on the music generation parameters changed in step S15.
- If it is determined in step S14 that the music generation parameters do not need to be changed, the process of step S15 is skipped, and reproduction control of the music data is performed in step S16.
- In this case, the music generation parameters acquired before the start of processing are not changed in step S15.
- In step S16, reproduction control is then performed on the music data based on the music generation parameters acquired before the start of processing.
- In step S17, the output unit 401 outputs the reproduced sound of the music data whose reproduction is controlled to the vehicle interior space. If it is determined in step S13 that the trigger information set in the music generation parameters is not satisfied, the process proceeds to step S17, and output of the reproduced sound continues.
- In step S18, the music control unit 301 determines whether the user has finished driving. Whether driving has ended may be determined, for example, from whether the engine is ON or OFF. If it is determined in step S18 that driving has not ended, the processes from step S11 onward are repeated. If it is determined in step S18 that driving has ended, the process ends.
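The flow of steps S11 to S18 can be sketched as a loop over acquired events. This is a simplified illustration, not the described implementation: events are modeled as dictionaries, the three decision functions are passed in as callables, and the `engine_off` key and `alert_kmh` parameter are hypothetical names.

```python
def process_event(event, params, trigger_met, needs_change, change):
    """Steps S13 to S16 for one event.

    Returns the parameters to use from now on and whether reproduction
    control was applied for this event.
    """
    if not trigger_met(event, params):       # S13: trigger not satisfied
        return params, False                 # go straight to S17
    if needs_change(event, params):          # S14
        params = change(event, params)       # S15
    return params, True                      # S16


def run(events, params, trigger_met, needs_change, change):
    """Loop until driving ends; returns a log of (parameters, controlled) pairs."""
    log = []
    for event in events:                     # S11/S12: event acquisition
        params, controlled = process_event(
            event, params, trigger_met, needs_change, change)
        log.append((dict(params), controlled))   # S17: audio would be output here
        if event.get("engine_off", False):   # S18: driving finished
            break
    return log


events = [
    {"speed_kmh": 30, "limit_kmh": 40},
    {"speed_kmh": 45, "limit_kmh": 40},
    {"speed_kmh": 70, "limit_kmh": 80, "engine_off": True},
    {"speed_kmh": 90, "limit_kmh": 80},      # never processed: engine is off
]
log = run(
    events,
    {"alert_kmh": 40},
    trigger_met=lambda e, p: e["speed_kmh"] >= 40,
    needs_change=lambda e, p: e["limit_kmh"] > p["alert_kmh"],
    change=lambda e, p: {**p, "alert_kmh": e["limit_kmh"]},
)
```

Passing the decisions in as callables keeps the loop independent of any particular trigger or parameter format.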
- the process may be started and terminated based on the user's voice recognized by the microphone 73 provided in the user terminal 22.
- This is the case, for example, when the microphone 73 provided in the user terminal 22 or the speech recognition system of the car 21 recognizes wake words such as "I'm a little sleepy so play some music" and "I want to concentrate on driving so please stop playing music".
- It is also conceivable that a biosensor provided in a seat of the car 21 acquires biometric data such as the sweating state, heart rate, and electroencephalogram of the driver and passengers, and that the driver's degree of drowsiness, degree of tension, and the like are estimated from the biometric data and used to control the music generation parameters.
- For example, the music generation parameters may be changed so as to perform playback control such as adding music data that encourages wakefulness, or a voice that encourages rest, as an additional track for a driver who is becoming sleepy.
- the music generation parameter may be changed so as to perform playback control such that a voice encouraging driving is added as an additional track according to the user's degree of nervousness. In this way, this technology can also be applied to accessibility.
- FIG. 32 is a diagram showing an example of tracks to be subjected to reproduction control.
- playback is performed for tracks set using existing songs and tracks (sound source data) newly created by the producer and uploaded to the information processing server 12 as shown in A of FIG.
- a track obtained by performing sound source separation on an existing music piece may be used as an object of reproduction control.
- FIG. 33 is a diagram showing an example of playback control for tracks generated by sound source separation.
- the vehicle has started running from position P1, which is a position within a certain area. From the position P1 to the position P2, the above-described reproduction control is performed according to the vehicle body information and the like, targeting the existing music set for the area in which the vehicle is running. For example, playback control for existing music delivered by a streaming service is performed up to position P2.
- trigger information or the like is set to switch the music subject to playback control to a remix compatible music when approaching position P2 where a landmark such as Tokyo Tower is located.
- the user terminal 22 acquires the remix compatible music and performs sound source separation.
- Separation information including information indicating that the content (music) can be separated, for example, is associated with the remix compatible music.
- the music for which sound source separation is possible is acquired based on the separation information, and sound source separation is performed.
- FIG. 34 is a diagram showing types of sound sources.
- four types of sound source separation are possible: 2 sound sources, 4 sound sources, 6 sound sources, and 10 sound sources.
- For example, when sound source separation is performed for two sound sources, tracks for Vocal and Others are obtained, and when sound source separation is performed for four sound sources, tracks for Bass, Drums, Vocal, and Others are obtained.
- Information indicating the types of separable sound sources may be included in the separation information.
- playback control reflecting driving information is performed up to position P3 for each track acquired by sound source separation.
- When sound source separation into two sound sources is performed in the user terminal 22 and the respective Vocal and Others tracks are acquired, musical effects suited to the Vocal track are applied by playback control.
- The content of playback control is switched depending on the type of track obtained by sound source separation.
- The content of musical effects may be set for each track.
- For example, processing such as adding reverb is applied to the Vocal track while driving through a tunnel.
- A track obtained from the same music as the main track may be used, or a track obtained from another piece of music may be used.
- music tracks with similar parameters such as BPM and keys are added to the main track.
- FIG. 35 is a block diagram showing a configuration example of the music control unit 301 including the configuration for performing sound source separation as described above.
- the same components as those described with reference to FIG. 8 and the like are denoted by the same reference numerals. Duplicate explanations will be omitted as appropriate.
- The music control unit 301 shown in FIG. 35 includes a music acquisition unit 701, a music information acquisition unit 702, a sound source separation unit 703, a driving information acquisition unit 704, and a music reconstruction unit 705, in addition to the position information acquisition unit 311, the vehicle body information acquisition unit 312, the music generation parameter control unit 313, and the music reproduction control unit 314 described above.
- The position information acquisition unit 311 and the vehicle body information acquisition unit 312 are provided as functional units that configure the driving information acquisition unit 704.
- The music generation parameter control unit 313 is provided as a functional unit that configures the music reconstruction unit 705.
- the music acquisition unit 701 acquires music data, which is data of a music to be subjected to sound source separation.
- the music data may be obtained from a server that provides a streaming service, a server on the cloud such as the information processing server 12 , or may be obtained from within the user terminal 22 .
- the music data acquired by the music acquisition unit 701 is associated with music information, meta information, and separation information.
- Each piece of music data may be associated with at least one of the three types of information rather than all of the three types of information.
- Music information is information that indicates the attributes of a song, such as key, beat, and BPM.
- the music metadata information 511B described above corresponds to music information.
- the meta information includes information indicating the parts (sections) used for remixing, information indicating the remix method, and so on. At least one of the information constituting the music parameter information 511A described above may be included in the meta information.
- The separation information includes information indicating the types of sound sources, information indicating whether sound source separation is possible, and information indicating whether sound source separation has already been performed. The separation information need not include all three types of information; at least one of them may be included. In addition to the information described above, information such as sound source waveform data and musical score data may be included.
- Separation information including information indicating that sound source separation has been performed is associated with music data for which sound source separation has been performed in the user terminal 22 or the information processing server 12 used by a certain user. If the data of each track obtained by sound source separation is stored in the server that performed the sound source separation, information indicating the storage location of the data of each track may be included in the separation information.
- In this case, the user terminal 22 does not perform sound source separation, and the data of each track is acquired based on the information indicating the storage location. Reproduction control is then performed on the acquired data of each track. This eliminates the need to perform sound source separation each time, making it possible to reduce the processing load on the user terminal 22.
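The reuse of previously separated tracks can be sketched as a small lookup that separates locally only when no stored result is available. The dictionary keys (`separable`, `separated`, `storage_location`) and the two callables are hypothetical names chosen for this illustration.

```python
def get_tracks(song_id, separation_info, fetch_tracks, separate):
    """Return per-source track data, separating locally only when needed.

    separation_info: dict with optional keys "separable", "separated",
        and "storage_location" (assumed layout).
    fetch_tracks: callable that loads stored tracks from a storage location.
    separate: callable that performs sound source separation for a song.
    """
    location = separation_info.get("storage_location")
    if separation_info.get("separated") and location:
        # Already separated elsewhere: reuse the stored tracks.
        return fetch_tracks(location)
    if not separation_info.get("separable", False):
        raise ValueError(f"{song_id} is not marked as separable")
    return separate(song_id)
```

The stored-result branch comes first so that the costly separation step runs only as a fallback, which is the load reduction the text describes.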
- a playlist selected by the user is referred to, and sound source separation is performed before playback of each song is started.
- the playlist includes information indicating the playback order of each piece of music.
- the music data acquired by the music acquisition unit 701 is supplied to the music information acquisition unit 702 and the sound source separation unit 703 .
- the music information acquisition unit 702 acquires music information by analyzing the music data supplied from the music acquisition unit 701 or by referring to music information linked to the music data.
- The music information may include, for example, the genre of the music, the chord progression, structural information such as A melody/B melody, score information such as MIDI data, and information indicating BPM.
- Acquisition of music information by the music information acquisition unit 702 is performed by analyzing track data supplied from the sound source separation unit 703 as appropriate. For example, by analyzing a vocal track, melody information is acquired as music information.
- the music information acquired by the music information acquisition unit 702 is supplied to the music reconstruction unit 705 .
- the sound source separation unit 703 performs sound source separation on the music data supplied from the music acquisition unit 701, and acquires data of a predetermined number of tracks such as 2 sound sources, 4 sound sources, 6 sound sources, and 10 sound sources.
- the number of sound sources separated by the sound source separation unit 703 may be automatically set by the music control unit 301, for example, or may be set by the user as appropriate.
- the sound source separation by the sound source separation unit 703 may be performed by analyzing the signal, or may be performed using an inference model generated by machine learning. In the latter case, the sound source separation unit 703 prepares an inference model that receives music data and outputs track data of each sound source. The track data of each sound source acquired by the sound source separation unit 703 is supplied to the music information acquisition unit 702 and the music reconstruction unit 705 .
- the inference model used for sound source separation may be re-learned to improve the performance of sound source separation.
- the driving information acquisition section 704 is configured to include a position information acquisition section 311 and a vehicle body information acquisition section 312 .
- a position information acquisition unit 311 of the driving information acquisition unit 704 acquires position information of the car 21 driven by the user based on data supplied from a positioning sensor or the like built into the user terminal 22 as the sensor 74 .
- the vehicle body information acquisition section 312 of the driving information acquisition section 704 acquires the vehicle body information of the car 21 driven by the user based on the sensor data supplied from the sensor 74 .
- the vehicle body information is information indicating the running state of the vehicle 21, such as starting/stopping, accelerating/decelerating, turning left or right, turning the steering wheel on a curve, and the like.
- the position information acquired by the position information acquisition unit 311 and the vehicle body information acquired by the vehicle body information acquisition unit 312 are supplied to the music reconstruction unit 705 together with other information as driving information, as shown in FIG. 36C.
- map information and external information are shown as information constituting driving information.
- External information is information including POI information and area information included in the trigger information 512 .
- the music generation parameter control unit 313 of the music reconstruction unit 705 performs reproduction control that reflects the position information and vehicle body information included in the driving information supplied from the driving information acquisition unit 704 based on the music generation parameters.
- the music composed of arbitrary tracks separated by the sound source separation unit 703 is remixed.
- the music generation parameters indicating the music remixed by the music reconstruction unit 705 are supplied to the music reproduction control unit 314 together with the data of each track.
- the music reproduction control unit 314 reproduces each track based on the music generation parameters indicating the remixed music supplied from the music reconstruction unit 705 .
- a reproduced sound based on the music generation parameters is output from a speaker that constitutes the output unit 401 (FIG. 8), such as the speaker 77 built in the user terminal 22 .
- the information processing server 12 which is a server on the cloud.
- FIG. 37 is a diagram showing a concrete example of remixing an existing song.
- an existing song with a stereo sound source is acquired as a target for sound source separation.
- the existing music to be subjected to sound source separation is analyzed by the music information acquisition unit 702, and section information such as BPM, A melody/B melody, etc. is acquired.
- sound source separation is performed by the sound source separation unit 703, and the Vocal track and the Others track are obtained.
- main melody information main melody
- Based on the main melody information of the Vocal track, the music reconstruction unit 705 (music generation parameter control unit 313) sets music generation parameters indicating the chord progression, instrumentation, and performance information, and the music is remixed using those music generation parameters.
- the performance information is information indicating how to play each musical instrument, tone color, and the like. For example, music generation parameters are generated that indicate a chord progression that matches the main melody information of the vocal track.
- Driving information is also used as appropriate to generate music generation parameters.
- a piece of music consisting of a Vocal track and an Other track is generated as a piece of music after remixing.
- the Other track is, for example, a track used for harmonizing the Vocal track.
- the entire music piece may be remixed, or a partial remix such as the chorus may be performed.
- Other tracks may be prepared in advance by the producer, or may be automatically generated based on information obtained from the system (information processing server 12).
- FIG. 38 is a diagram showing an example of the process of joining existing songs.
- the processing for switching the music to be played back from music A to music B will be described. It is assumed that the music B is set as the music to be reproduced after the music A according to a playlist or a list of music prepared in advance.
- the playback order of songs may be recommended using local data of the user terminal 22 or a DB on the cloud. Song recommendations are based on key, BPM, artist name, genre, and more.
- Music A and music B are existing music including Vocal tracks and Other tracks, respectively, as shown in FIG.
- Vocal track T A1 and Other track T A2 are obtained by sound source separation for music A
- Vocal track T B1 and Other track T B2 are obtained by sound source separation for music B.
- The trigger information or the like is set such that, when the vehicle approaches position P11 where a landmark is located, the music subject to reproduction control is switched to the remix compatible music.
- The Vocal track T A1 of music A continues to be reproduced, and, in place of the Other track T A2 of music A, playback of the Other track T B2 of music B is started.
- Reproduction control targeting the Vocal track T A1 of music A and the Other track T B2 of music B is performed.
- After that, the Vocal track T B1 of music B is used as the object of reproduction control together with the Other track T B2 of music B.
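The handover from music A to music B can be summarized as three phases, each with its own pair of tracks. The phase names and the function below are hypothetical; only the track combinations are taken from the description.

```python
# (vocal track, other track) for each phase of the A-to-B splice.
SPLICE_PHASES = {
    "before": ("T_A1", "T_A2"),   # music A only
    "splice": ("T_A1", "T_B2"),   # vocal of A over the accompaniment of B
    "after": ("T_B1", "T_B2"),    # music B only
}


def tracks_for_phase(phase):
    """Return the (vocal, other) tracks subject to reproduction control."""
    if phase not in SPLICE_PHASES:
        raise ValueError(f"unknown phase: {phase}")
    return SPLICE_PHASES[phase]
```

Only one of the two tracks changes at each phase boundary, so some element of the music already playing carries through every transition.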
- Information indicating a section in which playback control is performed for tracks of different songs may be included as meta information (B in FIG. 36) as information on a location suitable for splicing.
- Information on the start position and end position of each section, such as the intro, A melody, and B melody, is set in the meta information as information on parts suitable for splicing.
- the information about the parts suitable for splicing is set, for example, by the producer or by analyzing the music.
- a predetermined limit may be set for a track that has restrictions on the playback location and remix method, and optimization at the time of splicing may be performed.
- remixing may be performed after signal processing such as time stretching. Also, instead of using all the constituent elements of a specific sound source obtained by sound source separation for remixing, only some constituent elements such as tone color may be used for remixing.
- FIG. 40 is a diagram showing another example of joining processing of existing songs.
- the speed follow-up mode is a mode in which a piece of music with a BPM corresponding to the running speed is played.
- reproduction control is performed according to vehicle body information, etc., for a 120 BPM song set for the area in which the vehicle is traveling. Instead of the entire piece of music, a section of a piece of music may be played back.
- the music of 128 BPM is used as the target of reproduction control
- the music of 124 BPM is used as the target of reproduction control.
- the trigger information or the like is set to switch the music subject to playback control to the remix compatible music when the position P24 is approached.
- the music of 124 BPM is maintained as the object of reproduction control.
- the music of 124 BPM is used as the target of reproduction control, as in the immediately preceding section.
- the target of playback control is switched to the specific sound source track of the 124 BPM music obtained by sound source separation.
- the music is remixed by adding a sound source track of another music with the same BPM (124 BPM).
- In remixing, for example, if the running speed is faster than a threshold speed, a track of a song whose BPM is higher than a threshold is selected, and if the running speed is slower than the threshold speed, a track of a song whose BPM is lower than the threshold is selected.
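The speed-dependent track selection can be sketched as follows. The function name, the candidate format, and the handling of a speed exactly equal to the threshold (treated here as slow) are assumptions; the text specifies only the faster and slower cases.

```python
def select_track(candidates, speed_kmh, threshold_speed_kmh, threshold_bpm):
    """Pick the first candidate whose BPM matches the current speed range.

    candidates: list of (track_name, bpm) tuples.
    Returns the track name, or None if no candidate qualifies.
    """
    want_fast = speed_kmh > threshold_speed_kmh
    for name, bpm in candidates:
        if want_fast and bpm > threshold_bpm:
            return name
        if not want_fast and bpm < threshold_bpm:
            return name
    return None


candidates = [("calm_pad", 120), ("drive_beat", 128)]
```

Here `select_track(candidates, 70, 60, 124)` picks the 128 BPM track, while a speed of 50 km/h picks the 120 BPM track.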
- FIG. 41 is a diagram showing an example of a production workflow when sound source separation is used.
- sound source separation of existing music is performed on the cloud side (information processing server 12 side).
- the producer operates the producer terminal 11 and uploads the existing music as indicated by arrow #1.
- settings such as meta information are made for each track as indicated by arrow #3.
- a downward pointing triangle in FIG. 41 indicates the setting position of the meta information.
- Trigger information defining trigger conditions for executing playback control is set as meta information as appropriate.
- FIG. 42 is a diagram showing a configuration example of the information processing server 12 when sound source separation is performed on the cloud side.
- the music control unit 301 is implemented in the information processing server 12 as shown in FIG. Not all functional units of the music control unit 301 are implemented, but some functional units may be implemented in the information processing server 12 .
- the same components as those described with reference to FIG. 35 are given the same reference numerals. Duplicate explanations will be omitted as appropriate.
- music data uploaded as a target for sound source separation is acquired by the music acquisition unit 701 and output to the music information acquisition unit 702 and the sound source separation unit 703 .
- the driving information acquisition unit 704 acquires vehicle body information based on sensor data transmitted from the vehicle 21 .
- In the music reproduction control unit 314, reproduction control based on the music generation parameters supplied from the music reconstruction unit 705 is performed for the data of each track.
- Data of the reproduced sound is transmitted to the user terminal 22 and output from the speaker in the car 21 .
- Playback control according to the state of a recognized object
- An object appearing in an image captured by the imaging unit 72, which is the camera of the user terminal 22, may be recognized, and playback control may be performed by changing the music generation parameters according to the position of the recognized object in the image. Because the music generation parameters change dynamically so as to follow changes in the state of the object in the image, such as its position in the image, the music output to the user itself changes dynamically in real time.
- In addition to images captured by the imaging unit 72, a depth map obtained from a depth sensor or thermography, for example, may be used to change the music generation parameters for playback control.
- FIG. 43 is a diagram showing an example of object recognition.
- When cars C1 and C2 appear in an image taken while driving, as shown in A of FIG. 43, the cars C1 and C2 are recognized, as indicated by the frames in B of FIG.
- the automobile C1 is photographed at a lower left position from the center of the image, and the automobile C2 is photographed at a slightly right position from the center.
- the user terminal 22 is prepared with a recognition model corresponding to the type of object to be recognized.
- FIG. 44 is a diagram showing an example of music generation.
- the state of the landscape shown in the upper part of FIG. 44 is the same as the state described with reference to FIG.
- the automobile C1 is shown at a lower left position from the center of the image
- the automobile C2 is shown at a slightly right position from the center.
- Music generation parameters indicating the scale of the melody are set based on the positions of the cars C1 and C2 recognized by object recognition, and music based on the set music generation parameters is played. For example, when the car C1 and the car C2 are recognized while music linked to the area being traveled is playing, the music generation parameters are set based on the respective positions of the car C1 and the car C2, and the music continues to play with sounds added according to the set music generation parameters.
- When the positions of the cars C1 and C2 change, the music generation parameters are set (changed) based on the changed positions, and music varied using the set music generation parameters is played.
- the different arrangement of notes shown in the upper and lower stages of FIG. 44 indicates that different music is being reproduced.
- the user can drive while enjoying the changes in the music.
- the user can enjoy ever-changing music because the music generation parameters are set so as to reflect the movement of the car or object that the user is driving.
- FIG. 45 is a diagram showing another example of object recognition.
- the scenery shown in FIG. 45 is the scenery photographed when the car driven by the user is approaching the intersection.
- the recognition model used for object recognition is switched to the model for person recognition.
- the person H1 and the person H2 are shown in the photographed image as shown in FIG. 45A
- the person H1 and the person H2 are recognized as shown in a frame in FIG. 45B.
- the person H1 and the person H2 are shown at positions to the right of the center of the image.
- music generation parameters are set based on the respective positions of person H1 and person H2, and music based on the set music generation parameters is played.
- a car is also shown in the image shown in FIG. 46, but the car is not recognized, and the music generation parameters are set based on the position of the person to be recognized.
- a car may be recognized along with a person, and music generation parameters may be set based on the positions of the person and the car.
- The type of object reflected in the music generation parameters is switched according to the user's movement state, such as whether the vehicle is running or stopped, or according to the current position.
- the type of object to be reflected in the settings of the music generation parameters can be switched so that an object that is easy for the user to visually recognize as moving is targeted for recognition.
- An object that a user can easily visually recognize as moving while driving is an automobile, and an object that is easily visually recognized by a user at an intersection is a person.
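The switching of the recognition target described above can be sketched as a small selection function. This is only an illustration under stated assumptions: the function name and the string labels are hypothetical, and the choice made for a vehicle that is stopped away from an intersection is not covered by the text.

```python
def recognition_target(is_running: bool, at_intersection: bool) -> str:
    """Return the object type whose recognition model should be active."""
    if at_intersection:
        # At an intersection, people are the objects most easily seen moving.
        return "person"
    if is_running:
        # While driving, other automobiles are the objects most easily seen moving.
        return "car"
    # Stopped away from an intersection: not specified in the text (assumption).
    return "person"


print(recognition_target(is_running=True, at_intersection=False))
```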
- FIG. 47 is a diagram showing an image of music generation according to the position of an object.
- The image is of a keyboard virtually arranged in front of a rectangle F indicating the imaging range of the user terminal 22, with the keys of the virtual keyboard being pressed according to the position of an object. For example, if the object is on the left side of the image, the music generation parameters are set so that a low melody sound is added to the main music. When the object appears on the right side of the image, the music generation parameters are set so that a high melody sound is added to the main music.
- Because the type of object to be recognized is switched and the music changes according to the movement (change in position) of the object, the user can visually understand which object's movement a change in the music is linked to.
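The virtual keyboard image can be sketched as a mapping from the horizontal position of an object to a pitch. The sketch below is an assumption-laden illustration: the use of MIDI note numbers, the note range 48 to 84, and the linear mapping are not stated in the text, which only says that objects on the left add low sounds and objects on the right add high sounds.

```python
def position_to_midi_note(x, image_width, low_note=48, high_note=84):
    """Map a horizontal pixel position to a note: left is low, right is high."""
    if image_width <= 0:
        raise ValueError("image_width must be positive")
    # Clamp so that objects partly outside the frame still map to a valid key.
    ratio = min(max(x / image_width, 0.0), 1.0)
    return low_note + round(ratio * (high_note - low_note))


# An object at the left edge, the centre, and the right edge of a 1280-pixel-wide image.
print([position_to_midi_note(x, 1280) for x in (0, 640, 1280)])
```

A linear mapping keeps the relationship between motion and pitch easy to perceive, which matches the stated aim that the user can tell which object drives a change in the music.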
- a music generation parameter is associated with each type of object, and the music generation parameter associated with a recognized object may change according to the state of the object.
- a rhythm music generation parameter is associated with an automobile as a recognition target
- a melody music generation parameter is associated with a person.
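The association between object types and kinds of music generation parameter can be sketched as a lookup table. The dictionary below follows the pairing given in the text (automobile with rhythm, person with melody); the function name, the tuple format of a detection, and the use of the normalized horizontal position as the parameter value are assumptions made for illustration.

```python
# Pairing taken from the text: cars drive rhythm, people drive melody.
PARAMETER_BY_OBJECT_TYPE = {"car": "rhythm", "person": "melody"}


def parameters_from_detections(detections, image_width):
    """Group normalized x positions by the parameter each object type controls.

    detections: iterable of (object_type, x_center_in_pixels) pairs (hypothetical format).
    """
    if image_width <= 0:
        raise ValueError("image_width must be positive")
    values = {}
    for object_type, x_center in detections:
        parameter = PARAMETER_BY_OBJECT_TYPE.get(object_type)
        if parameter is None:
            continue  # object types without an associated parameter are ignored
        values.setdefault(parameter, []).append(x_center / image_width)
    return values


print(parameters_from_detections([("car", 100), ("person", 300)], image_width=400))
```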
- Brightness of captured image
- Playback control may be performed such that the music is changed based on the brightness of the image displayed on the display 75.
- This function is used, for example, when the user displays an image (moving image) of scenery or the like on the display 75 .
- the image displayed on the display 75 may be an image captured by the imaging unit 72 or may be an image captured by another imaging device and imported into the user terminal 22 . Images taken by the camera of the user terminal 22 attached to the dashboard of the car may be used for playing music.
- FIG. 48 is a diagram showing a display example of the user terminal 22 during music playback.
- FIG. 48 shows the user terminal 22 displaying an image of a beach scene.
- image analysis is performed on each frame of the moving image, and the music output to the user changes based on the detected brightness.
- Music variation is achieved by varying the music generation parameters based on luminance.
- the brightness of the image shown in FIG. 48 changes according to changes in wave conditions and the like.
- the music being played will change in conjunction with changes in the state of the waves.
- the luminance detection positions are set at a plurality of positions within the frame as indicated by crosses in FIG. In the example of FIG. 49, five detection positions in the vertical direction and seven detection positions in the horizontal direction are arranged at regular intervals. For example, the brightness of pixels at each detection position is detected, and music is reproduced based on each brightness.
- The luminance at one detection position may be detected from a single pixel or from a plurality of pixels. The luminance detection positions may be fixed, or their positions and number may be changed by the user. They may also change dynamically according to the type of content.
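Sampling luminance at a regular grid of detection positions can be sketched as follows. The grid of five rows and seven columns follows the example in the text; everything else is an assumption for illustration, including the frame representation (a list of rows of RGB tuples), the placement of each position at the centre of its grid cell, and the Rec. 601 luma weights, since the text does not say how luminance is computed.

```python
def detection_positions(width, height, cols=7, rows=5):
    """Evenly spaced (x, y) positions, one at the centre of each grid cell."""
    return [
        (int((c + 0.5) * width / cols), int((r + 0.5) * height / rows))
        for r in range(rows)
        for c in range(cols)
    ]


def luminance(pixel):
    """Luma of one RGB pixel using Rec. 601 weights (an assumed definition)."""
    r, g, b = pixel
    return 0.299 * r + 0.587 * g + 0.114 * b


def sample_luminance(frame, cols=7, rows=5):
    """Return one luminance value per detection position for a single frame."""
    height, width = len(frame), len(frame[0])
    return [luminance(frame[y][x]) for x, y in detection_positions(width, height, cols, rows)]


# A small uniform grey frame: 50 rows of 70 pixels.
frame = [[(128, 128, 128)] * 70 for _ in range(50)]
print(len(sample_luminance(frame)))
```

Averaging a small neighbourhood around each position instead of reading one pixel would correspond to the multi-pixel variant mentioned in the text.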
- the effect image P in FIG. 48 displayed superimposed on the landscape image indicates the respective detection positions.
- a pale-colored circular image with a predetermined degree of transparency is displayed as the effect image P.
- Five effect images P are arranged in the vertical direction and seven in the horizontal direction, and displayed at equal intervals.
- the size of the effect image P changes according to the brightness of each detection position, as shown in FIG. Since the brightness of each detection position changes according to the change of the wave state, the size of the effect image P displayed at each detection position changes in conjunction with the change of the wave state.
- the color of the effect image P may change according to the luminance, or the shape of the effect image P may change according to the luminance. In this way, it is possible to change the state of various components of the effect image according to the luminance.
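The link between luminance and the appearance of the effect image P can be sketched as a simple mapping from luminance to circle radius. The direction of the mapping (brighter means larger), the linear form, and the radius range are assumptions; the text only states that the size changes according to the brightness.

```python
def effect_radius(luma, min_radius=4.0, max_radius=24.0):
    """Radius of the circular effect image for a luminance in the range 0 to 255."""
    ratio = min(max(luma / 255.0, 0.0), 1.0)  # clamp out-of-range input
    return min_radius + ratio * (max_radius - min_radius)


print([effect_radius(v) for v in (0, 255)])
```

The same pattern would apply if colour or shape were varied instead of size: compute a clamped ratio from the luminance, then interpolate the chosen attribute.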
- the luminance detection position may be set at only one position instead of being set at a plurality of positions.
- mapping may be performed to set the detection position to an arbitrary position based on the analysis results of the image.
- the number and locations of detection positions change according to the content of the image.
- FIG. 51 is a diagram showing an example of music generation.
- FIG. 51 shows an example of music generation based on the luminance at detection positions P1 to P3, which are some of the detection positions.
- Detection positions P1 to P3 are detection positions set at the upper left, center, and lower right of the image, respectively.
- a scale sample A which is music of a predetermined length, is commonly set for each detection position.
- a music generation parameter specifying which note of the scale to start playing changes with brightness.
- Playback is designated to start from the note indicated by ellipse #1 according to the brightness at detection position P1, and from the note indicated by ellipse #2 according to the brightness at detection position P2. Likewise, playback is designated to start from the note indicated by ellipse #3 according to the brightness at detection position P3.
- The sounds of scale sample A, each played from the note specified by the corresponding music generation parameter, are synthesized and output.
- Music generation as described above can be pictured as a musical instrument virtually placed at each detection position on the image, with sound output from the plurality of virtually placed instruments.
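The brightness-dependent start note can be sketched as follows. This is an illustration under several assumptions: the contents of scale sample A (a C major scale as MIDI note numbers), the linear mapping from luminance to start index, the wrap-around when the sample is played from a later note, and the representation of the synthesized output as tuples of simultaneous notes are not taken from the text.

```python
# Hypothetical stand-in for scale sample A: one octave of C major as MIDI notes.
SCALE_SAMPLE_A = [60, 62, 64, 65, 67, 69, 71, 72]


def start_index(luma, scale_length):
    """Choose which note of the scale to start from, based on luminance (0 to 255)."""
    ratio = min(max(luma / 255.0, 0.0), 1.0)
    return min(int(ratio * scale_length), scale_length - 1)


def voices_for_positions(lumas, scale=SCALE_SAMPLE_A):
    """One voice per detection position: the sample rotated to its start note."""
    voices = []
    for luma in lumas:
        i = start_index(luma, len(scale))
        voices.append(scale[i:] + scale[:i])  # wrap-around is an assumption
    return voices


def mix(voices):
    """Combine the voices step by step into tuples of notes that sound together."""
    return list(zip(*voices))


# Luminance at three detection positions such as P1, P2, and P3.
print(mix(voices_for_positions([0, 128, 255]))[0])
```

Each detection position behaves like one of the virtually placed instruments: all play the same sample, and only the start offset differs.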
- The work on the producer terminal 11 may be performed using a three-dimensional simulation tool.
- FIG. 52 is a diagram showing a display example of a map used for area setting.
- a three-dimensional map such as that shown in FIG. 52 may be displayed on the setting screen such as that shown in FIG. 10 and used to set the area.
- a three-dimensional map is displayed, for example, by a three-dimensional simulation tool.
- the producer can set points that trigger remixing of music by specifying positions P101 and P102, which are positions on the three-dimensional map.
- This technology can also be applied to music playback on moving bodies other than automobiles, such as trains and bicycles. Further, the playback control as described above may be performed while the user is walking.
- An old sound source such as a tape sound source may be used as a target for sound source separation. There is a possibility that the sound quality will improve, such as by eliminating noise during recording.
- Whether tracks are compatible when used in combination during remixing may be set for each track.
- Tracks with clearly distinguishable parts and tracks obtained from songs with a standard band composition are compatible with each other.
- Tracks obtained from songs with uneven parts and tracks obtained from songs performed on folk instruments are incompatible with each other.
- the quality of sound source separation may be evaluated by quantitative evaluation or qualitative evaluation. Quality evaluation may be performed based on user reviews, such as by statistically analyzing comments.
- the track data may be uploaded to the information processing server 12 together with the existing music data as correct data for sound source separation.
- the production workflow is provided with a mechanism for collecting learning data used for learning an inference model for sound source separation.
- the track data of each sound source obtained by sound source separation that has already been performed may be stored in the information processing server 12 .
- the user terminal 22 requests the information processing server 12 for track data after sound source separation, and the track data transmitted from the information processing server 12 in response to the request is used for reproduction control.
- music data may be set for each area set in the Metaverse space, and music data playback control may be performed when the area is switched. Further, when there is an acquaintance of the user in the area, reproduction control may be performed such that the notification sound is additionally reproduced as an additional track to the main track.
- The present technology can also be configured as follows.
- (1) An information processing apparatus including: a content information setting unit that sets content information including content parameter information defining the details of content playback control; a trigger information setting unit that sets trigger information defining conditions for executing playback control of the content; and a content generation parameter setting unit that sets content generation parameters by linking the content information and the trigger information.
- (2) The information processing apparatus according to (1), further including a display unit that displays a screen for setting the content generation parameters.
- (3) The information processing apparatus according to (1) or (2), wherein the content information further includes metadata of the content, and the metadata is information including at least one of the key, beat, BPM, chord, playback time, tonality, and instrument type of the content and, if the content is object audio, position data of the content.
- (4) The information processing apparatus according to any one of (1) to (3), wherein the content information further includes information for associating first content data constituting the content with information relating to at least one piece of second content data added to the first content data.
- (5) The information processing apparatus according to (4), wherein the first content data and the second content data are loop materials having a predetermined number of bars and a predetermined playback time, and the details of playback control are defined by the content generation parameters.
- (6) The first content data is composed of data selected from data prepared in advance, or is composed using existing music.
- (7) The information processing apparatus according to any one of (1) to (6), wherein the trigger information is information including at least one of POI information, landmark information, area information, information relating to a moving body boarded by the user, and information on the user himself or herself.
- (8) The information processing apparatus according to (7), wherein the POI information is set as information indicating the start and end points of the POI, the landmark information is set as information indicating the start point and end point of the landmark, the area information is set as information indicating the start and end of the area, and the information about the moving body is set as information indicating start/stop, stop time, right/left turn start/end, and speed of the moving body.
- (9) The information processing apparatus according to (7) or (8), wherein at least one of the POI information, the landmark information, and the area information has a unique parameter.
- (10) The information processing apparatus according to any one of (1) to (9), wherein the content is track data of a specific sound source obtained by sound source separation targeting existing music.
- An information processing method in which an information processing device: sets content information including content parameter information that defines the details of content playback control; sets trigger information defining conditions for executing playback control of the content; and sets a content generation parameter by linking the content information and the trigger information.
- The event acquisition unit acquires at least one of the position of the user, information indicating the state of a moving body boarded by the user, and information on the user himself or herself.
- The trigger information is at least one of POI information, landmark information, area information, and information about a moving body boarded by the user.
- The separation information includes, together with information indicating whether or not sound source separation is possible for a song, at least one of information indicating the type of sound source and information indicating whether or not sound source separation has been performed.
- An information processing method in which an information processing device: acquires an event occurring while the user is moving; acquires a content generation parameter set by linking content information, which includes content parameter information defining the details of content playback control, with trigger information defining conditions for executing the content playback control; and, when an event that satisfies the condition defined by the trigger information is acquired, controls playback of the content based on the content information associated with the trigger information.
- A program for causing a computer to execute processing of: acquiring an event occurring while the user is moving; acquiring a content generation parameter set by linking content information, which includes content parameter information defining the details of content playback control, with trigger information defining conditions for executing the content playback control; and, when an event that satisfies the condition defined by the trigger information is acquired, controlling playback of the content based on the content information associated with the trigger information.
- An information processing system including: a first information processing device including a content information setting unit that sets content information including content parameter information that defines the details of content playback control, a trigger information setting unit that sets trigger information defining conditions for executing playback control of the content, and a content generation parameter setting unit that sets content generation parameters by linking the content information and the trigger information; and a second information processing device including an event acquisition unit that acquires an event occurring while the user is moving, an acquisition unit that acquires the content generation parameters, and a content playback control unit that controls playback of the content based on the content information associated with the trigger information when an event that satisfies the condition defined by the trigger information is acquired.
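The relationship among content information, trigger information, and content generation parameters in the configurations above can be sketched with a few data classes. This is a minimal sketch, not the structure used in the application: all class, field, and function names are hypothetical, the event is assumed to be a plain dictionary, and the trigger condition is assumed to be a predicate over that dictionary.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class ContentInfo:
    content_id: str
    parameters: Dict[str, Any]  # content parameter information (details of playback control)
    metadata: Dict[str, Any] = field(default_factory=dict)  # e.g. key, BPM, instrument type


@dataclass
class TriggerInfo:
    kind: str  # e.g. "poi", "landmark", "area", "moving_body", "user"
    condition: Callable[[Dict[str, Any]], bool]  # condition for executing playback control


@dataclass
class ContentGenerationParameter:
    content: ContentInfo
    trigger: TriggerInfo


def link(content: ContentInfo, trigger: TriggerInfo) -> ContentGenerationParameter:
    """Set a content generation parameter by linking content and trigger information."""
    return ContentGenerationParameter(content=content, trigger=trigger)


def contents_to_control(event: Dict[str, Any],
                        parameters: List[ContentGenerationParameter]) -> List[ContentInfo]:
    """Return the content information whose trigger condition the event satisfies."""
    return [p.content for p in parameters if p.trigger.condition(event)]


# Producer side: lower the volume of the main track while the moving body is stopped.
duck = ContentInfo("main_track", {"effect": "volume_change", "gain": 0.3}, {"bpm": 120})
stopped = TriggerInfo("moving_body", lambda e: e.get("state") == "stopped")
parameters = [link(duck, stopped)]

# User side: an event acquired while the user is moving.
print([c.content_id for c in contents_to_control({"state": "stopped"}, parameters)])
```

Separating the setting side (`link`) from the playback side (`contents_to_control`) mirrors the split between the first and second information processing devices in the system configuration.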
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- General Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Human Computer Interaction (AREA)
- Computer Networks & Wireless Communication (AREA)
- Data Mining & Analysis (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Acoustics & Sound (AREA)
- Management Or Editing Of Information On Record Carriers (AREA)
- Stored Programmes (AREA)
- Telephone Function (AREA)
- Information Transfer Between Computers (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Abstract
Description
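1. Overview of the present technology
2. Example of system configuration
3. Processing in the producer terminal
4. Examples of music data playback control on the user side
5. Modifications
First, an overview of the present technology will be described.
1. Apply reverb processing to the vocals as the user approaches Tokyo Tower
2. Add a synthesizer sound to the content being played when a right or left turn is made at a specific intersection
3. Lower the volume of the content being played when stopped, and gradually raise the volume after starting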
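The third example above (lowering the volume while stopped and raising it gradually after starting) can be sketched as a volume ramp. The lowered level, the ramp duration, and the linear shape of the ramp are assumptions chosen for illustration; the text only states that the volume is lowered when stopped and raised gradually after starting.

```python
def volume_after_start(seconds_since_start, ducked=0.3, normal=1.0, ramp_seconds=5.0):
    """Volume (0.0 to 1.0) of the playing content after the vehicle starts moving."""
    if seconds_since_start <= 0:
        return ducked  # still stopped, or just started: keep the lowered volume
    ratio = min(seconds_since_start / ramp_seconds, 1.0)
    return ducked + ratio * (normal - ducked)


print([round(volume_after_start(t), 2) for t in (0, 2.5, 5, 10)])
```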
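FIG. 1 is a diagram showing an example of playback control of music data on the user side.
FIG. 4 is a diagram showing an example of the components of music data.
FIG. 5 is a diagram showing a configuration example of an information processing system 1 according to an embodiment of the present technology.
FIG. 6 is a block diagram showing a hardware configuration example of the producer terminal 11.
FIG. 7 is a block diagram showing a hardware configuration example of the user terminal 22 constituting the music data playback system 13.
FIG. 8 is a block diagram showing a functional configuration example of the information processing system 1.
・Example of the setting screen
Next, a series of operations that the producer performs on the screen will be described with reference to FIGS. 10 to 22. As described above, the series of operations that the producer performs on the setting screen is the operation of setting music generation parameters, for example through a GUI, by linking trigger information, which defines the conditions that trigger execution of music data playback control, with an algorithm that defines various details of the playback control. The setting screen may be displayed on a display constituting the output unit 57 of the producer terminal 11, or may be displayed on an external display connected to the producer terminal 11.
1. The type of additional track that is additionally played when the car driven by the user travels around the landmark L1 (select Guidance under CONTENTS TYPE)
2. Processing based on the distance between the car 21 driven by the user and the landmark L1 (select Distance under PLAYBACK SETTING)
3. Setting a volume change as the effect processing (select Volume Change under EFFECT SETTING)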
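The three settings listed above can be sketched as a configuration plus a distance-based volume function. The setting names and values (CONTENTS TYPE, PLAYBACK SETTING, EFFECT SETTING) come from the text; the dictionary layout, the function name, the 500 m radius, and the linear relationship between distance and volume are assumptions for illustration only.

```python
# Settings chosen on the setting screen for landmark L1 (layout is hypothetical).
landmark_setting = {
    "CONTENTS TYPE": "Guidance",
    "PLAYBACK SETTING": "Distance",
    "EFFECT SETTING": "Volume Change",
}


def guidance_volume(distance_m, radius_m=500.0):
    """Volume of the additional track: louder as the car gets closer to the landmark."""
    if radius_m <= 0:
        raise ValueError("radius_m must be positive")
    # Full volume at the landmark, silent at or beyond the radius.
    return min(max(1.0 - distance_m / radius_m, 0.0), 1.0)


if landmark_setting["EFFECT SETTING"] == "Volume Change":
    print([guidance_volume(d) for d in (0.0, 250.0, 500.0)])
```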
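FIG. 22 is a flowchart of the series of operations that the producer performs using the setting screen. The processing of each step shown in FIG. 22 is performed by the respective units of the music generation parameter setting unit 101 in accordance with the producer's operations (FIG. 8).
The kind of music data playback control performed in the music data playback system 13, which is the user-side configuration, based on the music generation parameters set in advance on the producer terminal 11, which is the producer-side configuration, will be described with reference to FIGS. 23 to 31.
FIG. 24 is a diagram showing an example of playback control when the travel area is switched.
A of FIG. 25 is a diagram showing an example of playback control on a curve.
B of FIG. 25 is a diagram showing an example of playback control based on stopping and starting at an intersection or the like.
FIG. 26 is a diagram showing an example of playback control based on passing a landmark.
FIG. 27 is a diagram showing an example of playback control when traveling through a specific point indicated by POI information.
FIG. 28 is a diagram showing another example of playback control when traveling through a specific point indicated by POI information.
FIG. 29 is a diagram showing an example of playback control based on the traffic conditions in the area being traveled.
FIG. 30 is a diagram showing an example of playback control based on information such as speed information for the travel area.
FIG. 31 is a flowchart of the processing performed when music data playback control is carried out in the music data playback system 13, which is the user-side configuration.
FIG. 32 is a diagram showing an example of tracks subject to playback control.
FIG. 35 is a block diagram showing a configuration example of the music control unit 301, which includes the configuration for performing sound source separation as described above. Of the components shown in FIG. 35, those identical to components described with reference to FIG. 8 and other figures are given the same reference numerals. Duplicate descriptions are omitted as appropriate.
FIG. 37 is a diagram showing a specific example of remixing existing songs.
FIG. 38 is a diagram showing an example of processing for joining existing songs together.
FIG. 41 is a diagram showing an example of a production workflow when sound source separation is used.
・Playback control according to the state of a recognized object
An object appearing in an image captured by the imaging unit 72, which is the camera of the user terminal 22, may be recognized, and playback control may be performed by changing the music generation parameters according to the position of the recognized object in the image. Because the music generation parameters change dynamically so as to follow changes in the state of the object in the image, such as its position in the image, the music output to the user itself changes dynamically in real time. In addition to images captured by the imaging unit 72, a depth map obtained from a depth sensor or thermography, for example, may be used to change the music generation parameters for playback control.
Playback control may be performed such that the music is changed based on the brightness of the image displayed on the display 75. This function is used, for example, while the user is displaying an image (moving image) of scenery or the like on the display 75.
The work on the producer terminal 11 may be performed using a three-dimensional simulation tool.
The first embodiment of the present technology has been described above. Note that the present technology is applicable not only when the user is on board a moving body such as a car, but also when the user is moving on foot.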
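Note that the present technology can also take the following configurations.
(1)
An information processing apparatus including:
a content information setting unit that sets content information including content parameter information defining the details of playback control of content;
a trigger information setting unit that sets trigger information defining conditions for executing playback control of the content; and
a content generation parameter setting unit that sets a content generation parameter by linking the content information and the trigger information.
(2)
The information processing apparatus according to (1) above, further including
a display unit that displays a screen for setting the content generation parameter.
(3)
The information processing apparatus according to (1) or (2) above, wherein
the content information further includes metadata of the content, and
the metadata is information including at least one of the key, beat, BPM, chord, playback time, tonality, and instrument type of the content and, when the content is object audio, information on position data of the content.
(4)
The information processing apparatus according to any one of (1) to (3) above, wherein
the content information further includes information for associating first content data constituting the content with information on at least one piece of second content data to be added to the first content data.
(5)
The information processing apparatus according to (4) above, wherein
the first content data and the second content data are loop materials having a predetermined number of bars and a predetermined playback time, and the details of playback control are defined by the content generation parameter.
(6)
The information processing apparatus according to (4) or (5) above, wherein
the first content data is composed of data selected from data prepared in advance, or is composed using an existing song, and
the second content data is composed of data selected from data prepared in advance.
(7)
The information processing apparatus according to any one of (1) to (6) above, wherein
the trigger information is information including at least one of POI information, landmark information, area information, information on a moving body boarded by a user, and information on the user himself or herself.
(8)
The information processing apparatus according to (7) above, wherein
the POI information is set as information indicating the start point and end point of a POI,
the landmark information is set as information indicating the start point and end point of a landmark,
the area information is set as information indicating the start and end of an area, and
the information on the moving body is set as information indicating starting and stopping, stop time, the start and end of right and left turns, and the speed of the moving body.
(9)
The information processing apparatus according to (7) or (8) above, wherein
at least one of the POI information, the landmark information, and the area information has a unique parameter.
(10)
The information processing apparatus according to any one of (1) to (9) above, wherein
the content is track data of a specific sound source obtained by sound source separation performed on an existing song.
(11)
An information processing method in which
an information processing apparatus:
sets content information including content parameter information defining the details of playback control of content;
sets trigger information defining conditions for executing playback control of the content; and
sets a content generation parameter by linking the content information and the trigger information.
(12)
A program for causing
a computer
to execute processing of:
setting content information including content parameter information defining the details of playback control of content;
setting trigger information defining conditions for executing playback control of the content; and
setting a content generation parameter by linking the content information and the trigger information.
(13)
An information processing apparatus including:
an event acquisition unit that acquires an event occurring while a user is moving;
an acquisition unit that acquires a content generation parameter set by linking content information, which includes content parameter information defining the details of playback control of content, with trigger information defining conditions for executing playback control of the content; and
a content playback control unit that, when the event satisfying the condition defined by the trigger information is acquired, performs playback control of the content based on the content information linked to the trigger information.
(14)
The information processing apparatus according to (13) above, wherein
the event acquisition unit acquires at least one of the position of the user, information indicating the state of a moving body boarded by the user, and information on the user himself or herself.
(15)
The information processing apparatus according to (13) or (14) above, wherein
the trigger information is at least one of POI information, landmark information, area information, and information on a moving body boarded by the user.
(16)
The information processing apparatus according to any one of (13) to (15) above, further including
a content generation parameter control unit that determines, based on the event and the trigger information, whether or not to control the content generation parameter.
(17)
The information processing apparatus according to (16) above, wherein,
when it is determined that control of the content generation parameter is to be executed, the content playback control unit performs playback control based on the content generation parameter controlled according to the event.
(18)
The information processing apparatus according to (16) or (17) above, wherein
the content generation parameter control unit executes control of the content generation parameter based on traffic conditions or a speed limit.
(19)
The information processing apparatus according to any one of (13) to (17) above, further including
a sound source separation unit that performs sound source separation on an existing song and acquires track data of each sound source, wherein
the content playback control unit performs playback control using the track data acquired by sound source separation as the content.
(20)
The information processing apparatus according to (19) above, further including
a song acquisition unit that acquires the existing song to be subjected to sound source separation based on separation information including information indicating whether or not the song is one for which sound source separation is possible.
(21)
The information processing apparatus according to (19) or (20) above, wherein
the separation information includes, together with the information indicating whether or not the song is one for which sound source separation is possible, at least one of information indicating the type of sound source and information indicating whether or not sound source separation has been performed before.
(22)
An information processing method in which
an information processing apparatus:
acquires an event occurring while a user is moving;
acquires a content generation parameter set by linking content information, which includes content parameter information defining the details of playback control of content, with trigger information defining conditions for executing playback control of the content; and,
when the event satisfying the condition defined by the trigger information is acquired, performs playback control of the content based on the content information linked to the trigger information.
(23)
A program for causing
a computer
to execute processing of:
acquiring an event occurring while a user is moving;
acquiring a content generation parameter set by linking content information, which includes content parameter information defining the details of playback control of content, with trigger information defining conditions for executing playback control of the content; and,
when the event satisfying the condition defined by the trigger information is acquired, performing playback control of the content based on the content information linked to the trigger information.
(24)
An information processing system including:
a first information processing apparatus including
a content information setting unit that sets content information including content parameter information defining the details of playback control of content,
a trigger information setting unit that sets trigger information defining conditions for executing playback control of the content, and
a content generation parameter setting unit that sets a content generation parameter by linking the content information and the trigger information; and
a second information processing apparatus including
an event acquisition unit that acquires an event occurring while a user is moving,
an acquisition unit that acquires the content generation parameter, and
a content playback control unit that, when the event satisfying the condition defined by the trigger information is acquired, performs playback control of the content based on the content information linked to the trigger information.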
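Claims (24)
- An information processing apparatus comprising: a content information setting unit that sets content information including content parameter information defining the details of playback control of content; a trigger information setting unit that sets trigger information defining conditions for executing playback control of the content; and a content generation parameter setting unit that sets a content generation parameter by linking the content information and the trigger information.
- The information processing apparatus according to claim 1, further comprising a display unit that displays a screen for setting the content generation parameter.
- The information processing apparatus according to claim 1, wherein the content information further includes metadata of the content, and the metadata is information including at least one of the key, beat, BPM, chord, playback time, tonality, and instrument type of the content and, when the content is object audio, information on position data of the content.
- The information processing apparatus according to claim 1, wherein the content information further includes information for associating first content data constituting the content with information on at least one piece of second content data to be added to the first content data.
- The information processing apparatus according to claim 4, wherein the first content data and the second content data are loop materials having a predetermined number of bars and a predetermined playback time, and the details of playback control are defined by the content generation parameter.
- The information processing apparatus according to claim 4, wherein the first content data is composed of data selected from data prepared in advance, or is composed using an existing song, and the second content data is composed of data selected from data prepared in advance.
- The information processing apparatus according to claim 1, wherein the trigger information is information including at least one of POI information, landmark information, area information, information on a moving body boarded by a user, and information on the user himself or herself.
- The information processing apparatus according to claim 7, wherein the POI information is set as information indicating the start point and end point of a POI, the landmark information is set as information indicating the start point and end point of a landmark, the area information is set as information indicating the start and end of an area, and the information on the moving body is set as information indicating starting and stopping, stop time, the start and end of right and left turns, and the speed of the moving body.
- The information processing apparatus according to claim 7, wherein at least one of the POI information, the landmark information, and the area information has a unique parameter.
- The information processing apparatus according to claim 1, wherein the content is track data of a specific sound source obtained by sound source separation performed on an existing song.
- An information processing method in which an information processing apparatus: sets content information including content parameter information defining the details of playback control of content; sets trigger information defining conditions for executing playback control of the content; and sets a content generation parameter by linking the content information and the trigger information.
- A program for causing a computer to execute processing of: setting content information including content parameter information defining the details of playback control of content; setting trigger information defining conditions for executing playback control of the content; and setting a content generation parameter by linking the content information and the trigger information.
- An information processing apparatus comprising: an event acquisition unit that acquires an event occurring while a user is moving; an acquisition unit that acquires a content generation parameter set by linking content information, which includes content parameter information defining the details of playback control of content, with trigger information defining conditions for executing playback control of the content; and a content playback control unit that, when the event satisfying the condition defined by the trigger information is acquired, performs playback control of the content based on the content information linked to the trigger information.
- The information processing apparatus according to claim 13, wherein the event acquisition unit acquires at least one of the position of the user, information indicating the state of a moving body boarded by the user, and information on the user himself or herself.
- The information processing apparatus according to claim 13, wherein the trigger information is at least one of POI information, landmark information, area information, and information on a moving body boarded by the user.
- The information processing apparatus according to claim 13, further comprising a content generation parameter control unit that determines, based on the event and the trigger information, whether or not to control the content generation parameter.
- The information processing apparatus according to claim 16, wherein, when it is determined that control of the content generation parameter is to be executed, the content playback control unit performs playback control based on the content generation parameter controlled according to the event.
- The information processing apparatus according to claim 16, wherein the content generation parameter control unit executes control of the content generation parameter based on traffic conditions or a speed limit.
- The information processing apparatus according to claim 13, further comprising a sound source separation unit that performs sound source separation on an existing song and acquires track data of each sound source, wherein the content playback control unit performs playback control using the track data acquired by sound source separation as the content.
- The information processing apparatus according to claim 19, further comprising a song acquisition unit that acquires the existing song to be subjected to sound source separation based on separation information including information indicating whether or not the song is one for which sound source separation is possible.
- The information processing apparatus according to claim 20, wherein the separation information includes, together with the information indicating whether or not the song is one for which sound source separation is possible, at least one of information indicating the type of sound source and information indicating whether or not sound source separation has been performed before.
- An information processing method in which an information processing apparatus: acquires an event occurring while a user is moving; acquires a content generation parameter set by linking content information, which includes content parameter information defining the details of playback control of content, with trigger information defining conditions for executing playback control of the content; and, when the event satisfying the condition defined by the trigger information is acquired, performs playback control of the content based on the content information linked to the trigger information.
- A program for causing a computer to execute processing of: acquiring an event occurring while a user is moving; acquiring a content generation parameter set by linking content information, which includes content parameter information defining the details of playback control of content, with trigger information defining conditions for executing playback control of the content; and, when the event satisfying the condition defined by the trigger information is acquired, performing playback control of the content based on the content information linked to the trigger information.
- An information processing system comprising: a first information processing apparatus including a content information setting unit that sets content information including content parameter information defining the details of playback control of content, a trigger information setting unit that sets trigger information defining conditions for executing playback control of the content, and a content generation parameter setting unit that sets a content generation parameter by linking the content information and the trigger information; and a second information processing apparatus including an event acquisition unit that acquires an event occurring while a user is moving, an acquisition unit that acquires the content generation parameter, and a content playback control unit that, when the event satisfying the condition defined by the trigger information is acquired, performs playback control of the content based on the content information linked to the trigger information.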
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2023570773A JPWO2023127422A1 (ja) | 2021-12-28 | 2022-12-06 | |
| US18/724,643 US20250068380A1 (en) | 2021-12-28 | 2022-12-06 | Information processing apparatus, information processing method, program, and information processing system |
| EP22915653.4A EP4460040A4 (en) | 2021-12-28 | 2022-12-06 | Information processing device, information processing method, program, and information processing system |
| CN202280086378.0A CN118451728A (zh) | 2021-12-28 | 2022-12-06 | 信息处理装置、信息处理方法、程序和信息处理系统 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2021-214483 | 2021-12-28 | ||
| JP2021214483 | 2021-12-28 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023127422A1 true WO2023127422A1 (ja) | 2023-07-06 |
Family
ID=86998633
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/JP2022/044839 Ceased WO2023127422A1 (ja) | 2021-12-28 | 2022-12-06 | 情報処理装置、情報処理方法、プログラム、情報処理システム |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US20250068380A1 (ja) |
| EP (1) | EP4460040A4 (ja) |
| JP (1) | JPWO2023127422A1 (ja) |
| CN (1) | CN118451728A (ja) |
| WO (1) | WO2023127422A1 (ja) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2025234158A1 (ja) * | 2024-05-10 | 2025-11-13 | ソニーグループ株式会社 | 情報処理システム、情報処理方法、およびプログラム |
| WO2026069674A1 (ja) * | 2024-09-30 | 2026-04-02 | 株式会社ソニー・インタラクティブエンタテインメント | 音ファイル生成装置および音ファイル生成方法 |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12551792B2 (en) * | 2023-03-16 | 2026-02-17 | Honda Motor Co., Ltd. | Systems and methods for gamification in a meta verse |
Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2003028663A (ja) * | 2001-07-17 | 2003-01-29 | Yamaha Corp | ナビゲーション装置及び制御プログラム |
| JP2005031038A (ja) * | 2003-07-11 | 2005-02-03 | Sony Corp | 情報出力装置 |
| JP2006069288A (ja) | 2004-08-31 | 2006-03-16 | Fuji Heavy Ind Ltd | 車載音楽生成装置及び車載エンタテイメントシステム |
| JP2006254285A (ja) * | 2005-03-14 | 2006-09-21 | Sony Corp | 車載装置及び車載装置の制御方法 |
| JP2006273015A (ja) * | 2005-03-28 | 2006-10-12 | Yamaha Motor Co Ltd | 車両の音声発生装置 |
| WO2007032388A1 (ja) * | 2005-09-14 | 2007-03-22 | Pioneer Corporation | 音量制御装置、音量制御方法、音量制御プログラムおよびコンピュータに読み取り可能な記録媒体 |
| JP2008145789A (ja) * | 2006-12-11 | 2008-06-26 | Yamaha Corp | 車両用楽音再生装置、車両用楽音再生方法、及びプログラム |
| JP2016188959A (ja) * | 2015-03-30 | 2016-11-04 | パイオニア株式会社 | 出力装置、出力方法及び出力装置用プログラム |
| JP2020057955A (ja) * | 2018-10-03 | 2020-04-09 | パイオニア株式会社 | 振動制御装置、振動制御方法、振動制御プログラム、及び記憶媒体 |
| JP2020156049A (ja) * | 2019-03-22 | 2020-09-24 | 株式会社Jvcケンウッド | 音量制御装置、音量制御システム、音量制御方法、およびプログラム |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002344268A (ja) * | 2001-05-14 | 2002-11-29 | Matsushita Electric Ind Co Ltd | Audio playback device |
| KR100921584B1 (ko) * | 2006-12-06 | 2009-10-14 | 야마하 가부시키가이샤 | On-board music reproduction apparatus and music information distribution system |
| US20110054917A1 (en) * | 2009-08-28 | 2011-03-03 | Electronics And Telecommunications Research Institute | Apparatus and method for structuring bitstream for object-based audio service, and apparatus for encoding the bitstream |
| US10068620B1 (en) * | 2017-06-20 | 2018-09-04 | Lp-Research Inc. | Affective sound augmentation for automotive applications |
| US11929051B2 (en) * | 2020-10-01 | 2024-03-12 | General Motors Llc | Environment awareness system for experiencing an environment through music |
| KR20220154499A (ko) * | 2021-05-13 | 2022-11-22 | 현대자동차주식회사 | Sound generation apparatus for mobility, system including the same, and method thereof |
| US12444394B2 (en) * | 2021-12-15 | 2025-10-14 | Distributed Creation Inc. | Scalable similarity-based generation of compatible music mixes |
2022
- 2022-12-06: JP application JP2023570773A, published as JPWO2023127422A1 (ja); status: Pending (active)
- 2022-12-06: CN application CN202280086378.0A, published as CN118451728A (zh); status: Pending (active)
- 2022-12-06: US application US18/724,643, published as US20250068380A1 (en); status: Pending (active)
- 2022-12-06: WO application PCT/JP2022/044839, published as WO2023127422A1 (ja); status: Ceased (not active)
- 2022-12-06: EP application EP22915653.4A, published as EP4460040A4 (en); status: Pending (active)
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4460040A4 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4460040A4 (en) | 2025-04-02 |
| US20250068380A1 (en) | 2025-02-27 |
| CN118451728A (zh) | 2024-08-06 |
| EP4460040A1 (en) | 2024-11-06 |
| JPWO2023127422A1 (ja) | 2023-07-06 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2023127422A1 (ja) | | Information processing device, information processing method, program, and information processing system |
| US12283271B2 (en) | | Voice feedback for user interface of media playback device |
| US11874124B2 (en) | | Duration-based customized media program |
| US11929051B2 (en) | | Environment awareness system for experiencing an environment through music |
| Kari et al. | | Soundsride: Affordance-synchronized music mixing for in-car audio augmented reality |
| US11188293B2 (en) | | Playback sound provision device |
| US20230410774A1 (en) | | Dynamic sounds from automotive inputs |
| JP2016167771A (ja) | | Digest video generation device |
| JP2008203338A (ja) | | Musical sound generation device and musical sound generation method |
| CN101689207B (zh) | | Providing access to a collection of content items |
| CN118506752A (zh) | | AIGC music generation method for automobiles, computer device, and storage medium |
| JP5865708B2 (ja) | | Method for reproducing images and audio and producing data relating to facilities, nature, history, and routes |
| JP2009043353A (ja) | | Title assignment device, title assignment method, title assignment program, and recording medium |
| JP7782452B2 (ja) | | Information processing system and content playback device |
| US20260120667A1 (en) | | Dynamic chunking for sounds from automotive inputs |
| JP2023077685A (ja) | | Karaoke system and server device |
| WO2026070345A1 (ja) | | Signal processing device and method |
| WO2025234158A1 (ja) | | Information processing system, information processing method, and program |
| JP4642607B2 (ja) | | Method for reproducing audio and images with a mobile playback device |
| Yagerline | | Orange Line: Ambient Music and the Washington, DC Metro |
| WO2024122322A1 (ja) | | Information processing device, information processing method, and program |
| WO2025147661A1 (en) | | Dynamic chunking for sounds from automotive inputs |
| DE102021116641A1 (de) | | Method, system and computer program for interactive selection and playback of audio and/or video sequences generated in real time in a motor vehicle |
| JP2020118894A (ja) | | Playback control device, playback device, playback control method, and program |
| JP2005243182A (ja) | | Music selection device |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 22915653; Country of ref document: EP; Kind code of ref document: A1 |
| | WWE | Wipo information: entry into national phase | Ref document number: 2023570773; Country of ref document: JP |
| | WWE | Wipo information: entry into national phase | Ref document number: 18724643; Country of ref document: US. Ref document number: 202280086378.0; Country of ref document: CN |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | ENP | Entry into the national phase | Ref document number: 2022915653; Country of ref document: EP; Effective date: 20240729 |
| | WWP | Wipo information: published in national office | Ref document number: 18724643; Country of ref document: US |