WO2020029235A1 - Providing video recommendation - Google Patents

Providing video recommendation Download PDF

Info

Publication number
WO2020029235A1
WO2020029235A1 PCT/CN2018/099914 CN2018099914W WO2020029235A1 WO 2020029235 A1 WO2020029235 A1 WO 2020029235A1 CN 2018099914 W CN2018099914 W CN 2018099914W WO 2020029235 A1 WO2020029235 A1 WO 2020029235A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
candidate
user
recommended
score
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2018/099914
Other languages
French (fr)
Inventor
Bo Han
Qiao LUAN
Yang Wang
Albert THAMBIRATNAM
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing LLC filed Critical Microsoft Technology Licensing LLC
Priority to US17/256,951 priority Critical patent/US20210144418A1/en
Priority to PCT/CN2018/099914 priority patent/WO2020029235A1/en
Priority to CN201880069804.3A priority patent/CN111279709B/en
Priority to EP18929802.9A priority patent/EP3834424A4/en
Publication of WO2020029235A1 publication Critical patent/WO2020029235A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/482End-user interface for programme selection
    • H04N21/4826End-user interface for programme selection using recommendation lists, e.g. of programmes or channels sorted out according to their score
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/46Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23418Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/251Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/258Client or end-user data management, e.g. managing client capabilities, user preferences or demographics, processing of multiple end-users preferences to derive collaborative data
    • H04N21/25808Management of client data
    • H04N21/25825Management of client data involving client display capabilities, e.g. screen resolution of a mobile phone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/258Client or end-user data management, e.g. managing client capabilities, user preferences or demographics, processing of multiple end-users preferences to derive collaborative data
    • H04N21/25808Management of client data
    • H04N21/25833Management of client data involving client hardware characteristics, e.g. manufacturer, processing or storage capabilities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/258Client or end-user data management, e.g. managing client capabilities, user preferences or demographics, processing of multiple end-users preferences to derive collaborative data
    • H04N21/25808Management of client data
    • H04N21/25841Management of client data involving the geographical location of the client
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/258Client or end-user data management, e.g. managing client capabilities, user preferences or demographics, processing of multiple end-users preferences to derive collaborative data
    • H04N21/25866Management of end-user data
    • H04N21/25891Management of end-user data being end-user preferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/4508Management of client data or end-user data
    • H04N21/4518Management of client data or end-user data involving characteristics of one or more peripherals, e.g. peripheral type, software version, amount of memory available or display capabilities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/4508Management of client data or end-user data
    • H04N21/4532Management of client data or end-user data involving end-user characteristics, e.g. viewer profile, preferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/466Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/466Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • H04N21/4668Learning process for intelligent management, e.g. learning user preferences for recommending movies for recommending content, e.g. movies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/485End-user interface for client configuration
    • H04N21/4852End-user interface for client configuration for modifying audio parameters, e.g. switching between mono and stereo

Definitions

  • Embodiments of the present disclosure propose method and apparatus for providing video recommendation.
  • At least one reference factor for the video recommendation may be determined, wherein the at least one reference factor indicates preferred importance of visual information and/or audio information in at least one video to be recommended.
  • a ranking score of each candidate video in a candidate video set may be determined based at least on the at least one reference factor.
  • At least one recommended video may be selected from the candidate video set based at least on ranking scores of candidate videos in the candidate video set.
  • the at least one recommended video may be provided to a user through a terminal device.
  • FIG. 1 illustrates exemplary implementation scenarios of providing video recommendation according to an embodiment.
  • FIG. 2 illustrates an exemplary process for determining content scores of candidate videos according to an embodiment.
  • FIG. 3 illustrates an exemplary process for determining recommended videos according to an embodiment.
  • FIG. 4 illustrates an exemplary process for determining recommended videos according to an embodiment.
  • FIG. 5 illustrates an exemplary process for determining recommended videos according to an embodiment.
  • FIG. 6 illustrates an exemplary process for determining recommended videos according to an embodiment.
  • FIG. 7 illustrates an exemplary process for determining recommended videos according to an embodiment.
  • FIG. 8 illustrates a flowchart of an exemplary method for providing video recommendation according to an embodiment.
  • FIG. 9 illustrates an exemplary apparatus for providing video recommendation according to an embodiment.
  • FIG. 10 illustrates an exemplary apparatus for providing video recommendation according to an embodiment.
  • Applications or websites being capable of accessing various video resources on the network may provide video recommendation to users.
  • the applications or websites may be news clients or websites, social networking applications or websites, video platforms clients or websites, search engine clients or websites, etc., such as, CNN News, Toutiao, Facebook, Youtube, Youku, Bing, Baidu, etc.
  • the applications or websites may select a plurality of videos from the video resources on the network as recommended videos and provide the recommended videos to users for consumption.
  • those existing approaches for determining recommended videos from the video resources on the network may consider some factors, e.g., freshness of the video, popularity of the video, click rate of the video, video quality, relevance between content of the video and a user’s interests, etc.
  • this video is more likely to be selected as a recommended video. For example, if the content of the video belongs to a category of football and the user always shows interest in football-related videos, i.e., there is a high relevance between the content of the video and the user’s interests, this video may be recommended to the user with a high probability.
  • a video may comprise visual information and audio information, wherein the visual information indicates a series of pictures being visually presented in the video, and the audio information indicates voice, sound, music, etc. being presented in an audio form in the video.
  • the user may be preparing dinner in a kitchen, and thus the user can keep listening but cannot keep watching a screen of the terminal device. For example, if it is eight o’clock in the morning and the user is on the subway now, the user may prefer to consume visual information of a recommended video but doesn’t want any sounds to be displayed to disturb others.
  • the terminal device is a smart phone and the smart phone is operating in a mute mode, and thus the user can not consume audio information in the recommended video.
  • the terminal device is a smart speaker with a small screen or with no screen, and the user is driving a car now, and thus it may be not suitable for the user to consume visual information in the recommended video.
  • Embodiments of the present disclosure propose to improve video recommendation through considering importance of visual information and/or audio information in recommended videos during determining the recommended videos.
  • importance of visual information and/or audio information in a video may indicate, e.g., whether content of the video is conveyed mainly by the visual information and/or the audio information, whether the visual information or the audio information is the most critical information in the video, whether the visual information and/or the audio information is indispensable or necessary for consuming the video, etc.
  • Importance of visual information and importance of audio information may vary for different videos. For example, for a speech video, importance of audio information is higher than importance of visual information because the video presents content of the speech mainly in an audio form.
  • audio information may be less important than visual information because the video may present the activities of the dog mainly in a visual form.
  • visual information and audio information may be important because the video may present dance movements in a visual form and meanwhile present music in an audio form. It can be seen that, when a user is consuming a video, either visual information or audio information that has a higher importance may be sufficient for the user to acknowledge or understand content of the video.
  • the embodiments of the present disclosure may decide whether to recommend those videos having a higher importance of visual information, or to recommend those videos having a higher importance of audio information, or to recommend those videos having both a high importance of visual information and a high importance of audio information, and accordingly select corresponding candidate videos as the recommended videos.
  • the embodiments of the present disclosure may improve a ratio of satisfactorily consumed videos in the video recommendation.
  • FIG. 1 illustrates exemplary implementation scenarios of providing video recommendation according to an embodiment.
  • Exemplary network architecture 100 is shown in FIG. 1, and the video recommendation may be provided in the network architecture 100.
  • a network 110 is applied for interconnecting various network entities.
  • the network 110 may be any type of networks capable of interconnecting network entities.
  • the network 110 may be a single network or a combination of various networks.
  • the network 110 may be a Local Area Network (LAN) , a Wide Area Network (WAN) , etc.
  • the network 110 may be a wireline network, a wireless network, etc.
  • the network 110 may be a circuit switching network, a packet switching network, etc.
  • a video recommendation server 120 may connect to the network 110.
  • service providing websites 130 may connect to the network 110.
  • video hosting servers 140 may connect to the network 110.
  • video resources 142 may connect to the network 110.
  • the video recommendation server 120 may be configured for providing video recommendation according to the embodiments of the present disclosure, e.g., determining recommended videos and providing the recommended videos to users.
  • providing recommended videos may refer to providing links of the recommended videos, providing graphical indications containing links of the recommended videos, displaying at least one of the recommended videos directly, etc.
  • the service providing websites 130 exemplarily represent various websites that may provide various services to users, wherein the provided services may comprise video-related services.
  • the service providing websites 130 may comprise, e.g., a news website, a social networking website, a video platform website, a search engine website, etc.
  • the service providing websites 130 may also comprise a website established by the video recommendation server 120.
  • the service providing websites 130 may be configured for interacting with the video recommendation server 120, obtaining recommended videos from the video recommendation server 120, and providing the recommended videos to the users.
  • the video recommendation server 120 may provide video recommendation in the services provided by the service providing websites 130. It should be appreciated that although the video recommendation server 120 is exemplarily shown as separated from the service providing websites 130 in FIG. 1, functionality of the video recommendation server 120 may also be implemented or incorporated in the service providing websites 130.
  • the video hosting servers 140 exemplarily represent various network entities capable of managing videos, which support uploading, storing, displaying, downloading, or sharing of videos.
  • the videos managed by the video hosting servers 140 are collectively shown as the video resources 142.
  • the video resources 142 may be stored or maintained in various databases, cloud storages, etc.
  • the video resources 142 may be accessed or processed by the video hosting servers 140. It should be appreciated that although the video resources 142 is exemplarily shown as separated from the video hosting servers 140 in FIG. 1, the video resources 142 may also be incorporated in the video hosting servers 140.
  • functionality of the video hosting servers 140 may also be implemented or incorporated in the service providing websites 130 or the video recommendation server 120. Furthermore, a part of or all of the video resources 142 may also be possessed, accessed, stored or managed by the service providing websites 130 or the video recommendation server 120.
  • the video recommendation server 120 may access the video resources 142 and determine the recommended videos from the video resources 142.
  • the terminal devices 150 and 160 in FIG. 1 may be any type of electronic computing devices capable of connecting to the network 110, accessing servers or websites on the network 110, processing data or signals, presenting multimedia contents, etc.
  • the terminal devices 150 and 160 may be smart phones, desktop computers, laptops, tablets, AI terminals, wearable devices, smart TVs, smart speakers, etc. Although two terminal devices are shown in FIG. 1, it should be appreciated that a different number of terminal devices may connect to the network 110.
  • the terminal devices 150 and 160 may be used by users for obtaining various services provided through the network 110, wherein the services may comprise video recommendation.
  • a client application 152 is installed in the terminal device 150, wherein the client application 152 represents various applications or clients that may provide services to a user of the terminal device 150.
  • the client application 152 may be, a news client, a social networking application, a video platform client, a search engine client, etc.
  • the client application 152 may also be a client associated with the video recommendation server 120.
  • the client application 152 may communicate with a corresponding application server to provide services to the user.
  • the client application 152 may interact with the video recommendation server 120, obtain recommended videos from the video recommendation server 120, and provide the recommended videos to the users within the service provided by the client application 152.
  • the client application 152 may receive recommended videos from the corresponding application server, and provide the recommended videos to the users.
  • the terminal device 160 may still obtain various services through accessing websites, e.g., the service providing websites 130, on the network 110.
  • the video recommendation server 120 may determine recommended videos, and the recommended videos may be provided to the user within the services provided by the service providing websites 130.
  • this user input may also be provided to and considered by the video recommendation server 120 so as to provide recommended videos.
  • the client application 152 may communicate with the video hosting servers 140 to obtain a corresponding video file and then display the video to the user.
  • the terminal device 160 may communicate with the video hosting servers 140 to obtain a corresponding video file and then display the video to the user.
  • any of the recommended videos may also be displayed to the user directly.
  • importance of visual information and/or audio information in each candidate video in a plurality of candidate videos may be determined in advance, wherein recommended videos are to be selected from the plurality of candidate videos.
  • the embodiments of the present disclosure may select candidate videos as the recommended videos based at least on importance of visual information and/or audio information in each candidate video.
  • FIG. 2 illustrates an exemplary process 200 for determining content scores of candidate videos according to an embodiment.
  • a content score of a video is used for indicating importance of visual information and/or audio information in the video.
  • Video resources 210 on the network may provide a number of various videos, from which recommended videos may be selected and provided to users.
  • the video resources 210 in FIG. 2 may correspond to the video resources 142 in FIG. 1.
  • the candidate video set 220 comprises a number of videos acting as candidates of recommended videos.
  • a content score of each candidate video in the candidate video set 220 may be determined.
  • a content score of a candidate video may comprise two separate sub scores or a vector formed by the two separate sub scores, one sub score indicating importance of visual information in the candidate video, another sub score indicating importance of audio information in the candidate video.
  • a content score of a candidate video is denoted as [0.8, 0.3]
  • the first sub score ā€œ0.8ā€ may indicate importance of visual information in the candidate video
  • the second sub score ā€œ0.3ā€ may indicate importance of audio information in the candidate video.
  • sub scores range from 0 to 1, and a higher sub score indicates higher importance.
  • the visual information would be of high importance for the candidate video, since the first sub score ā€œ0.8ā€ is very close to the maximum score ā€œ1ā€ , while the audio information would be of low importance for the candidate video, since the second sub score ā€œ0.3ā€ is close to the minimum score ā€œ0ā€ . That is, for this candidate video, the visual information is much more important than the audio information, and accordingly content of this candidate video may be conveyed mainly by the visual information.
  • a content score of a candidate video is denoted as [0.8, 0.7]
  • the first sub score ā€œ0.8ā€ may indicate importance of visual information in the candidate video
  • the second sub score ā€œ0.7ā€ may indicate importance of audio information in the candidate video.
  • both the visual information and the audio information in the candidate video have high importance. That is, content of this candidate video should be conveyed by both the visual information and the audio information.
  • a content score of a candidate video may comprise a single score, which may indicate a relative importance degree between visual information and audio information in the candidate video. Assuming that this signal score ranges from 0 to 1, and the higher the score is, the higher importance the visual information has and the lower importance the audio information has, while the lower the score is, the higher importance the audio information has and the lower importance the visual information has, or vice versa. As an example, assuming that a content score of a candidate video is ā€œ0.9ā€ , since this score is much close to the maximum score ā€œ1ā€ , it indicates that visual information in this candidate video is much more important than the audio information in this candidate video.
  • a content score of a candidate video is ā€œ0.3ā€
  • this score is much close to the minimum score ā€œ0ā€
  • a content score of a candidate video is ā€œ0.6ā€
  • this score is only a bit higher than a median score ā€œ0.5ā€
  • a content score of a candidate video may be determined based on, e.g., at least one of shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute, and video metadata of the candidate video.
  • the ā€œshot transitionā€ refers to how many times shot transition occurs in a predetermined time period or in time duration of the candidate video. Taking a speech video as an example, a camera may focus on a lecturer at most time and the shots of audience may be very few, and thus shot transition of this video would be very few. Taking a travel video as example, various sceneries may be recorded in the video, e.g., a long shot of a mountain, a close shot of a river, people’s activities on the grass, etc., and thus there may be many shot transitions in this video. Usually, more shot transitions may indicate more visual information existing in a candidate video. The shot transition may be detected among adjacent frames in the candidate video through any existing techniques.
  • the ā€œcamera motionā€ refers to movements of a camera in the candidate video.
  • the camera motion may be characterized by, e.g., time duration, distance, number, etc. of the movements of the camera.
  • a speech video when the camera captures a lecturer in the middle of the screen, the camera may keep static for a long time so as to fix the picture of the lecturer in the middle of the screen, and during this time period, no camera motion occurs.
  • the camera may move along with the dog, and thus camera motion of this video, e.g., time duration, distance or number of movements of the camera, would be very high.
  • a higher camera motion may indicate more visual information existing in a candidate video.
  • the camera motion may be detected among adjacent frames in the candidate video through any existing techniques.
  • the ā€œsceneā€ refers to places or locations at where an event is happening in the candidate video.
  • the scene may be characterized by, e.g., how many scenes occur in the candidate video. For example, if a video records an indoor picture, a car picture, and a football field picture sequentially, since each of the ā€œindoor pictureā€ , ā€œcar pictureā€ , and ā€œfootball field pictureā€ is a scene, this video may be determined as including three scenes. Usually, more scenes may indicate more visual information existing in a candidate video.
  • the scenes in the candidate video may be detected through various existing techniques. For example, the scenes in the candidate video may be detected through deep learning models for image categorization. Moreover, the scenes in the candidate video may also be detected through performing semantic analysis on text information derived from the candidate video.
  • the ā€œhumanā€ refers to persons, characters, etc. appearing in the candidate video.
  • the human may be characterized by, e.g., how many human beings appear in the candidate video, whether a special human beings is appearing in the candidate video, etc.
  • more human beings may indicate more visual information existing in a candidate video.
  • the human beings appeared in the candidate video are famous celebrities, e.g., movie stars, pop stars, sport stars, etc., this may indicate more visual information existing in the candidate video.
  • the human beings in the candidate video may be detected through various existing techniques, e.g., deep learning models for face detection, face recognition, etc.
  • the ā€œhuman motionā€ refers to movements, actions, etc. of human beings in the candidate video.
  • the human motion may be characterized by, e.g., number, time duration, type, etc. of human motions appearing in the candidate video.
  • more human motions and long-time human motions may indicate more visual information existing in a candidate video.
  • some types of human motions e.g., shooting in a football game, may also indicate more visual information existing in a candidate video.
  • the human motion may be detected among adjacent frames in the candidate video through any existing techniques.
  • the ā€œobjectā€ refers to animals, articles, etc. appearing in the candidate video.
  • the object may be characterized by, e.g., how many objects appear in the candidate video, whether special objects are appearing in the candidate video.
  • more objects may indicate more visual information existing in a candidate video.
  • some special objects e.g., a tiger, a turtle, etc., may also indicate more visual information existing in a candidate video.
  • the objects in the candidate video may be detected through various existing techniques, e.g., deep learning models for image detection, etc.
  • the ā€œobject motionā€ refers to movements, actions, etc. of objects in the candidate video.
  • the object motion may be characterized by, e.g., number, time duration, area, etc. of object motions appearing in the candidate video.
  • more object motions and long-time object motions may indicate more visual information existing in a candidate video.
  • certain areas of object motions may also indicate more visual information existing in a candidate video.
  • the object motion may be detected among adjacent frames in the candidate video through any existing techniques.
  • the ā€œtext informationā€ refers to informative texts in the candidate video, e.g., subtitles, closed captions, embedded text, etc.
  • the text information may be characterized by, e.g., the amount of informative texts. Taking a video of talk show as an example, all the sentences spoken by attendees may be shown in a text form on the picture of the video, and thus this video may be determined as having a large amount of text information. Taking a cooking video as an example, during a cooker is explaining how to cook a dish in the video, steps of cooking the dish may be shown in a text form on the picture of the video synchronously, and thus this video may be determined as having a large amount of text information.
  • Text information in the candidate video may be detected through various existing techniques. For example, subtitles and closed captions may be detected through decoding a corresponding text file of the candidate video, and embedded text, which has been merged with the picture of the candidate video, may be detected through, e.g., Optical Character Recognition (OCR) , etc.
  • OCR Optical Character Recognition
  • the ā€œaudio attributeā€ refers to categories of audio appearing in the candidate video, e.g., voice, sing, music, etc.
  • Various audio attributes may indicate different importance of audio information in the candidate video. For example, in a video recording a girl who is singing, the audio information, i.e., singing by the girl, may indicate a high importance of audio information.
  • the audio attribute of the candidate video may be detected based on, e.g., audio tracks in the candidate video through any existing techniques.
  • the ā€œvideo metadataā€ refers to descriptive information associated with the candidate video obtained from a video resource, comprising, e.g., video category, title, etc.
  • the video category may be, e.g., ā€œfunnyā€ , ā€œeducationā€ , ā€œtalk showā€ , ā€œgameā€ , ā€œmusicā€ , ā€œnewsā€ , etc., which may facilitate to determine importance of visual information and/or audio information.
  • a game video it is likely that visual information in the video is more important than audio information in the video.
  • the title of the candidate video may comprise some keywords, e.g., ā€œsongā€ , ā€œinterviewā€ , ā€œspeechā€ , etc., which may facilitate to determine importance of visual information and/or audio information. For example, if the title of the candidate video is ā€œElection Speechā€ , it is very likely that audio information in the candidate video is more important than visual information in the candidate video.
  • any two or more of the above discussed shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute, and video metadata may be combined together so as to determine the content score of the candidate video.
  • this video may contain a large amount of camera motions and object motions but does not include any speech or music, and thus a content score indicating a high importance of visual information may be determined for this video.
  • this video may contain a long time-duration speech, few shot transition, few camera motions, few scenes, a title including a keyword ā€œspeechā€ , etc., and thus a content score indicating a high importance of audio information may be determined for this video.
  • a content side model may be adopted for determining the content score of the candidate video as discussed above.
  • a content side model 230 is used for determining a content score of each candidate video in the candidate video set 220.
  • the content side model 230 may be established based on various techniques, e.g., machine learning, deep learning, etc.
  • Features adopted by the content side model 230 may comprise at least one of: shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute, and video metadata, as discussed above.
  • the content side model 230 may be, e.g., a regression model, a classification model, etc.
  • the content side model may be based on, e.g., a linear model, a logistic model, a decision tree model, a neural network model, etc.
  • Training data for the content side model 230 may be obtained through: obtaining a group of videos to be used for training; for each video in the group of videos, labeling respective values corresponding to the features of the content side model, and labeling a content score for the video; and forming training data from the group of videos with respective labels.
  • a content score of each candidate video in the candidate video set 220 may be determined, and accordingly the candidate video set with respective content scores 240 may be finally obtained, which may be further used for determining recommended videos.
  • the content side model 230 is implemented as a model which adopts features comprising at least one of: shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute, and video metadata.
  • the content side model 230 may also be implemented in any other approaches.
  • the content side model 230 may be a deep learning-based model, which can determine or predict a content score of each candidate video directly based on visual and/or audio stream of the candidate video without extracting any heuristically designed features.
  • This content side model may be trained by a set of training data. Each training data may be formed by a video and a labeled content score indicating importance of visual information and/or audio information in the video.
  • At least one reference factor may be used for the video recommendation.
  • a reference factor may indicate preferred importance of visual information and/or audio information in at least one video to be recommended. That is, the at least one reference factor may provide references or criteria for determining recommended videos. For example, the at least one reference factor may indicate whether to recommend those videos having a higher importance of visual information, or to recommend those videos having a higher importance of audio information, or to recommend those videos having both a high importance of visual information and a high importance of audio information.
  • the at least one reference factor may comprise an indication of a default or current service configuration of the video recommendation, a preference score of the user, a user input from the user, etc., which will be discussed in details later.
  • FIG. 3 illustrates an exemplary process 300 for determining recommended videos according to an embodiment.
  • an indication of service configuration of the video recommendation is used as a reference factor for determining recommended videos.
  • service configuration 310 of the video recommendation may be obtained.
  • the service configuration 310 refers to configuration about how to provide recommended videos to a user which is set in a client application or service providing website.
  • the service configuration 310 may be a default service configuration of the video recommendation, or a current service configuration of the video recommendation.
  • the service configuration 310 may comprise providing recommended videos in a mute mode, or providing recommended videos in a non-mute mode. For example, as for the case of providing recommended videos in a mute mode, those videos with high importance of visual information are suitable to be recommended, whereas those videos with high importance of audio information are not suitable to be recommended since the audio information cannot be displayed to the user.
  • a ranking score of a candidate video may be determined based at least on a content score of the candidate video and an indication of the service configuration 310.
  • the indication of the service configuration 310 may be provided to a ranking model 320 as a reference factor.
  • a candidate video set with content scores 330 may also be provided to the ranking model 320, wherein the candidate video set with content scores 330 corresponds to the candidate video set with content scores 240 in FIG. 2.
  • the ranking model 320 may be an improved version of any existing ranking models for video recommendation.
  • the existing ranking models may determine a ranking score of each candidate video based on features of freshness of the video, popularity of the video, click rate of the video, video quality, relevance between content of the video and the user’s interests, etc.
  • the ranking model 320 may further adopt a content score of a candidate video and at least one reference factor, i.e., the indication of the service configuration 310 in FIG. 3, as additional features. That is, the ranking model 320 may determine a ranking score of each candidate video in the candidate video set based at least on a content score of the candidate video and the indication of the service configuration 310.
  • the ranking model 320 may acknowledge what types of candidate videos, e.g., whether visual information is important or audio information is important, should be given a higher ranking in the following selection of recommended videos. Through considering the content score of the candidate video, the ranking model 320 may decide whether this candidate video complies with the reference or criteria acknowledged before. Thus, the ranking model 320 may determine a ranking score of a candidate video in a consideration of importance of visual information and/or audio information, e.g., give a higher ranking score to a candidate video which has a content score complying with the indication of the service configuration 310. Through the ranking model 320, the candidate video set with respective ranking scores 340 may be obtained.
  • the ranking model 320 may be established based on various techniques, e.g., machine learning, deep learning, etc.
  • Features adopted by the ranking model 320 may comprise a content score of a candidate video, indication of a service configuration, together with any features adopted by the existing ranking models.
  • the ranking model 320 may be based on, e.g., a linear model, a logistic model, a decision tree model, a neural network model, etc.
  • recommended videos 350 may be selected from the candidate video set based at least on ranking scores of candidate videos in the candidate video set. For example, a plurality of highest ranked candidate videos may be selected as recommended videos.
  • the recommended videos 350 may be further provided to the user through a terminal device of the user.
  • FIG. 4 illustrates an exemplary process 400 for determining recommended videos according to an embodiment.
  • a preference score of the user is used as a reference factor for determining recommended videos.
  • a preference score 410 of the user may be obtained.
  • the preference score may indicate expectation degree of the user for visual information and/or audio information in a video to be recommended. That is, the preference score may indicate whether the user expects to obtain recommended videos with high importance of visual information or expects to obtain recommended videos with high importance of audio information. Assuming that the preference score ranges from 0 to 1, and the higher the score is, the higher importance of visual information the user expects, while the lower the score is, the higher importance of audio information the user expects. As an example, assuming that a preference score of the user is ā€œ0.9ā€ , since this score is much close to the maximum value ā€œ1ā€ , it indicates that the user is very expecting to obtain recommended videos with high importance of visual information.
  • the preference score may be determined based on at least one of: current time, current location, configuration of the terminal device of the user, operating state of the terminal device, and historical watching behaviors of the user.
  • the ā€œcurrent timeā€ refers to the current time point, time period of a day, date, day of the week, etc. when the user is accessing the client application or service providing website in which video recommendation is provided. Different ā€œcurrent timeā€ may reflect different expectations of the user. For example, if it is 11 PM now, the user may desire recommended videos with low importance of audio information so as to avoid disturbing other sleeping people.
  • the ā€œcurrent locationā€ refers to where the user is located now, e.g., home, office, subway, street, etc.
  • the current location of the user may be detected through various existing approaches, e.g., through GPS signals of the terminal device, through locating a WiFi device with which the terminal device is connecting, etc.
  • Different ā€œcurrent locationā€ may reflect different expectations of the user. For example, if the user is at home now, the user may desire recommended videos with both high importance of visual information and high importance of audio information, while if the user is at office now, the user may not desire recommend videos with high importance of audio information because it is inconvenient to hear audio information at office.
  • the ā€œconfiguration of the terminal deviceā€ may comprise at least one of: screen size, screen resolution, loudspeaker available or not, and peripheral earphone connected or not, etc.
  • the configuration of the terminal device may restrict the user’s consumption of recommended videos. For example, if the terminal device only has a small screen size or a low screen resolution, it is not suitable to recommend videos with high importance of visual information. For example, if the loudspeaker of the terminal device is off now, it is not suitable to recommend videos with high importance of audio information.
  • the ā€œoperating state of the terminal deviceā€ may comprise at least one of operating in a mute mode, operating in a non-mute mode, operating in a driving mode, etc. For example, if the terminal device is in a mute mode, the user may desire recommended videos with high importance of visual information instead of recommended videos with high importance of audio information. If the terminal device is in a driving mode, e.g., the user of the terminal device is driving a car, the user may desire recommended videos with high importance of audio information.
  • the ā€œhistorical watching behaviors of the userā€ refers to the user’s historical watching actions of previous recommended videos. For example, if the user has watched five recently-recommended videos with high importance of visual information, it is very likely that the user may desire to obtain more recommended videos with high importance of visual information. For example, if during the recent week, the user has watched most of recommended videos with high importance of audio information, it may indicate that the user may expect to obtain more recommended videos with high importance of audio information.
  • any two or more of the above discussed current time, current location, configuration of the terminal device, operating state of the terminal device, and historical watching behaviors of the user may be combined together so as to determine the preference score of the user. For example, if the current location is the office, and the operating state of the terminal device is in a mute mode, then a preference score indicating a high expectation degree of the user for visual information in a video to be recommended may be determined. For example, if the current time is 11PM, and the historical watching behaviors of the user shows that the user has not watched the previously-recommended several videos with high importance of audio information at 11PM, then a preference score indicating a high expectation degree of the user for visual information in a video to be recommended may be determined.
  • the preference score may be determined only based on user state-related information, e.g., at least one of the current time, the current location, historical watching behaviors of the user, etc. In one case, the preference score may be determined only based on terminal device-related information, e.g., at least one of configuration of the terminal device, operating state of the terminal device, etc. In one case, the preference score may also be determined based on both the user state-related information and the terminal device-related information.
  • a user side model may be adopted for determining the preference score of the user as discussed above.
  • a user side model 420 is used for determining the preference score 410.
  • the user side model 420 may be established based on various techniques, e.g., machine learning, deep learning, etc.
  • Features adopted by the user side model 420 may comprise at least one of: time, location, configuration of the terminal device, operating state of the terminal device, and historical watching behaviors of the user, as discussed above.
  • the user side model 420 may be, e.g., a regression model, a classification model, etc.
  • the user side model 420 may be based on, e.g., a linear model, a logistic model, a decision tree model, a neural network model, etc.
  • Training data for the user side model 420 may be obtained from historical watching records of the user, wherein each historical watching record is associated with a watching action of a historical recommended video by the user.
  • Information corresponding to the features of the user side model may be obtained from a historical watching record, and a preference score may also be labeled for this historical watching record. The obtained information and the labeled preference score together may be used as a piece of training data. In this way, a set of training data may be formed based on a number of historical watching records of the user.
  • a user side model may be established for each terminal device. For example, assuming that the user has two terminal devices, a first user side model may be established based on user state-related information and the first terminal device-related information, and a second user side model may be established based on user state-related information and the second terminal device-related information.
  • the preference score of the user may be determined through a user side model corresponding to the terminal device currently-used by the user.
  • a ranking score of a candidate video may be determined based at least on a content score of the candidate video and the preference score 410.
  • the preference score 410 of the user may be provided to a ranking model 430 as a reference factor.
  • a candidate video set with content scores 440 may also be provided to the ranking model 430, wherein the candidate video set with content scores 440 corresponds to the candidate video set with content scores 240 in FIG. 2.
  • the ranking model 430 is similar with the ranking model 320, except that the reference factor in FIG. 4 is the preference score 410 instead of the service configuration 310.
  • the ranking model 430 may further adopt a content score of a candidate video and at least one reference factor, i.e., the preference score 410 in FIG. 4, as additional features. That is, the ranking model 430 may determine a ranking score of each candidate video in the candidate video set based at least on a content score of the candidate video and the preference score 410. Through considering the preference score 410, the ranking model 430 may acknowledge what types of candidate videos, e.g., whether visual information is important or audio information is important, are expected by the user. Through considering the content score of the candidate video, the ranking model 430 may decide whether this candidate video complies with the expectation of the user.
  • the ranking model 430 may determine a ranking score of a candidate video in a consideration of importance of visual information and/or audio information, e.g., give a higher ranking score to a candidate video which has a content score complying with the preference score 410.
  • the candidate video set with respective ranking scores 450 may be obtained.
  • recommended videos 460 may be selected from the candidate video set based at least on ranking scores of candidate videos in the candidate video set. Moreover, the recommended videos 460 may be further provided to the user through the terminal device of the user.
  • the preference score may be determined based on at least one of: current time, current location, configuration of the terminal device, operating state of the terminal device, and historical watching behaviors of the user, the preference score may also be determined in consideration any other factors that may be used for indicating expectation degree of the user for visual information and/or audio information in a video to be recommended.
  • the preference score may be determined further based on the user’s schedule, wherein events in the schedule may indicate whether the user desires recommended videos with high importance of visual information or with high importance of audio information.
  • a preference score indicating a high expectation degree of the user for visual information in a video to be recommended may be determined.
  • the preference score may be determined further based on the user’s physical condition, wherein the physical condition may indicate whether the user desires recommended videos with high importance of visual information or with high importance of audio information. For example, if the user is having an eye disease, then a preference score indicating a high expectation degree of the user for audio information in a video to be recommended may be determined.
  • FIG. 5 illustrates an exemplary process 500 for determining recommended videos according to an embodiment.
  • a user input from the user is used as a reference factor for determining recommended videos.
  • a user input 510 may be obtained from the user.
  • the user input may indicate expectation degree of the user for visual information and/or audio information in at least one video to be recommended. That is, the user input may indicate whether the user expects to obtain recommended videos with high importance of visual information or expects to obtain recommended videos with high importance of audio information.
  • the user input 510 may comprise a designation of preferred importance of visual information and/or audio information in at least one video to be recommended.
  • options of preferred importance may be provided in a user interface of the client application or service providing website, and the user may select one of the options in the user interface so as to designate preferred importance of visual information and/or audio information in at least one video to be recommended.
  • the designation of preferred importance by the user may indicate that whether the user expects to obtain recommended videos with high importance of audio information, and/or to obtain recommended videos with high importance of visual information.
  • the user input 510 may comprise a designation of category of at least one video to be recommended.
  • the user may designate, in a user interface of the client application or service providing website, at least one desired category of the at least one video to be recommended.
  • the designated category may be, e.g., ā€œfunnyā€ , ā€œeducationā€ , ā€œtalk showā€ , ā€œgameā€ , ā€œmusicā€ , ā€œnewsā€ , etc., which may indicate whether the user expects to obtain recommended videos with high importance of audio information, and/or to obtain recommended videos with high importance of visual information.
  • a category ā€œtalk showā€ is designated by the user, it may indicate that the user expects to obtain recommended videos with high importance of audio information.
  • a category ā€œgameā€ is designated by the user, it may indicate that the user expects to obtain recommended videos with high importance of visual information.
  • the user input 510 may comprise a query for searching videos.
  • the user may input a query in a user interface of the client application or service providing website so as to search one or more videos that the user is interested.
  • an exemplary query may be ā€œAmerican presidential election speechā€ which indicates that the user wants to search some speech videos related to the American presidential election.
  • the query may explicitly or implicitly indicate whether the user expects to obtain recommended videos with high importance of visual information, and/or to obtain recommended videos with high importance of audio information.
  • the keyword ā€œspeechā€ in the query may explicitly indicate that the user expects to obtain recommended videos with high importance of audio information.
  • the keyword ā€œmagic showā€ may explicitly indicate that the user expects to obtain recommended videos with high importance of visual information.
  • the query may explicitly indicate that the user expects to obtain recommended videos with high importance of visual information.
  • the user input 510 is not limited to comprise any one or more of the designation of preferred importance, the designation of category, and the query as discussed above, but may comprise any other types of input from the user which can indicate expectation degree of the user for visual information and/or audio information in at least one video to be recommended.
  • a ranking score of a candidate video may be determined based at least on a content score of the candidate video and the user input 510.
  • the user input 510 of the user may be provided to a ranking model 520 as a reference factor.
  • a candidate video set with content scores 530 may also be provided to the ranking model 520, wherein the candidate video set with content scores 530 corresponds to the candidate video set with content scores 240 in FIG. 2.
  • the ranking model 520 is similar with the ranking model 320, except that the reference factor in FIG. 5 is the user input 510 instead of the service configuration 310.
  • the ranking model 520 may further adopt a content score of a candidate video and at least one reference factor, i.e., the user input 510 in FIG. 5, as additional features. That is, the ranking model 520 may determine a ranking score of each candidate video in the candidate video set based at least on a content score of the candidate video and the user input 510. Through considering the user input 510, the ranking model 520 may acknowledge what types of candidate videos, e.g., whether visual information is important or audio information is important, are expected by the user. Through considering the content score of the candidate video, the ranking model 520 may decide whether this candidate video complies with the expectation of the user.
  • the ranking model 520 may determine a ranking score of a candidate video in a consideration of importance of visual information and/or audio information, e.g., give a higher ranking score to a candidate video which has a content score complying with the user input 510.
  • the candidate video set with respective ranking scores 540 may be obtained.
  • recommended videos 550 may be selected from the candidate video set based at least on ranking scores of candidate videos in the candidate video set. Moreover, the recommended videos 550 may be further provided to the user through the terminal device of the user.
  • FIG. 6 illustrates an exemplary process 600 for determining recommended videos according to an embodiment.
  • reference factors for determining recommended videos may comprise service configuration of the video recommendation, a preference score of the user and a user input from the user. That is, the process 600 may be deemed as a combination of the process 300 in FIG. 3, the process 400 in FIG. 4, and the process 500 in FIG. 5.
  • service configuration 610 of the video recommendation may be obtained, which may correspond to the service configuration 310 in FIG. 3.
  • a preference score 620 of the user may be obtained, which may correspond to the preference score 410 in FIG. 4.
  • a user input 630 may be obtained, which may correspond to the user input 510 in FIG. 5.
  • a ranking score of a candidate video may be determined based at least on a content score of the candidate video, the service configuration 610, the preference score 620 and the user input 630.
  • the service configuration 610, the preference score 620 and the user input 630 may be provided to a ranking model 640 as reference factors.
  • a candidate video set with content scores 650 may also be provided to the ranking model 640, wherein the candidate video set with content scores 650 corresponds to the candidate video set with content scores 240 in FIG. 2.
  • the ranking model 640 may further adopt a content score of a candidate video and at least one reference factor, i.e., the service configuration 610, the preference score 620 and the user input 630 in FIG. 6, as additional features. That is, the ranking model 520 may determine a ranking score of each candidate video in the candidate video set based at least on a content score of the candidate video and a combination of the service configuration 610, the preference score 620 and the user input 630. Through considering the combination of the service configuration 610, the preference score 620 and the user input 630, the ranking model 640 may acknowledge what types of candidate videos, e.g., whether visual information is important or audio information is important, shall be recommended to the user.
  • the service configuration 610 the preference score 620 and the user input 630
  • the ranking model 640 may determine a ranking score of a candidate video in a consideration of importance of visual information and/or audio information, e.g., give a higher ranking score to a candidate video which has a content score complying with the combination of the service configuration 610, the preference score 620 and the user input 630. Through the ranking model 640, the candidate video set with respective ranking scores 660 may be obtained.
  • recommended videos 670 may be selected from the candidate video set based at least on ranking scores of candidate videos in the candidate video set. Moreover, the recommended videos 670 may be further provided to the user through the terminal device of the user.
  • the process 600 may be changed in various approaches.
  • any two of the service configuration 610, the preference score 620 and the user input 630 may be adopted as reference factors for the video recommendation. That is to say, the embodiments of the present disclosure may utilize at least one of service configuration, preference score and user input as reference factors to be used for further determining recommended videos.
  • some embodiments of the present disclosure may determine recommended videos from a candidate video set based at least on reference factors and content scores of candidate videos.
  • the content scores of the candidate videos in the candidate video set may be firstly determined through, e.g., a content side model, and then the content scores of the candidate videos together with the reference factors may be used for determining ranking scores of the candidate videos through, e.g., a ranking model, wherein features adopted by the ranking model at least comprise at least one reference factor and a rank score of a candidate video.
  • the process of determining the content scores of the candidate videos in the candidate video may be omitted, i.e., recommended videos may be determined from the candidate video set based at least on reference factors.
  • a ranking model may be used for determining ranking scores of the candidate videos based at least on reference factors, wherein features adopted by the ranking model at least comprise at least one reference factor and those features adopted by the content side model in FIG. 2 to FIG. 6.
  • FIG. 7 illustrates an exemplary process 700 for determining recommended videos according to an embodiment.
  • At least one of a service configuration 710 of the video recommendation, a preference score 720 of the user and a user input 730 from the user may be obtained.
  • the service configuration 710, the preference score 720 and the user input 730 may correspond to the service configuration 310 in FIG. 3, the preference score 410 in FIG. 4 and the user input 510 in FIG. 5 respectively.
  • a ranking score of a candidate video may be determined based at least on at least one of the service configuration 710, the preference score 720 and the user input 730.
  • At least one of the service configuration 710, the preference score 720 and the user input 730 may be provided to a ranking model 740 as reference factors.
  • a candidate video set 750 may also be provided to the ranking model 740, wherein the candidate video set 750 may correspond to the candidate video set 220 in FIG. 2.
  • the ranking model 740 may be an improved version of any existing ranking models for video recommendation. Besides features adopted in the existing ranking models, the ranking model 740 may further adopt at least one reference factor, e.g., the service configuration 710, the preference score 720 and/or the user input 730 in FIG. 7, as additional features. Moreover, the ranking model 740 may further adopt those features adopted by the content side model in FIG. 2 to FIG. 6 as additional features, comprising at least one of shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute, and video metadata of a candidate video.
  • the ranking model 740 may further adopt at least one reference factor, e.g., the service configuration 710, the preference score 720 and/or the user input 730 in FIG. 7, as additional features.
  • the ranking model 740 may further adopt those features adopted by the content side model in FIG. 2 to FIG. 6 as additional features, comprising at least one of shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute
  • the ranking model 740 may acknowledge what types of candidate videos, e.g., whether visual information is important or audio information is important, shall be recommended to the user.
  • the ranking model 740 may decide whether this candidate video complies with preferred importance indicated by the at least one reference factor. Accordingly, the ranking model 740 may determine a ranking score of a candidate video in a consideration of importance of visual information and/or audio information.
  • recommended videos 770 may be selected from the candidate video set based at least on ranking scores of candidate videos in the candidate video set. Moreover, the recommended videos 770 may be further provided to the user through the terminal device of the user.
  • the ranking models in FIG. 3 to FIG. 7 may be configured for determining a ranking score of a candidate video further based on consumption condition of the candidate video by a number of other users. The more times the candidate video is consumed by other users, the higher ranking score the candidate video may get.
  • the ranking models in FIG. 3 to FIG. 7 may be configured for determining a ranking score of a candidate video further based on relevance between content of the candidate video and the user’s interests.
  • the user’s interests may be determined based on, e.g., historical watching records of the user. For example, the historical watching records of the user may indicate what categories or topics of video content the user is interested in.
  • a higher ranking score may be determined for the candidate video.
  • diversity of video recommendation may also be considered such that the selected recommended videos could have diversity in terms of content.
  • candidate videos in a candidate video set may be firstly ranked through any existing ranking models for video recommendation. Then a filtering operation may be performed on the ranked candidate videos, wherein the filtering operation may consider preferred importance of visual information and/or audio information in at least one video to be recommended.
  • the filtering operation may consider preferred importance of visual information and/or audio information in at least one video to be recommended.
  • at least one of the service configuration, the preference score and the user input as discussed above in FIG. 3 to FIG. 7 may be used by the filtering operation for filtering out those candidate videos not complying with the preferred importance of visual information and/or audio information in at least one video to be recommended.
  • the filtering operation may be implemented through a filter model which adopts features comprising at least one of service configuration, preference score and user input.
  • FIG. 8 illustrates a flowchart of an exemplary method 800 for providing video recommendation according to an embodiment.
  • At 810 at least one reference factor for the video recommendation may be determined, wherein the at least one reference factor indicates preferred importance of visual information and/or audio information in at least one video to be recommended.
  • a ranking score of each candidate video in a candidate video set may be determined based at least on the at least one reference factor.
  • At 830, at least one recommended video may be selected from the candidate video set based at least on ranking scores of candidate videos in the candidate video set.
  • the at least one recommended video may be provided to a user through a terminal device.
  • the at least one reference factor may comprise a preference score of the user, the preference score indicating expectation degree of the user for the visual information and/or the audio information in the at least one video to be recommended.
  • the preference score may be determined based on at least one of: current time, current location, configuration of the terminal device, operating state of the terminal device, and historical watching behaviors of the user.
  • the configuration of the terminal device may comprise at least one of: screen size, screen resolution, loudspeaker available or not, and peripheral earphone connected or not.
  • the operating state of the terminal device may comprise at least one of: operating in a mute mode, operating in a non-mute mode and operating in a driving mode.
  • the preference score may be determined through a user side model, the user side model adopting at least one of the following features: time, location, configuration of the terminal device, operating state of the terminal device, and historical watching behaviors of the user.
  • the at least one reference factor may comprise an indication of a default or current service configuration of the video recommendation.
  • the default or current service configuration may comprise providing the at least one video to be recommended in a mute mode or in a non-mute mode.
  • the at least one reference factor may comprise a user input from the user, the user input indicating expectation degree of the user for the visual information and/or the audio information in the at least one video to be recommended.
  • the user input may comprise at least one of: a designation of the preferred importance of the visual information and/or the audio information in the at least one video to be recommended; a designation of category of the at least one video to be recommended; and a query for searching videos.
  • the method 800 may further comprise: determining a content score of each candidate video in the candidate video set, the content score indicating importance of visual information and/or audio information in the candidate video.
  • the determining the ranking score of each candidate video may be further based on a content score of the candidate video.
  • the content score of each candidate video may be determined based on at least one of shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute, and video metadata of the candidate video.
  • the content score of each candidate video may be determined through a content side model, the content side model adopting at least one of the following features: shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute, and video metadata.
  • the content score of each candidate video may be determined through a content side model which is based on deep learning, the content side model being trained by a set of training data, each training data being formed by a video and a labeled content score indicating importance of visual information and/or audio information in the video.
  • the ranking score of each candidate video may be determined through a ranking model, the ranking model at least adopting the following features: at least one reference factor; and a content score of a candidate video.
  • the method 800 may further comprise: detecting at least one of shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute, and video metadata of each candidate video in the candidate video set.
  • the determining the ranking score of each candidate video may be further based on at least one of shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute, and video metadata of the candidate video.
  • the ranking score of each candidate video may be determined through a ranking model, the ranking model at least adopting the following features: at least one reference factor; and at least one of shot transition, camera motion, scene, human, human motion, object, object motion, text information, audio attribute, and video metadata of a candidate video.
  • the determining the ranking score of each candidate video may be further based on at least one of: consumption condition of the candidate video by a number of other users; and relevance between content of the candidate video and the user’s interests.
  • the video recommendation may be provided in a client application or service providing website.
  • the method 800 may further comprise any steps/processes for providing video recommendation according to the embodiments of the present disclosure as mentioned above.
  • FIG. 9 illustrates an exemplary apparatus 900 for providing video recommendation according to an embodiment.
  • the apparatus 900 may comprise: a reference factor determining module 910, for determining at least one reference factor for the video recommendation, the at least one reference factor indicating preferred importance of visual information and/or audio information in at least one video to be recommended; a ranking score determining module 920, for determining a ranking score of each candidate video in a candidate video set based at least on the at least one reference factor; a recommended video selecting module 930, for selecting at least one recommended video from the candidate video set based at least on ranking scores of candidate videos in the candidate video set; and a recommended video providing module 940, for providing the at least one recommended video to a user through a terminal device.
  • a reference factor determining module 910 for determining at least one reference factor for the video recommendation, the at least one reference factor indicating preferred importance of visual information and/or audio information in at least one video to be recommended
  • a ranking score determining module 920 for determining a ranking score of each candidate video in a candidate video set based at least on the at least one reference factor
  • the at least one reference factor may comprise at least one of: a preference score of the user; an indication of a default or current service configuration of the video recommendation; and a user input from the user.
  • the apparatus 900 may also comprise any other modules configured for providing video recommendation according to the embodiments of the present disclosure as mentioned above.
  • FIG. 10 illustrates an exemplary apparatus 1000 for providing video recommendation according to an embodiment.
  • the apparatus 1000 may comprise at least one processor 1010 and a memory 1020 storing computer-executable instructions.
  • the at least one processor 1010 may: determine at least one reference factor for the video recommendation, the at least one reference factor indicating preferred importance of visual information and/or audio information in at least one video to be recommended; determine a ranking score of each candidate video in a candidate video set based at least on the at least one reference factor; select at least one recommended video from the candidate video set based at least on ranking scores of candidate videos in the candidate video set; and provide the at least one recommended video to a user through a terminal device.
  • the at least one processor 1010 may be further configured for performing any operations of the methods for providing video recommendation according to the embodiments of the present disclosure as mentioned above.
  • a method for presenting recommended videos to a user is provided.
  • a user input may be received.
  • the received user input may correspond to, e.g., the user input 510 in FIG. 5, the user input 630 in FIG. 6, the user input 730 in FIG. 7, etc.
  • the operation of receiving the user input may comprise receiving, from the user, a designation of preferred importance of visual information and/or audio information in at least one video to be recommended. For example, when the user selects one of options of preferred importance provided in a user interface of the third party application or website, a designation of the preferred importance may be received.
  • the operation of receiving the user input may comprise receiving, from the user, a designation of category of at least one video to be recommended.
  • the operation of receiving the user input may comprise receiving, from the user, a query for searching videos. For example, when the user inputs a query in the user interface of the third party application or website so as to search videos that the user is interested, the query may be received.
  • the received user input may be used for identifying preferred importance of visual information and/or audio information in at least one video to be recommended, e.g., expectation degree of the user for visual information and/or audio information in at least one video to be recommended. For example, if a category ā€œtalk showā€ is designated in the user input, it may be identified that the user expects to obtain recommended videos with high importance of audio information. For example, if a query ā€œfamous magic showsā€ is included in the user input, it may be identified that the user expects to obtain recommended videos with high importance of visual information.
  • the identified preferred importance may be further used for determining at least one recommended video from a candidate video set.
  • those ranking approaches discussed above in FIG. 3 to FIG. 7 may be adopted here for ranking candidate videos in the candidate video set and further selecting the at least one recommended video from the ranked candidate videos.
  • the determined at least one recommended video may be presented to the user through the user interface.
  • a recommended video list may be formed and presented to the user.
  • the determined at least one recommended video may be used for updating the recommended video list.
  • An apparatus for presenting recommended videos to a user may be provided, which comprises various modules configured for performing any operations of the above method may be provided. Moreover, an apparatus for presenting recommended videos to a user may be provided, which comprises at least one processor and a memory storing computer-executable instructions, wherein the at least one processor may be configured for performing any operations of the above method.
  • a method for presenting recommended videos to a user is provided.
  • a service configuration of video recommendation may be detected.
  • the detected service configuration may correspond to, e.g., the service configuration 310 in FIG. 3.
  • the detected service configuration may be used for identifying preferred importance of visual information and/or audio information in at least one video to be recommended. For example, if the service configuration indicates that recommended videos shall be provided in a mute mode, it may be identified that those videos with high importance of visual information are preferred to be recommended.
  • the identified preferred importance may be further used for determining at least one recommended video from a candidate video set.
  • those ranking approaches discussed above in FIG. 3 to FIG. 7 may be adopted here for ranking candidate videos in the candidate video set and further selecting the at least one recommended video from the ranked candidate videos.
  • the determined at least one recommended video may be presented to the user through the user interface.
  • a recommended video list may be formed and presented to the user.
  • the determined at least one recommended video may be used for updating the recommended video list.
  • An apparatus for presenting recommended videos to a user may be provided, which comprises various modules configured for performing any operations of the above method may be provided. Moreover, an apparatus for presenting recommended videos to a user may be provided, which comprises at least one processor and a memory storing computer-executable instructions, wherein the at least one processor may be configured for performing any operations of the above method.
  • a method for presenting recommended videos to a user is provided.
  • a preference score of the user may be determined.
  • the preference score may correspond to, e.g., the preference score 410 in FIG. 4, and may be determined in a similar way as that discussed in FIG. 4.
  • the determined preference score may be used for identifying preferred importance of visual information and/or audio information in at least one video to be recommended, e.g., expectation degree of the user for visual information and/or audio information in a video to be recommended.
  • the preference score may indicate whether the user expects to obtain recommended videos with high importance of visual information or expects to obtain recommended videos with high importance of audio information.
  • the identified preferred importance may be further used for determining at least one recommended video from a candidate video set.
  • those ranking approaches discussed above in FIG. 3 to FIG. 7 may be adopted here for ranking candidate videos in the candidate video set and further selecting the at least one recommended video from the ranked candidate videos.
  • the determined at least one recommended video may be presented to the user through the user interface.
  • a recommended video list may be formed and presented to the user.
  • the determined at least one recommended video may be used for updating the recommended video list.
  • An apparatus for presenting recommended videos to a user may be provided, which comprises various modules configured for performing any operations of the above method may be provided. Moreover, an apparatus for presenting recommended videos to a user may be provided, which comprises at least one processor and a memory storing computer-executable instructions, wherein the at least one processor may be configured for performing any operations of the above method.
  • the embodiments of the present disclosure may be embodied in a non-transitory computer-readable medium.
  • the non-transitory computer-readable medium may comprise instructions that, when executed, cause one or more processors to perform any operations of the methods for providing video recommendation or for presenting recommended videos according to the embodiments of the present disclosure as mentioned above.
  • modules in the apparatuses described above may be implemented in various approaches. These modules may be implemented as hardware, software, or a combination thereof. Moreover, any of these modules may be further functionally divided into sub-modules or combined together.
  • processors have been described in connection with various apparatuses and methods. These processors may be implemented using electronic hardware, computer software, or any combination thereof. Whether such processors are implemented as hardware or software will depend upon the particular application and overall design constraints imposed on the system.
  • a processor, any portion of a processor, or any combination of processors presented in the present disclosure may be implemented with a microprocessor, microcontroller, digital signal processor (DSP) , a field-programmable gate array (FPGA) , a programmable logic device (PLD) , a state machine, gated logic, discrete hardware circuits, and other suitable processing components configured to perform the various functions described throughout the present disclosure.
  • DSP digital signal processor
  • FPGA field-programmable gate array
  • PLD programmable logic device
  • a state machine gated logic, discrete hardware circuits, and other suitable processing components configured to perform the various functions described throughout the present disclosure.
  • the functionality of a processor, any portion of a processor, or any combination of processors presented in the present disclosure may be
  • a computer-readable medium may include, by way of example, memory such as a magnetic storage device (e.g., hard disk, floppy disk, magnetic strip) , an optical disk, a smart card, a flash memory device, random access memory (RAM) , read only memory (ROM) , programmable ROM (PROM) , erasable PROM (EPROM) , electrically erasable PROM (EEPROM) , a register, or a removable disk.
  • RAM random access memory
  • ROM read only memory
  • PROM programmable ROM
  • EPROM erasable PROM
  • EEPROM electrically erasable PROM

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Graphics (AREA)
  • Human Computer Interaction (AREA)
  • Theoretical Computer Science (AREA)
  • Computing Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present disclosure provides method and apparatus for providing video recommendation. At least one reference factor for the video recommendation may be determined, wherein the at least one reference factor indicates preferred importance of visual information and/or audio information in at least one video to be recommended. A ranking score of each candidate video in a candidate video set may be determined based at least on the at least one reference factor. At least one recommended video may be selected from the candidate video set based at least on ranking scores of candidate videos in the candidate video set. The at least one recommended video may be provided to a user through a terminal device.

Description

PROVIDINGĀ VIDEOĀ RECOMMENDATION BACKGROUND
TheĀ developmentsĀ ofĀ theĀ networkĀ andĀ variousĀ digitalĀ devicesĀ haveĀ enabledĀ peopleĀ toĀ watchĀ videosĀ theyĀ likeĀ atĀ anyĀ time.Ā DueĀ toĀ theĀ convenienceĀ ofĀ creating,Ā editingĀ andĀ sharingĀ videos,Ā theĀ numberĀ ofĀ videosĀ availableĀ onĀ theĀ networkĀ isĀ enormousĀ andĀ growsĀ everyĀ day.Ā ThisĀ makesĀ itĀ moreĀ andĀ moreĀ difficultĀ toĀ findĀ contentsĀ inĀ whichĀ usersĀ areĀ mostĀ interested.Ā DueĀ toĀ limitedĀ timeĀ thatĀ theĀ usersĀ have,Ā effectiveĀ videoĀ recommendationĀ toĀ theĀ usersĀ becomesĀ moreĀ andĀ moreĀ important.
SUMMARY
ThisĀ SummaryĀ isĀ providedĀ toĀ introduceĀ aĀ selectionĀ ofĀ conceptsĀ thatĀ areĀ furtherĀ describedĀ belowĀ inĀ theĀ DetailedĀ Description.Ā ItĀ isĀ notĀ intendedĀ toĀ identifyĀ keyĀ featuresĀ orĀ essentialĀ featuresĀ ofĀ theĀ claimedĀ subjectĀ matter,Ā norĀ isĀ itĀ intendedĀ toĀ beĀ usedĀ toĀ limitĀ theĀ scopeĀ ofĀ theĀ claimedĀ subjectĀ matter.
EmbodimentsĀ ofĀ theĀ presentĀ disclosureĀ proposeĀ methodĀ andĀ apparatusĀ forĀ providingĀ videoĀ recommendation.Ā AtĀ leastĀ oneĀ referenceĀ factorĀ forĀ theĀ videoĀ recommendationĀ mayĀ beĀ determined,Ā whereinĀ theĀ atĀ leastĀ oneĀ referenceĀ factorĀ indicatesĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā AĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ aĀ candidateĀ videoĀ setĀ mayĀ beĀ determinedĀ basedĀ atĀ leastĀ onĀ theĀ atĀ leastĀ oneĀ referenceĀ factor.Ā AtĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ selectedĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set.Ā TheĀ atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ providedĀ toĀ aĀ userĀ throughĀ aĀ terminalĀ device.
ItĀ shouldĀ beĀ notedĀ thatĀ theĀ aboveĀ oneĀ orĀ moreĀ aspectsĀ compriseĀ theĀ featuresĀ hereinafterĀ fullyĀ describedĀ andĀ particularlyĀ pointedĀ outĀ inĀ theĀ claims.Ā TheĀ followingĀ descriptionĀ andĀ theĀ drawingsĀ setĀ forthĀ inĀ detailĀ certainĀ illustrativeĀ featuresĀ ofĀ theĀ oneĀ orĀ moreĀ aspects.Ā TheseĀ featuresĀ areĀ onlyĀ indicativeĀ ofĀ theĀ variousĀ waysĀ inĀ whichĀ theĀ principlesĀ ofĀ variousĀ aspectsĀ mayĀ beĀ employed,Ā andĀ thisĀ disclosureĀ isĀ intendedĀ toĀ includeĀ allĀ suchĀ aspectsĀ andĀ theirĀ equivalents.
BRIEFĀ DESCRIPTIONĀ OFĀ THEĀ DRAWINGS
TheĀ disclosedĀ aspectsĀ willĀ hereinafterĀ beĀ describedĀ inĀ connectionĀ withĀ theĀ  appendedĀ drawingsĀ thatĀ areĀ providedĀ toĀ illustrateĀ andĀ notĀ toĀ limitĀ theĀ disclosedĀ aspects.
FIG.Ā 1Ā illustratesĀ exemplaryĀ implementationĀ scenariosĀ ofĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ anĀ embodiment.
FIG.Ā 2Ā illustratesĀ anĀ exemplaryĀ processĀ forĀ determiningĀ contentĀ scoresĀ ofĀ candidateĀ videosĀ accordingĀ toĀ anĀ embodiment.
FIG.Ā 3Ā illustratesĀ anĀ exemplaryĀ processĀ forĀ determiningĀ recommendedĀ videosĀ accordingĀ toĀ anĀ embodiment.
FIG.Ā 4Ā illustratesĀ anĀ exemplaryĀ processĀ forĀ determiningĀ recommendedĀ videosĀ accordingĀ toĀ anĀ embodiment.
FIG.Ā 5Ā illustratesĀ anĀ exemplaryĀ processĀ forĀ determiningĀ recommendedĀ videosĀ accordingĀ toĀ anĀ embodiment.
FIG.Ā 6Ā illustratesĀ anĀ exemplaryĀ processĀ forĀ determiningĀ recommendedĀ videosĀ accordingĀ toĀ anĀ embodiment.
FIG.Ā 7Ā illustratesĀ anĀ exemplaryĀ processĀ forĀ determiningĀ recommendedĀ videosĀ accordingĀ toĀ anĀ embodiment.
FIG.Ā 8Ā illustratesĀ aĀ flowchartĀ ofĀ anĀ exemplaryĀ methodĀ forĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ anĀ embodiment.
FIG.Ā 9Ā illustratesĀ anĀ exemplaryĀ apparatusĀ forĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ anĀ embodiment.
FIG.Ā 10Ā illustratesĀ anĀ exemplaryĀ apparatusĀ forĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ anĀ embodiment.
DETAILEDĀ DESCRIPTION
TheĀ presentĀ disclosureĀ willĀ nowĀ beĀ discussedĀ withĀ referenceĀ toĀ severalĀ exampleĀ implementations.Ā ItĀ isĀ toĀ beĀ understoodĀ thatĀ theseĀ implementationsĀ areĀ discussedĀ onlyĀ forĀ enablingĀ thoseĀ skilledĀ inĀ theĀ artĀ toĀ betterĀ understandĀ andĀ thusĀ implementĀ theĀ embodimentsĀ ofĀ theĀ presentĀ disclosure,Ā ratherĀ thanĀ suggestingĀ anyĀ limitationsĀ onĀ theĀ scopeĀ ofĀ theĀ presentĀ disclosure.
ApplicationsĀ orĀ websitesĀ beingĀ capableĀ ofĀ accessingĀ variousĀ videoĀ resourcesĀ onĀ theĀ networkĀ mayĀ provideĀ videoĀ recommendationĀ toĀ users.Ā TheĀ applicationsĀ orĀ websitesĀ mayĀ beĀ newsĀ clientsĀ orĀ websites,Ā socialĀ networkingĀ applicationsĀ orĀ websites,Ā videoĀ platformsĀ clientsĀ orĀ websites,Ā searchĀ engineĀ clientsĀ orĀ websites,Ā etc.,Ā suchĀ as,Ā CNNĀ News,Ā Toutiao,Ā Facebook,Ā Youtube,Ā Youku,Ā Bing,Ā Baidu,Ā  etc.Ā TheĀ applicationsĀ orĀ websitesĀ mayĀ selectĀ aĀ pluralityĀ ofĀ videosĀ fromĀ theĀ videoĀ resourcesĀ onĀ theĀ networkĀ asĀ recommendedĀ videosĀ andĀ provideĀ theĀ recommendedĀ videosĀ toĀ usersĀ forĀ consumption.Ā WhenĀ determiningĀ whetherĀ aĀ videoĀ onĀ theĀ networkĀ shouldĀ beĀ selectedĀ asĀ aĀ recommendedĀ video,Ā thoseĀ existingĀ approachesĀ forĀ determiningĀ recommendedĀ videosĀ fromĀ theĀ videoĀ resourcesĀ onĀ theĀ networkĀ mayĀ considerĀ someĀ factors,Ā e.g.,Ā freshnessĀ ofĀ theĀ video,Ā popularityĀ ofĀ theĀ video,Ā clickĀ rateĀ ofĀ theĀ video,Ā videoĀ quality,Ā relevanceĀ betweenĀ contentĀ ofĀ theĀ videoĀ andĀ aĀ user’sĀ interests,Ā etc.Ā ForĀ example,Ā ifĀ theĀ videoĀ qualityĀ indicatesĀ thatĀ theĀ videoĀ comesĀ fromĀ anĀ entityĀ havingĀ aĀ highĀ authorityĀ and/orĀ theĀ videoĀ hasĀ aĀ highĀ definition,Ā thisĀ videoĀ isĀ moreĀ likelyĀ toĀ beĀ selectedĀ asĀ aĀ recommendedĀ video.Ā ForĀ example,Ā ifĀ theĀ contentĀ ofĀ theĀ videoĀ belongsĀ toĀ aĀ categoryĀ ofĀ footballĀ andĀ theĀ userĀ alwaysĀ showsĀ interestĀ inĀ football-relatedĀ videos,Ā i.e.,Ā thereĀ isĀ aĀ highĀ relevanceĀ betweenĀ theĀ contentĀ ofĀ theĀ videoĀ andĀ theĀ user’sĀ interests,Ā thisĀ videoĀ mayĀ beĀ recommendedĀ toĀ theĀ userĀ withĀ aĀ highĀ probability.
ItĀ isĀ knownĀ thatĀ aĀ videoĀ mayĀ compriseĀ visualĀ informationĀ andĀ audioĀ information,Ā whereinĀ theĀ visualĀ informationĀ indicatesĀ aĀ seriesĀ ofĀ picturesĀ beingĀ visuallyĀ presentedĀ inĀ theĀ video,Ā andĀ theĀ audioĀ informationĀ indicatesĀ voice,Ā sound,Ā music,Ā etc.Ā beingĀ presentedĀ inĀ anĀ audioĀ formĀ inĀ theĀ video.Ā InĀ someĀ cases,Ā whenĀ aĀ userĀ isĀ consumingĀ aĀ recommendedĀ videoĀ onĀ aĀ terminalĀ device,Ā itĀ mayĀ beĀ inconvenientĀ forĀ theĀ userĀ toĀ consumeĀ bothĀ visualĀ informationĀ andĀ audioĀ informationĀ ofĀ theĀ recommendedĀ video.Ā ForĀ example,Ā theĀ userĀ mayĀ beĀ preparingĀ dinnerĀ inĀ aĀ kitchen,Ā andĀ thusĀ theĀ userĀ canĀ keepĀ listeningĀ butĀ cannotĀ keepĀ watchingĀ aĀ screenĀ ofĀ theĀ terminalĀ device.Ā ForĀ example,Ā ifĀ itĀ isĀ eightĀ o’clockĀ inĀ theĀ morningĀ andĀ theĀ userĀ isĀ onĀ theĀ subwayĀ now,Ā theĀ userĀ mayĀ preferĀ toĀ consumeĀ visualĀ informationĀ ofĀ aĀ recommendedĀ videoĀ butĀ doesn’tĀ wantĀ anyĀ soundsĀ toĀ beĀ displayedĀ toĀ disturbĀ others.Ā ForĀ example,Ā assumingĀ thatĀ theĀ terminalĀ deviceĀ isĀ aĀ smartĀ phoneĀ andĀ theĀ smartĀ phoneĀ isĀ operatingĀ inĀ aĀ muteĀ mode,Ā andĀ thusĀ theĀ userĀ canĀ notĀ consumeĀ audioĀ informationĀ inĀ theĀ recommendedĀ video.Ā ForĀ example,Ā assumingĀ thatĀ theĀ terminalĀ deviceĀ isĀ aĀ smartĀ speakerĀ withĀ aĀ smallĀ screenĀ orĀ withĀ noĀ screen,Ā andĀ theĀ userĀ isĀ drivingĀ aĀ carĀ now,Ā andĀ thusĀ itĀ mayĀ beĀ notĀ suitableĀ forĀ theĀ userĀ toĀ consumeĀ visualĀ informationĀ inĀ theĀ recommendedĀ video.
EmbodimentsĀ ofĀ theĀ presentĀ disclosureĀ proposeĀ toĀ improveĀ videoĀ recommendationĀ throughĀ consideringĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ recommendedĀ videosĀ duringĀ determiningĀ theĀ recommendedĀ videos.Ā  Herein,Ā importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ aĀ videoĀ mayĀ indicate,Ā e.g.,Ā whetherĀ contentĀ ofĀ theĀ videoĀ isĀ conveyedĀ mainlyĀ byĀ theĀ visualĀ informationĀ and/orĀ theĀ audioĀ information,Ā whetherĀ theĀ visualĀ informationĀ orĀ theĀ audioĀ informationĀ isĀ theĀ mostĀ criticalĀ informationĀ inĀ theĀ video,Ā whetherĀ theĀ visualĀ informationĀ and/orĀ theĀ audioĀ informationĀ isĀ indispensableĀ orĀ necessaryĀ forĀ consumingĀ theĀ video,Ā etc.Ā ImportanceĀ ofĀ visualĀ informationĀ andĀ importanceĀ ofĀ audioĀ informationĀ mayĀ varyĀ forĀ differentĀ videos.Ā ForĀ example,Ā forĀ aĀ speechĀ video,Ā importanceĀ ofĀ audioĀ informationĀ isĀ higherĀ thanĀ importanceĀ ofĀ visualĀ informationĀ becauseĀ theĀ videoĀ presentsĀ contentĀ ofĀ theĀ speechĀ mainlyĀ inĀ anĀ audioĀ form.Ā ForĀ example,Ā forĀ aĀ videoĀ recordingĀ aĀ cuteĀ dog’sĀ activities,Ā audioĀ informationĀ mayĀ beĀ lessĀ importantĀ thanĀ visualĀ informationĀ becauseĀ theĀ videoĀ mayĀ presentĀ theĀ activitiesĀ ofĀ theĀ dogĀ mainlyĀ inĀ aĀ visualĀ form.Ā ForĀ example,Ā forĀ aĀ dancingĀ video,Ā bothĀ visualĀ informationĀ andĀ audioĀ informationĀ mayĀ beĀ importantĀ becauseĀ theĀ videoĀ mayĀ presentĀ danceĀ movementsĀ inĀ aĀ visualĀ formĀ andĀ meanwhileĀ presentĀ musicĀ inĀ anĀ audioĀ form.Ā ItĀ canĀ beĀ seenĀ that,Ā whenĀ aĀ userĀ isĀ consumingĀ aĀ video,Ā eitherĀ visualĀ informationĀ orĀ audioĀ informationĀ thatĀ hasĀ aĀ higherĀ importanceĀ mayĀ beĀ sufficientĀ forĀ theĀ userĀ toĀ acknowledgeĀ orĀ understandĀ contentĀ ofĀ theĀ video.
WhenĀ determiningĀ recommendedĀ videosĀ fromĀ aĀ pluralityĀ ofĀ candidateĀ videos,Ā theĀ embodimentsĀ ofĀ theĀ presentĀ disclosureĀ mayĀ decideĀ whetherĀ toĀ recommendĀ thoseĀ videosĀ havingĀ aĀ higherĀ importanceĀ ofĀ visualĀ information,Ā orĀ toĀ recommendĀ thoseĀ videosĀ havingĀ aĀ higherĀ importanceĀ ofĀ audioĀ information,Ā orĀ toĀ recommendĀ thoseĀ videosĀ havingĀ bothĀ aĀ highĀ importanceĀ ofĀ visualĀ informationĀ andĀ aĀ highĀ importanceĀ ofĀ audioĀ information,Ā andĀ accordinglyĀ selectĀ correspondingĀ candidateĀ videosĀ asĀ theĀ recommendedĀ videos.Ā ThroughĀ consideringĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ candidateĀ videosĀ duringĀ determiningĀ videosĀ toĀ beĀ recommended,Ā theĀ embodimentsĀ ofĀ theĀ presentĀ disclosureĀ mayĀ improveĀ aĀ ratioĀ ofĀ satisfactorilyĀ consumedĀ videosĀ inĀ theĀ videoĀ recommendation.
FIG.Ā 1Ā illustratesĀ exemplaryĀ implementationĀ scenariosĀ ofĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ anĀ embodiment.Ā ExemplaryĀ networkĀ architectureĀ 100Ā isĀ shownĀ inĀ FIG.Ā 1,Ā andĀ theĀ videoĀ recommendationĀ mayĀ beĀ providedĀ inĀ theĀ networkĀ architectureĀ 100.
InĀ theĀ networkĀ architectureĀ 100,Ā aĀ networkĀ 110Ā isĀ appliedĀ forĀ interconnectingĀ variousĀ networkĀ entities.Ā TheĀ networkĀ 110Ā mayĀ beĀ anyĀ typeĀ ofĀ  networksĀ capableĀ ofĀ interconnectingĀ networkĀ entities.Ā TheĀ networkĀ 110Ā mayĀ beĀ aĀ singleĀ networkĀ orĀ aĀ combinationĀ ofĀ variousĀ networks.Ā InĀ termsĀ ofĀ coverageĀ range,Ā theĀ networkĀ 110Ā mayĀ beĀ aĀ LocalĀ AreaĀ NetworkĀ (LAN)Ā ,Ā aĀ WideĀ AreaĀ NetworkĀ (WAN)Ā ,Ā etc.Ā InĀ termsĀ ofĀ carryingĀ medium,Ā theĀ networkĀ 110Ā mayĀ beĀ aĀ wirelineĀ network,Ā aĀ wirelessĀ network,Ā etc.Ā InĀ termsĀ ofĀ dataĀ switchingĀ techniques,Ā theĀ networkĀ 110Ā mayĀ beĀ aĀ circuitĀ switchingĀ network,Ā aĀ packetĀ switchingĀ network,Ā etc.
AsĀ shownĀ inĀ FIG.Ā 1,Ā aĀ videoĀ recommendationĀ serverĀ 120,Ā serviceĀ providingĀ websitesĀ 130,Ā videoĀ hostingĀ serversĀ 140,Ā videoĀ resourcesĀ 142,Ā  terminalĀ devices Ā 150Ā andĀ 160,Ā etc.Ā mayĀ connectĀ toĀ theĀ networkĀ 110.
TheĀ videoĀ recommendationĀ serverĀ 120Ā mayĀ beĀ configuredĀ forĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ theĀ embodimentsĀ ofĀ theĀ presentĀ disclosure,Ā e.g.,Ā determiningĀ recommendedĀ videosĀ andĀ providingĀ theĀ recommendedĀ videosĀ toĀ users.Ā InĀ thisĀ disclosure,Ā providingĀ recommendedĀ videosĀ mayĀ referĀ toĀ providingĀ linksĀ ofĀ theĀ recommendedĀ videos,Ā providingĀ graphicalĀ indicationsĀ containingĀ linksĀ ofĀ theĀ recommendedĀ videos,Ā displayingĀ atĀ leastĀ oneĀ ofĀ theĀ recommendedĀ videosĀ directly,Ā etc.
TheĀ serviceĀ providingĀ websitesĀ 130Ā exemplarilyĀ representĀ variousĀ websitesĀ thatĀ mayĀ provideĀ variousĀ servicesĀ toĀ users,Ā whereinĀ theĀ providedĀ servicesĀ mayĀ compriseĀ video-relatedĀ services.Ā ForĀ example,Ā theĀ serviceĀ providingĀ websitesĀ 130Ā mayĀ comprise,Ā e.g.,Ā aĀ newsĀ website,Ā aĀ socialĀ networkingĀ website,Ā aĀ videoĀ platformĀ website,Ā aĀ searchĀ engineĀ website,Ā etc.Ā Moreover,Ā theĀ serviceĀ providingĀ websitesĀ 130Ā mayĀ alsoĀ compriseĀ aĀ websiteĀ establishedĀ byĀ theĀ videoĀ recommendationĀ serverĀ 120.Ā WhenĀ theĀ usersĀ isĀ accessingĀ theĀ serviceĀ providingĀ websitesĀ 130,Ā theĀ serviceĀ providingĀ websitesĀ 130Ā mayĀ beĀ configuredĀ forĀ interactingĀ withĀ theĀ videoĀ recommendationĀ serverĀ 120,Ā obtainingĀ recommendedĀ videosĀ fromĀ theĀ videoĀ recommendationĀ serverĀ 120,Ā andĀ providingĀ theĀ recommendedĀ videosĀ toĀ theĀ users.Ā Thus,Ā theĀ videoĀ recommendationĀ serverĀ 120Ā mayĀ provideĀ videoĀ recommendationĀ inĀ theĀ servicesĀ providedĀ byĀ theĀ serviceĀ providingĀ websitesĀ 130.Ā ItĀ shouldĀ beĀ appreciatedĀ thatĀ althoughĀ theĀ videoĀ recommendationĀ serverĀ 120Ā isĀ exemplarilyĀ shownĀ asĀ separatedĀ fromĀ theĀ serviceĀ providingĀ websitesĀ 130Ā inĀ FIG.Ā 1,Ā functionalityĀ ofĀ theĀ videoĀ recommendationĀ serverĀ 120Ā mayĀ alsoĀ beĀ implementedĀ orĀ incorporatedĀ inĀ theĀ serviceĀ providingĀ websitesĀ 130.
TheĀ videoĀ hostingĀ serversĀ 140Ā exemplarilyĀ representĀ variousĀ networkĀ entitiesĀ capableĀ ofĀ managingĀ videos,Ā whichĀ supportĀ uploading,Ā storing,Ā displaying,Ā downloading,Ā orĀ sharingĀ ofĀ videos.Ā TheĀ videosĀ managedĀ byĀ theĀ videoĀ hostingĀ serversĀ  140Ā areĀ collectivelyĀ shownĀ asĀ theĀ videoĀ resourcesĀ 142.Ā TheĀ videoĀ resourcesĀ 142Ā mayĀ beĀ storedĀ orĀ maintainedĀ inĀ variousĀ databases,Ā cloudĀ storages,Ā etc.Ā TheĀ videoĀ resourcesĀ 142Ā mayĀ beĀ accessedĀ orĀ processedĀ byĀ theĀ videoĀ hostingĀ serversĀ 140.Ā ItĀ shouldĀ beĀ appreciatedĀ thatĀ althoughĀ theĀ videoĀ resourcesĀ 142Ā isĀ exemplarilyĀ shownĀ asĀ separatedĀ fromĀ theĀ videoĀ hostingĀ serversĀ 140Ā inĀ FIG.Ā 1,Ā theĀ videoĀ resourcesĀ 142Ā mayĀ alsoĀ beĀ incorporatedĀ inĀ theĀ videoĀ hostingĀ serversĀ 140.Ā Moreover,Ā althoughĀ notĀ shown,Ā functionalityĀ ofĀ theĀ videoĀ hostingĀ serversĀ 140Ā mayĀ alsoĀ beĀ implementedĀ orĀ incorporatedĀ inĀ theĀ serviceĀ providingĀ websitesĀ 130Ā orĀ theĀ videoĀ recommendationĀ serverĀ 120.Ā Furthermore,Ā aĀ partĀ ofĀ orĀ allĀ ofĀ theĀ videoĀ resourcesĀ 142Ā mayĀ alsoĀ beĀ possessed,Ā accessed,Ā storedĀ orĀ managedĀ byĀ theĀ serviceĀ providingĀ websitesĀ 130Ā orĀ theĀ videoĀ recommendationĀ serverĀ 120.
WhenĀ providingĀ videoĀ recommendation,Ā theĀ videoĀ recommendationĀ serverĀ 120Ā mayĀ accessĀ theĀ videoĀ resourcesĀ 142Ā andĀ determineĀ theĀ recommendedĀ videosĀ fromĀ theĀ videoĀ resourcesĀ 142.
TheĀ  terminalĀ devices Ā 150Ā andĀ 160Ā inĀ FIG.Ā 1Ā mayĀ beĀ anyĀ typeĀ ofĀ electronicĀ computingĀ devicesĀ capableĀ ofĀ connectingĀ toĀ theĀ networkĀ 110,Ā accessingĀ serversĀ orĀ websitesĀ onĀ theĀ networkĀ 110,Ā processingĀ dataĀ orĀ signals,Ā presentingĀ multimediaĀ contents,Ā etc.Ā ForĀ example,Ā theĀ  terminalĀ devices Ā 150Ā andĀ 160Ā mayĀ beĀ smartĀ phones,Ā desktopĀ computers,Ā laptops,Ā tablets,Ā AIĀ terminals,Ā wearableĀ devices,Ā smartĀ TVs,Ā smartĀ speakers,Ā etc.Ā AlthoughĀ twoĀ terminalĀ devicesĀ areĀ shownĀ inĀ FIG.Ā 1,Ā itĀ shouldĀ beĀ appreciatedĀ thatĀ aĀ differentĀ numberĀ ofĀ terminalĀ devicesĀ mayĀ connectĀ toĀ theĀ networkĀ 110.Ā TheĀ  terminalĀ devices Ā 150Ā andĀ 160Ā mayĀ beĀ usedĀ byĀ usersĀ forĀ obtainingĀ variousĀ servicesĀ providedĀ throughĀ theĀ networkĀ 110,Ā whereinĀ theĀ servicesĀ mayĀ compriseĀ videoĀ recommendation.
AsĀ anĀ example,Ā aĀ clientĀ applicationĀ 152Ā isĀ installedĀ inĀ theĀ terminalĀ deviceĀ 150,Ā whereinĀ theĀ clientĀ applicationĀ 152Ā representsĀ variousĀ applicationsĀ orĀ clientsĀ thatĀ mayĀ provideĀ servicesĀ toĀ aĀ userĀ ofĀ theĀ terminalĀ deviceĀ 150.Ā ForĀ example,Ā theĀ clientĀ applicationĀ 152Ā mayĀ be,Ā aĀ newsĀ client,Ā aĀ socialĀ networkingĀ application,Ā aĀ videoĀ platformĀ client,Ā aĀ searchĀ engineĀ client,Ā etc.Ā Moreover,Ā theĀ clientĀ applicationĀ 152Ā mayĀ alsoĀ beĀ aĀ clientĀ associatedĀ withĀ theĀ videoĀ recommendationĀ serverĀ 120.Ā TheĀ clientĀ applicationĀ 152Ā mayĀ communicateĀ withĀ aĀ correspondingĀ applicationĀ serverĀ toĀ provideĀ servicesĀ toĀ theĀ user.Ā InĀ aĀ circumstance,Ā whenĀ theĀ userĀ ofĀ theĀ terminalĀ deviceĀ 150Ā isĀ accessingĀ theĀ clientĀ applicationĀ 152,Ā theĀ clientĀ applicationĀ 152Ā mayĀ interactĀ withĀ theĀ  videoĀ recommendationĀ serverĀ 120,Ā obtainĀ recommendedĀ videosĀ fromĀ theĀ videoĀ recommendationĀ serverĀ 120,Ā andĀ provideĀ theĀ recommendedĀ videosĀ toĀ theĀ usersĀ withinĀ theĀ serviceĀ providedĀ byĀ theĀ clientĀ applicationĀ 152.Ā InĀ aĀ circumstance,Ā ifĀ theĀ functionalityĀ ofĀ theĀ videoĀ recommendationĀ serverĀ 120Ā isĀ implementedĀ orĀ incorporatedĀ inĀ theĀ applicationĀ serverĀ correspondingĀ toĀ theĀ clientĀ applicationĀ 152,Ā theĀ clientĀ applicationĀ 152Ā mayĀ receiveĀ recommendedĀ videosĀ fromĀ theĀ correspondingĀ applicationĀ server,Ā andĀ provideĀ theĀ recommendedĀ videosĀ toĀ theĀ users.
AsĀ anĀ example,Ā althoughĀ theĀ terminalĀ deviceĀ 160Ā isĀ notĀ shownĀ asĀ havingĀ installedĀ anyĀ clientĀ application,Ā aĀ userĀ ofĀ theĀ terminalĀ deviceĀ 160Ā mayĀ stillĀ obtainĀ variousĀ servicesĀ throughĀ accessingĀ websites,Ā e.g.,Ā theĀ serviceĀ providingĀ websitesĀ 130,Ā onĀ theĀ networkĀ 110.Ā DuringĀ theĀ userĀ isĀ accessingĀ theĀ serviceĀ providingĀ websitesĀ 130,Ā theĀ videoĀ recommendationĀ serverĀ 120Ā mayĀ determineĀ recommendedĀ videos,Ā andĀ theĀ recommendedĀ videosĀ mayĀ beĀ providedĀ toĀ theĀ userĀ withinĀ theĀ servicesĀ providedĀ byĀ theĀ serviceĀ providingĀ websitesĀ 130.
ItĀ shouldĀ beĀ appreciatedĀ that,Ā inĀ anyĀ ofĀ theĀ aboveĀ circumstances,Ā ifĀ theĀ userĀ ofĀ theĀ  terminalĀ device Ā 150Ā orĀ 160Ā makesĀ aĀ userĀ inputĀ inĀ theĀ clientĀ applicationĀ 152Ā orĀ onĀ theĀ serviceĀ providingĀ websitesĀ 130,Ā thisĀ userĀ inputĀ mayĀ alsoĀ beĀ providedĀ toĀ andĀ consideredĀ byĀ theĀ videoĀ recommendationĀ serverĀ 120Ā soĀ asĀ toĀ provideĀ recommendedĀ videos.
InĀ theĀ caseĀ thatĀ theĀ userĀ ofĀ theĀ terminalĀ deviceĀ 150Ā obtainsĀ recommendedĀ videosĀ throughĀ theĀ clientĀ applicationĀ 152,Ā whenĀ theĀ userĀ wantsĀ toĀ consumeĀ aĀ recommendedĀ video,Ā e.g.,Ā clicksĀ aĀ linkĀ orĀ aĀ graphicalĀ indicationĀ ofĀ theĀ recommendedĀ videoĀ inĀ theĀ clientĀ applicationĀ 152,Ā theĀ clientĀ applicationĀ 152Ā mayĀ communicateĀ withĀ theĀ videoĀ hostingĀ serversĀ 140Ā toĀ obtainĀ aĀ correspondingĀ videoĀ fileĀ andĀ thenĀ displayĀ theĀ videoĀ toĀ theĀ user.Ā InĀ theĀ caseĀ thatĀ theĀ userĀ ofĀ theĀ terminalĀ deviceĀ 160Ā obtainsĀ recommendedĀ videosĀ onĀ aĀ webĀ pageĀ providedĀ byĀ theĀ serviceĀ providingĀ websitesĀ 130,Ā whenĀ theĀ userĀ wantsĀ toĀ consumeĀ aĀ recommendedĀ video,Ā e.g.,Ā clicksĀ aĀ linkĀ orĀ aĀ graphicalĀ indicationĀ ofĀ theĀ recommendedĀ videoĀ onĀ theĀ webĀ pageĀ providedĀ byĀ theĀ serviceĀ providingĀ websitesĀ 130,Ā theĀ terminalĀ deviceĀ 160Ā mayĀ communicateĀ withĀ theĀ videoĀ hostingĀ serversĀ 140Ā toĀ obtainĀ aĀ correspondingĀ videoĀ fileĀ andĀ thenĀ displayĀ theĀ videoĀ toĀ theĀ user.Ā InĀ otherĀ cases,Ā whenĀ theĀ recommendedĀ videosĀ areĀ providedĀ toĀ theĀ userĀ eitherĀ inĀ theĀ clientĀ applicationĀ 152Ā orĀ onĀ theĀ webĀ pageĀ providedĀ byĀ theĀ serviceĀ providingĀ websitesĀ 130,Ā anyĀ ofĀ theĀ recommendedĀ videosĀ mayĀ alsoĀ beĀ displayedĀ toĀ theĀ  userĀ directly.
Moreover,Ā itĀ shouldĀ beĀ appreciatedĀ thatĀ allĀ theĀ entitiesĀ orĀ unitsĀ shownĀ inĀ FIG.Ā 1Ā andĀ allĀ theĀ implementationĀ scenariosĀ discussedĀ aboveĀ areĀ exemplary,Ā andĀ dependingĀ onĀ specificĀ requirements,Ā anyĀ otherĀ entitiesĀ orĀ unitsĀ mayĀ beĀ involvedĀ inĀ theĀ networkĀ architectureĀ 100Ā andĀ anyĀ otherĀ implementationĀ scenariosĀ mayĀ beĀ coveredĀ byĀ theĀ presentĀ disclosure.
AccordingĀ toĀ someĀ embodimentsĀ ofĀ theĀ presentĀ disclosure,Ā importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ eachĀ candidateĀ videoĀ inĀ aĀ pluralityĀ ofĀ candidateĀ videosĀ mayĀ beĀ determinedĀ inĀ advance,Ā whereinĀ recommendedĀ videosĀ areĀ toĀ beĀ selectedĀ fromĀ theĀ pluralityĀ ofĀ candidateĀ videos.Ā WhenĀ determiningĀ theĀ recommendedĀ videosĀ fromĀ theĀ pluralityĀ ofĀ candidateĀ videos,Ā theĀ embodimentsĀ ofĀ theĀ presentĀ disclosureĀ mayĀ selectĀ candidateĀ videosĀ asĀ theĀ recommendedĀ videosĀ basedĀ atĀ leastĀ onĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ eachĀ candidateĀ video.
FIG.Ā 2Ā illustratesĀ anĀ exemplaryĀ processĀ 200Ā forĀ determiningĀ contentĀ scoresĀ ofĀ candidateĀ videosĀ accordingĀ toĀ anĀ embodiment.Ā Herein,Ā aĀ contentĀ scoreĀ ofĀ aĀ videoĀ isĀ usedĀ forĀ indicatingĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ theĀ video.
VideoĀ resourcesĀ 210Ā onĀ theĀ networkĀ mayĀ provideĀ aĀ numberĀ ofĀ variousĀ videos,Ā fromĀ whichĀ recommendedĀ videosĀ mayĀ beĀ selectedĀ andĀ providedĀ toĀ users.Ā TheĀ videoĀ resourcesĀ 210Ā inĀ FIG.Ā 2Ā mayĀ correspondĀ toĀ theĀ videoĀ resourcesĀ 142Ā inĀ FIG.Ā 1.
VideosĀ providedĀ byĀ theĀ videoĀ resourcesĀ 210Ā mayĀ formĀ aĀ candidateĀ videoĀ setĀ 220.Ā TheĀ candidateĀ videoĀ setĀ 220Ā comprisesĀ aĀ numberĀ ofĀ videosĀ actingĀ asĀ candidatesĀ ofĀ recommendedĀ videos.
AccordingĀ toĀ theĀ embodimentĀ ofĀ theĀ presentĀ disclosure,Ā aĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ setĀ 220Ā mayĀ beĀ determined.
InĀ anĀ implementation,Ā aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ mayĀ compriseĀ twoĀ separateĀ subĀ scoresĀ orĀ aĀ vectorĀ formedĀ byĀ theĀ twoĀ separateĀ subĀ scores,Ā oneĀ subĀ scoreĀ indicatingĀ importanceĀ ofĀ visualĀ informationĀ inĀ theĀ candidateĀ video,Ā anotherĀ subĀ scoreĀ indicatingĀ importanceĀ ofĀ audioĀ informationĀ inĀ theĀ candidateĀ video.Ā AsĀ anĀ example,Ā assumingĀ thatĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ isĀ denotedĀ asĀ [0.8,Ā 0.3]Ā ,Ā theĀ firstĀ subĀ scoreĀ ā€œ0.8ā€Ā mayĀ indicateĀ importanceĀ ofĀ visualĀ informationĀ inĀ theĀ candidateĀ video,Ā andĀ theĀ secondĀ subĀ scoreĀ ā€œ0.3ā€Ā mayĀ indicateĀ importanceĀ ofĀ audioĀ  informationĀ inĀ theĀ candidateĀ video.Ā Furthermore,Ā assumingĀ thatĀ subĀ scoresĀ rangeĀ fromĀ 0Ā toĀ 1,Ā andĀ aĀ higherĀ subĀ scoreĀ indicatesĀ higherĀ importance.Ā Thus,Ā inĀ theĀ previousĀ example,Ā theĀ visualĀ informationĀ wouldĀ beĀ ofĀ highĀ importanceĀ forĀ theĀ candidateĀ video,Ā sinceĀ theĀ firstĀ subĀ scoreĀ ā€œ0.8ā€Ā isĀ veryĀ closeĀ toĀ theĀ maximumĀ scoreĀ ā€œ1ā€Ā ,Ā whileĀ theĀ audioĀ informationĀ wouldĀ beĀ ofĀ lowĀ importanceĀ forĀ theĀ candidateĀ video,Ā sinceĀ theĀ secondĀ subĀ scoreĀ ā€œ0.3ā€Ā isĀ closeĀ toĀ theĀ minimumĀ scoreĀ ā€œ0ā€Ā .Ā ThatĀ is,Ā forĀ thisĀ candidateĀ video,Ā theĀ visualĀ informationĀ isĀ muchĀ moreĀ importantĀ thanĀ theĀ audioĀ information,Ā andĀ accordinglyĀ contentĀ ofĀ thisĀ candidateĀ videoĀ mayĀ beĀ conveyedĀ mainlyĀ byĀ theĀ visualĀ information.Ā AsĀ anotherĀ example,Ā assumingĀ thatĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ isĀ denotedĀ asĀ [0.8,Ā 0.7]Ā ,Ā theĀ firstĀ subĀ scoreĀ ā€œ0.8ā€Ā mayĀ indicateĀ importanceĀ ofĀ visualĀ informationĀ inĀ theĀ candidateĀ video,Ā andĀ theĀ secondĀ subĀ scoreĀ ā€œ0.7ā€Ā mayĀ indicateĀ importanceĀ ofĀ audioĀ informationĀ inĀ theĀ candidateĀ video.Ā SinceĀ bothĀ theĀ firstĀ subĀ scoreĀ ā€œ0.8ā€Ā andĀ theĀ secondĀ subĀ scoreĀ ā€œ0.7ā€Ā areĀ closeĀ toĀ theĀ maximumĀ scoreĀ ā€œ1ā€Ā ,Ā bothĀ theĀ visualĀ informationĀ andĀ theĀ audioĀ informationĀ inĀ theĀ candidateĀ videoĀ haveĀ highĀ importance.Ā ThatĀ is,Ā contentĀ ofĀ thisĀ candidateĀ videoĀ shouldĀ beĀ conveyedĀ byĀ bothĀ theĀ visualĀ informationĀ andĀ theĀ audioĀ information.
InĀ anĀ implementation,Ā aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ mayĀ compriseĀ aĀ singleĀ score,Ā whichĀ mayĀ indicateĀ aĀ relativeĀ importanceĀ degreeĀ betweenĀ visualĀ informationĀ andĀ audioĀ informationĀ inĀ theĀ candidateĀ video.Ā AssumingĀ thatĀ thisĀ signalĀ scoreĀ rangesĀ fromĀ 0Ā toĀ 1,Ā andĀ theĀ higherĀ theĀ scoreĀ is,Ā theĀ higherĀ importanceĀ theĀ visualĀ informationĀ hasĀ andĀ theĀ lowerĀ importanceĀ theĀ audioĀ informationĀ has,Ā whileĀ theĀ lowerĀ theĀ scoreĀ is,Ā theĀ higherĀ importanceĀ theĀ audioĀ informationĀ hasĀ andĀ theĀ lowerĀ importanceĀ theĀ visualĀ informationĀ has,Ā orĀ viceĀ versa.Ā AsĀ anĀ example,Ā assumingĀ thatĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ isĀ ā€œ0.9ā€Ā ,Ā sinceĀ thisĀ scoreĀ isĀ muchĀ closeĀ toĀ theĀ maximumĀ scoreĀ ā€œ1ā€Ā ,Ā itĀ indicatesĀ thatĀ visualĀ informationĀ inĀ thisĀ candidateĀ videoĀ isĀ muchĀ moreĀ importantĀ thanĀ theĀ audioĀ informationĀ inĀ thisĀ candidateĀ video.Ā AsĀ anĀ example,Ā assumingĀ thatĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ isĀ ā€œ0.3ā€Ā ,Ā sinceĀ thisĀ scoreĀ isĀ muchĀ closeĀ toĀ theĀ minimumĀ scoreĀ ā€œ0ā€Ā ,Ā itĀ indicatesĀ thatĀ audioĀ informationĀ inĀ thisĀ candidateĀ videoĀ isĀ moreĀ importantĀ thanĀ theĀ visualĀ informationĀ inĀ thisĀ candidateĀ video.Ā AsĀ anĀ example,Ā assumingĀ thatĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ isĀ ā€œ0.6ā€Ā ,Ā sinceĀ thisĀ scoreĀ isĀ onlyĀ aĀ bitĀ higherĀ thanĀ aĀ medianĀ scoreĀ ā€œ0.5ā€Ā ,Ā itĀ indicatesĀ thatĀ bothĀ visualĀ informationĀ andĀ audioĀ informationĀ inĀ thisĀ candidateĀ videoĀ areĀ importantĀ exceptĀ thatĀ theĀ visualĀ informationĀ isĀ aĀ littleĀ bitĀ moreĀ importantĀ thanĀ theĀ audioĀ information.
ItĀ shouldĀ beĀ appreciatedĀ thatĀ allĀ theĀ aboveĀ contentĀ scores,Ā subĀ scores,Ā scoreĀ ranges,Ā etc.Ā areĀ exemplary,Ā andĀ accordingĀ toĀ theĀ embodimentsĀ ofĀ theĀ presentĀ disclosure,Ā theĀ contentĀ scoreĀ mayĀ beĀ denotedĀ inĀ anyĀ otherĀ numeral,Ā character,Ā orĀ codeĀ formsĀ andĀ mayĀ beĀ definedĀ withĀ anyĀ otherĀ scoreĀ ranges.
AccordingĀ toĀ theĀ embodimentĀ ofĀ theĀ presentĀ disclosure,Ā aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ mayĀ beĀ determinedĀ basedĀ on,Ā e.g.,Ā atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ theĀ candidateĀ video.
TheĀ ā€œshotĀ transitionā€Ā refersĀ toĀ howĀ manyĀ timesĀ shotĀ transitionĀ occursĀ inĀ aĀ predeterminedĀ timeĀ periodĀ orĀ inĀ timeĀ durationĀ ofĀ theĀ candidateĀ video.Ā TakingĀ aĀ speechĀ videoĀ asĀ anĀ example,Ā aĀ cameraĀ mayĀ focusĀ onĀ aĀ lecturerĀ atĀ mostĀ timeĀ andĀ theĀ shotsĀ ofĀ audienceĀ mayĀ beĀ veryĀ few,Ā andĀ thusĀ shotĀ transitionĀ ofĀ thisĀ videoĀ wouldĀ beĀ veryĀ few.Ā TakingĀ aĀ travelĀ videoĀ asĀ example,Ā variousĀ sceneriesĀ mayĀ beĀ recordedĀ inĀ theĀ video,Ā e.g.,Ā aĀ longĀ shotĀ ofĀ aĀ mountain,Ā aĀ closeĀ shotĀ ofĀ aĀ river,Ā people’sĀ activitiesĀ onĀ theĀ grass,Ā etc.,Ā andĀ thusĀ thereĀ mayĀ beĀ manyĀ shotĀ transitionsĀ inĀ thisĀ video.Ā Usually,Ā moreĀ shotĀ transitionsĀ mayĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ aĀ candidateĀ video.Ā TheĀ shotĀ transitionĀ mayĀ beĀ detectedĀ amongĀ adjacentĀ framesĀ inĀ theĀ candidateĀ videoĀ throughĀ anyĀ existingĀ techniques.
TheĀ ā€œcameraĀ motionā€Ā refersĀ toĀ movementsĀ ofĀ aĀ cameraĀ inĀ theĀ candidateĀ video.Ā TheĀ cameraĀ motionĀ mayĀ beĀ characterizedĀ by,Ā e.g.,Ā timeĀ duration,Ā distance,Ā number,Ā etc.Ā ofĀ theĀ movementsĀ ofĀ theĀ camera.Ā TakingĀ aĀ speechĀ videoĀ asĀ anĀ example,Ā whenĀ theĀ cameraĀ capturesĀ aĀ lecturerĀ inĀ theĀ middleĀ ofĀ theĀ screen,Ā theĀ cameraĀ mayĀ keepĀ staticĀ forĀ aĀ longĀ timeĀ soĀ asĀ toĀ fixĀ theĀ pictureĀ ofĀ theĀ lecturerĀ inĀ theĀ middleĀ ofĀ theĀ screen,Ā andĀ duringĀ thisĀ timeĀ period,Ā noĀ cameraĀ motionĀ occurs.Ā TakingĀ aĀ videoĀ recordingĀ aĀ runningĀ dogĀ asĀ anĀ example,Ā theĀ cameraĀ mayĀ moveĀ alongĀ withĀ theĀ dog,Ā andĀ thusĀ cameraĀ motionĀ ofĀ thisĀ video,Ā e.g.,Ā timeĀ duration,Ā distanceĀ orĀ numberĀ ofĀ movementsĀ ofĀ theĀ camera,Ā wouldĀ beĀ veryĀ high.Ā Usually,Ā aĀ higherĀ cameraĀ motionĀ mayĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ aĀ candidateĀ video.Ā TheĀ cameraĀ motionĀ mayĀ beĀ detectedĀ amongĀ adjacentĀ framesĀ inĀ theĀ candidateĀ videoĀ throughĀ anyĀ existingĀ techniques.
TheĀ ā€œsceneā€Ā refersĀ toĀ placesĀ orĀ locationsĀ atĀ whereĀ anĀ eventĀ isĀ happeningĀ inĀ theĀ candidateĀ video.Ā TheĀ sceneĀ mayĀ beĀ characterizedĀ by,Ā e.g.,Ā howĀ manyĀ scenesĀ occurĀ inĀ theĀ candidateĀ video.Ā ForĀ example,Ā ifĀ aĀ videoĀ recordsĀ anĀ indoorĀ picture,Ā aĀ carĀ picture,Ā andĀ aĀ footballĀ fieldĀ pictureĀ sequentially,Ā sinceĀ eachĀ ofĀ theĀ ā€œindoorĀ pictureā€Ā ,Ā ā€œcarĀ  pictureā€Ā ,Ā andĀ ā€œfootballĀ fieldĀ pictureā€Ā isĀ aĀ scene,Ā thisĀ videoĀ mayĀ beĀ determinedĀ asĀ includingĀ threeĀ scenes.Ā Usually,Ā moreĀ scenesĀ mayĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ aĀ candidateĀ video.Ā TheĀ scenesĀ inĀ theĀ candidateĀ videoĀ mayĀ beĀ detectedĀ throughĀ variousĀ existingĀ techniques.Ā ForĀ example,Ā theĀ scenesĀ inĀ theĀ candidateĀ videoĀ mayĀ beĀ detectedĀ throughĀ deepĀ learningĀ modelsĀ forĀ imageĀ categorization.Ā Moreover,Ā theĀ scenesĀ inĀ theĀ candidateĀ videoĀ mayĀ alsoĀ beĀ detectedĀ throughĀ performingĀ semanticĀ analysisĀ onĀ textĀ informationĀ derivedĀ fromĀ theĀ candidateĀ video.
TheĀ ā€œhumanā€Ā refersĀ toĀ persons,Ā characters,Ā etc.Ā appearingĀ inĀ theĀ candidateĀ video.Ā TheĀ humanĀ mayĀ beĀ characterizedĀ by,Ā e.g.,Ā howĀ manyĀ humanĀ beingsĀ appearĀ inĀ theĀ candidateĀ video,Ā whetherĀ aĀ specialĀ humanĀ beingsĀ isĀ appearingĀ inĀ theĀ candidateĀ video,Ā etc.Ā Usually,Ā moreĀ humanĀ beingsĀ mayĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ aĀ candidateĀ video.Ā Moreover,Ā ifĀ theĀ humanĀ beingsĀ appearedĀ inĀ theĀ candidateĀ videoĀ areĀ famousĀ celebrities,Ā e.g.,Ā movieĀ stars,Ā popĀ stars,Ā sportĀ stars,Ā etc.,Ā thisĀ mayĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ theĀ candidateĀ video.Ā TheĀ humanĀ beingsĀ inĀ theĀ candidateĀ videoĀ mayĀ beĀ detectedĀ throughĀ variousĀ existingĀ techniques,Ā e.g.,Ā deepĀ learningĀ modelsĀ forĀ faceĀ detection,Ā faceĀ recognition,Ā etc.
TheĀ ā€œhumanĀ motionā€Ā refersĀ toĀ movements,Ā actions,Ā etc.Ā ofĀ humanĀ beingsĀ inĀ theĀ candidateĀ video.Ā TheĀ humanĀ motionĀ mayĀ beĀ characterizedĀ by,Ā e.g.,Ā number,Ā timeĀ duration,Ā type,Ā etc.Ā ofĀ humanĀ motionsĀ appearingĀ inĀ theĀ candidateĀ video.Ā Usually,Ā moreĀ humanĀ motionsĀ andĀ long-timeĀ humanĀ motionsĀ mayĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ aĀ candidateĀ video.Ā Moreover,Ā someĀ typesĀ ofĀ humanĀ motions,Ā e.g.,Ā shootingĀ inĀ aĀ footballĀ game,Ā mayĀ alsoĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ aĀ candidateĀ video.Ā TheĀ humanĀ motionĀ mayĀ beĀ detectedĀ amongĀ adjacentĀ framesĀ inĀ theĀ candidateĀ videoĀ throughĀ anyĀ existingĀ techniques.
TheĀ ā€œobjectā€Ā refersĀ toĀ animals,Ā articles,Ā etc.Ā appearingĀ inĀ theĀ candidateĀ video.Ā TheĀ objectĀ mayĀ beĀ characterizedĀ by,Ā e.g.,Ā howĀ manyĀ objectsĀ appearĀ inĀ theĀ candidateĀ video,Ā whetherĀ specialĀ objectsĀ areĀ appearingĀ inĀ theĀ candidateĀ video.Ā Usually,Ā moreĀ objectsĀ mayĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ aĀ candidateĀ video.Ā Moreover,Ā someĀ specialĀ objects,Ā e.g.,Ā aĀ tiger,Ā aĀ turtle,Ā etc.,Ā mayĀ alsoĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ aĀ candidateĀ video.Ā TheĀ objectsĀ inĀ theĀ candidateĀ videoĀ mayĀ beĀ detectedĀ throughĀ variousĀ existingĀ techniques,Ā e.g.,Ā deepĀ learningĀ modelsĀ forĀ imageĀ detection,Ā etc.
TheĀ ā€œobjectĀ motionā€Ā refersĀ toĀ movements,Ā actions,Ā etc.Ā ofĀ objectsĀ inĀ theĀ  candidateĀ video.Ā TheĀ objectĀ motionĀ mayĀ beĀ characterizedĀ by,Ā e.g.,Ā number,Ā timeĀ duration,Ā area,Ā etc.Ā ofĀ objectĀ motionsĀ appearingĀ inĀ theĀ candidateĀ video.Ā Usually,Ā moreĀ objectĀ motionsĀ andĀ long-timeĀ objectĀ motionsĀ mayĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ aĀ candidateĀ video.Ā Moreover,Ā certainĀ areasĀ ofĀ objectĀ motionsĀ mayĀ alsoĀ indicateĀ moreĀ visualĀ informationĀ existingĀ inĀ aĀ candidateĀ video.Ā TheĀ objectĀ motionĀ mayĀ beĀ detectedĀ amongĀ adjacentĀ framesĀ inĀ theĀ candidateĀ videoĀ throughĀ anyĀ existingĀ techniques.
TheĀ ā€œtextĀ informationā€Ā refersĀ toĀ informativeĀ textsĀ inĀ theĀ candidateĀ video,Ā e.g.,Ā subtitles,Ā closedĀ captions,Ā embeddedĀ text,Ā etc.Ā TheĀ textĀ informationĀ mayĀ beĀ characterizedĀ by,Ā e.g.,Ā theĀ amountĀ ofĀ informativeĀ texts.Ā TakingĀ aĀ videoĀ ofĀ talkĀ showĀ asĀ anĀ example,Ā allĀ theĀ sentencesĀ spokenĀ byĀ attendeesĀ mayĀ beĀ shownĀ inĀ aĀ textĀ formĀ onĀ theĀ pictureĀ ofĀ theĀ video,Ā andĀ thusĀ thisĀ videoĀ mayĀ beĀ determinedĀ asĀ havingĀ aĀ largeĀ amountĀ ofĀ textĀ information.Ā TakingĀ aĀ cookingĀ videoĀ asĀ anĀ example,Ā duringĀ aĀ cookerĀ isĀ explainingĀ howĀ toĀ cookĀ aĀ dishĀ inĀ theĀ video,Ā stepsĀ ofĀ cookingĀ theĀ dishĀ mayĀ beĀ shownĀ inĀ aĀ textĀ formĀ onĀ theĀ pictureĀ ofĀ theĀ videoĀ synchronously,Ā andĀ thusĀ thisĀ videoĀ mayĀ beĀ determinedĀ asĀ havingĀ aĀ largeĀ amountĀ ofĀ textĀ information.Ā SinceĀ textĀ informationĀ isĀ usuallyĀ generatedĀ basedĀ atĀ leastĀ onĀ contentĀ inĀ aĀ candidateĀ videoĀ andĀ aĀ userĀ mayĀ understandĀ contentĀ inĀ theĀ candidateĀ videoĀ throughĀ theĀ textĀ informationĀ insteadĀ ofĀ correspondingĀ audioĀ information,Ā moreĀ textĀ informationĀ mayĀ indicateĀ lowerĀ importanceĀ ofĀ audioĀ informationĀ inĀ theĀ candidateĀ video.Ā TextĀ informationĀ inĀ theĀ candidateĀ videoĀ mayĀ beĀ detectedĀ throughĀ variousĀ existingĀ techniques.Ā ForĀ example,Ā subtitlesĀ andĀ closedĀ captionsĀ mayĀ beĀ detectedĀ throughĀ decodingĀ aĀ correspondingĀ textĀ fileĀ ofĀ theĀ candidateĀ video,Ā andĀ embeddedĀ text,Ā whichĀ hasĀ beenĀ mergedĀ withĀ theĀ pictureĀ ofĀ theĀ candidateĀ video,Ā mayĀ beĀ detectedĀ through,Ā e.g.,Ā OpticalĀ CharacterĀ RecognitionĀ (OCR)Ā ,Ā etc.
TheĀ ā€œaudioĀ attributeā€Ā refersĀ toĀ categoriesĀ ofĀ audioĀ appearingĀ inĀ theĀ candidateĀ video,Ā e.g.,Ā voice,Ā sing,Ā music,Ā etc.Ā VariousĀ audioĀ attributesĀ mayĀ indicateĀ differentĀ importanceĀ ofĀ audioĀ informationĀ inĀ theĀ candidateĀ video.Ā ForĀ example,Ā inĀ aĀ videoĀ recordingĀ aĀ girlĀ whoĀ isĀ singing,Ā theĀ audioĀ information,Ā i.e.,Ā singingĀ byĀ theĀ girl,Ā mayĀ indicateĀ aĀ highĀ importanceĀ ofĀ audioĀ information.Ā TheĀ audioĀ attributeĀ ofĀ theĀ candidateĀ videoĀ mayĀ beĀ detectedĀ basedĀ on,Ā e.g.,Ā audioĀ tracksĀ inĀ theĀ candidateĀ videoĀ throughĀ anyĀ existingĀ techniques.
TheĀ ā€œvideoĀ metadataā€Ā refersĀ toĀ descriptiveĀ informationĀ associatedĀ withĀ theĀ  candidateĀ videoĀ obtainedĀ fromĀ aĀ videoĀ resource,Ā comprising,Ā e.g.,Ā videoĀ category,Ā title,Ā etc.Ā TheĀ videoĀ categoryĀ mayĀ be,Ā e.g.,Ā ā€œfunnyā€Ā ,Ā ā€œeducationā€Ā ,Ā ā€œtalkĀ showā€Ā ,Ā ā€œgameā€Ā ,Ā ā€œmusicā€Ā ,Ā ā€œnewsā€Ā ,Ā etc.,Ā whichĀ mayĀ facilitateĀ toĀ determineĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ information.Ā TakingĀ aĀ gameĀ videoĀ asĀ anĀ example,Ā itĀ isĀ likelyĀ thatĀ visualĀ informationĀ inĀ theĀ videoĀ isĀ moreĀ importantĀ thanĀ audioĀ informationĀ inĀ theĀ video.Ā TakingĀ aĀ videoĀ ofĀ talkĀ showĀ asĀ example,Ā itĀ isĀ likelyĀ thatĀ audioĀ informationĀ inĀ theĀ videoĀ hasĀ aĀ highĀ importance.Ā TheĀ titleĀ ofĀ theĀ candidateĀ videoĀ mayĀ compriseĀ someĀ keywords,Ā e.g.,Ā ā€œsongā€Ā ,Ā ā€œinterviewā€Ā ,Ā ā€œspeechā€Ā ,Ā etc.,Ā whichĀ mayĀ facilitateĀ toĀ determineĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ information.Ā ForĀ example,Ā ifĀ theĀ titleĀ ofĀ theĀ candidateĀ videoĀ isĀ ā€œElectionĀ Speechā€Ā ,Ā itĀ isĀ veryĀ likelyĀ thatĀ audioĀ informationĀ inĀ theĀ candidateĀ videoĀ isĀ moreĀ importantĀ thanĀ visualĀ informationĀ inĀ theĀ candidateĀ video.
ItĀ shouldĀ beĀ appreciatedĀ thatĀ anyĀ twoĀ orĀ moreĀ ofĀ theĀ aboveĀ discussedĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ mayĀ beĀ combinedĀ togetherĀ soĀ asĀ toĀ determineĀ theĀ contentĀ scoreĀ ofĀ theĀ candidateĀ video.Ā ForĀ example,Ā forĀ aĀ videoĀ recordingĀ aĀ cuteĀ dog’sĀ activities,Ā thisĀ videoĀ mayĀ containĀ aĀ largeĀ amountĀ ofĀ cameraĀ motionsĀ andĀ objectĀ motionsĀ butĀ doesĀ notĀ includeĀ anyĀ speechĀ orĀ music,Ā andĀ thusĀ aĀ contentĀ scoreĀ indicatingĀ aĀ highĀ importanceĀ ofĀ visualĀ informationĀ mayĀ beĀ determinedĀ forĀ thisĀ video.Ā ForĀ example,Ā forĀ aĀ speechĀ video,Ā thisĀ videoĀ mayĀ containĀ aĀ longĀ time-durationĀ speech,Ā fewĀ shotĀ transition,Ā fewĀ cameraĀ motions,Ā fewĀ scenes,Ā aĀ titleĀ includingĀ aĀ keywordĀ ā€œspeechā€Ā ,Ā etc.,Ā andĀ thusĀ aĀ contentĀ scoreĀ indicatingĀ aĀ highĀ importanceĀ ofĀ audioĀ informationĀ mayĀ beĀ determinedĀ forĀ thisĀ video.
InĀ anĀ implementation,Ā aĀ contentĀ sideĀ modelĀ mayĀ beĀ adoptedĀ forĀ determiningĀ theĀ contentĀ scoreĀ ofĀ theĀ candidateĀ videoĀ asĀ discussedĀ above.Ā ForĀ example,Ā asĀ shownĀ inĀ FIG.Ā 2,Ā aĀ contentĀ sideĀ modelĀ 230Ā isĀ usedĀ forĀ determiningĀ aĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ setĀ 220.Ā TheĀ contentĀ sideĀ modelĀ 230Ā mayĀ beĀ establishedĀ basedĀ onĀ variousĀ techniques,Ā e.g.,Ā machineĀ learning,Ā deepĀ learning,Ā etc.Ā FeaturesĀ adoptedĀ byĀ theĀ contentĀ sideĀ modelĀ 230Ā mayĀ compriseĀ atĀ leastĀ oneĀ of:Ā shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadata,Ā asĀ discussedĀ above.Ā InĀ termsĀ ofĀ function,Ā theĀ contentĀ sideĀ modelĀ 230Ā mayĀ be,Ā e.g.,Ā aĀ regressionĀ model,Ā aĀ classificationĀ model,Ā etc.Ā InĀ termsĀ ofĀ structure,Ā theĀ contentĀ sideĀ modelĀ mayĀ beĀ basedĀ on,Ā e.g.,Ā aĀ linearĀ model,Ā aĀ logisticĀ model,Ā aĀ decisionĀ treeĀ model,Ā aĀ neuralĀ networkĀ model,Ā etc.Ā  TrainingĀ dataĀ forĀ theĀ contentĀ sideĀ modelĀ 230Ā mayĀ beĀ obtainedĀ through:Ā obtainingĀ aĀ groupĀ ofĀ videosĀ toĀ beĀ usedĀ forĀ training;Ā forĀ eachĀ videoĀ inĀ theĀ groupĀ ofĀ videos,Ā labelingĀ respectiveĀ valuesĀ correspondingĀ toĀ theĀ featuresĀ ofĀ theĀ contentĀ sideĀ model,Ā andĀ labelingĀ aĀ contentĀ scoreĀ forĀ theĀ video;Ā andĀ formingĀ trainingĀ dataĀ fromĀ theĀ groupĀ ofĀ videosĀ withĀ respectiveĀ labels.
InĀ FIG.Ā 2,Ā throughĀ theĀ contentĀ sideĀ modelĀ 230,Ā aĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ setĀ 220Ā mayĀ beĀ determined,Ā andĀ accordinglyĀ theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ contentĀ scoresĀ 240Ā mayĀ beĀ finallyĀ obtained,Ā whichĀ mayĀ beĀ furtherĀ usedĀ forĀ determiningĀ recommendedĀ videos.
InĀ theĀ aboveĀ discussion,Ā theĀ contentĀ sideĀ modelĀ 230Ā isĀ implementedĀ asĀ aĀ modelĀ whichĀ adoptsĀ featuresĀ comprisingĀ atĀ leastĀ oneĀ of:Ā shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadata.Ā However,Ā itĀ shouldĀ beĀ appreciatedĀ thatĀ theĀ contentĀ sideĀ modelĀ 230Ā mayĀ alsoĀ beĀ implementedĀ inĀ anyĀ otherĀ approaches.Ā ForĀ example,Ā theĀ contentĀ sideĀ modelĀ 230Ā mayĀ beĀ aĀ deepĀ learning-basedĀ model,Ā whichĀ canĀ determineĀ orĀ predictĀ aĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ directlyĀ basedĀ onĀ visualĀ and/orĀ audioĀ streamĀ ofĀ theĀ candidateĀ videoĀ withoutĀ extractingĀ anyĀ heuristicallyĀ designedĀ features.Ā ThisĀ contentĀ sideĀ modelĀ mayĀ beĀ trainedĀ byĀ aĀ setĀ ofĀ trainingĀ data.Ā EachĀ trainingĀ dataĀ mayĀ beĀ formedĀ byĀ aĀ videoĀ andĀ aĀ labeledĀ contentĀ scoreĀ indicatingĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ theĀ video.
AccordingĀ toĀ theĀ embodimentsĀ ofĀ theĀ presentĀ disclosure,Ā atĀ leastĀ oneĀ referenceĀ factorĀ mayĀ beĀ usedĀ forĀ theĀ videoĀ recommendation.Ā Herein,Ā aĀ referenceĀ factorĀ mayĀ indicateĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā ThatĀ is,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ mayĀ provideĀ referencesĀ orĀ criteriaĀ forĀ determiningĀ recommendedĀ videos.Ā ForĀ example,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ mayĀ indicateĀ whetherĀ toĀ recommendĀ thoseĀ videosĀ havingĀ aĀ higherĀ importanceĀ ofĀ visualĀ information,Ā orĀ toĀ recommendĀ thoseĀ videosĀ havingĀ aĀ higherĀ importanceĀ ofĀ audioĀ information,Ā orĀ toĀ recommendĀ thoseĀ videosĀ havingĀ bothĀ aĀ highĀ importanceĀ ofĀ visualĀ informationĀ andĀ aĀ highĀ importanceĀ ofĀ audioĀ information.Ā TheĀ atĀ leastĀ oneĀ referenceĀ factorĀ mayĀ compriseĀ anĀ indicationĀ ofĀ aĀ defaultĀ orĀ currentĀ serviceĀ configurationĀ ofĀ theĀ videoĀ recommendation,Ā aĀ preferenceĀ scoreĀ ofĀ theĀ user,Ā aĀ userĀ inputĀ fromĀ theĀ user,Ā etc.,Ā whichĀ willĀ beĀ discussedĀ inĀ detailsĀ later.
FIG.Ā 3Ā illustratesĀ anĀ exemplaryĀ processĀ 300Ā forĀ determiningĀ recommendedĀ  videosĀ accordingĀ toĀ anĀ embodiment.Ā InĀ theĀ processĀ 300,Ā anĀ indicationĀ ofĀ serviceĀ configurationĀ ofĀ theĀ videoĀ recommendationĀ isĀ usedĀ asĀ aĀ referenceĀ factorĀ forĀ determiningĀ recommendedĀ videos.
AccordingĀ toĀ theĀ processĀ 300,Ā serviceĀ configurationĀ 310Ā ofĀ theĀ videoĀ recommendationĀ mayĀ beĀ obtained.Ā TheĀ serviceĀ configurationĀ 310Ā refersĀ toĀ configurationĀ aboutĀ howĀ toĀ provideĀ recommendedĀ videosĀ toĀ aĀ userĀ whichĀ isĀ setĀ inĀ aĀ clientĀ applicationĀ orĀ serviceĀ providingĀ website.Ā TheĀ serviceĀ configurationĀ 310Ā mayĀ beĀ aĀ defaultĀ serviceĀ configurationĀ ofĀ theĀ videoĀ recommendation,Ā orĀ aĀ currentĀ serviceĀ configurationĀ ofĀ theĀ videoĀ recommendation.Ā InĀ anĀ implementation,Ā theĀ serviceĀ configurationĀ 310Ā mayĀ compriseĀ providingĀ recommendedĀ videosĀ inĀ aĀ muteĀ mode,Ā orĀ providingĀ recommendedĀ videosĀ inĀ aĀ non-muteĀ mode.Ā ForĀ example,Ā asĀ forĀ theĀ caseĀ ofĀ providingĀ recommendedĀ videosĀ inĀ aĀ muteĀ mode,Ā thoseĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ informationĀ areĀ suitableĀ toĀ beĀ recommended,Ā whereasĀ thoseĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ informationĀ areĀ notĀ suitableĀ toĀ beĀ recommendedĀ sinceĀ theĀ audioĀ informationĀ cannotĀ beĀ displayedĀ toĀ theĀ user.
AccordingĀ toĀ theĀ processĀ 300,Ā aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ mayĀ beĀ determinedĀ basedĀ atĀ leastĀ onĀ aĀ contentĀ scoreĀ ofĀ theĀ candidateĀ videoĀ andĀ anĀ indicationĀ ofĀ theĀ serviceĀ configurationĀ 310.Ā InĀ anĀ implementation,Ā theĀ indicationĀ ofĀ theĀ serviceĀ configurationĀ 310Ā mayĀ beĀ providedĀ toĀ aĀ rankingĀ modelĀ 320Ā asĀ aĀ referenceĀ factor.Ā Moreover,Ā aĀ candidateĀ videoĀ setĀ withĀ contentĀ scoresĀ 330Ā mayĀ alsoĀ beĀ providedĀ toĀ theĀ rankingĀ modelĀ 320,Ā whereinĀ theĀ candidateĀ videoĀ setĀ withĀ contentĀ scoresĀ 330Ā correspondsĀ toĀ theĀ candidateĀ videoĀ setĀ withĀ contentĀ scoresĀ 240Ā inĀ FIG.Ā 2.Ā TheĀ rankingĀ modelĀ 320Ā mayĀ beĀ anĀ improvedĀ versionĀ ofĀ anyĀ existingĀ rankingĀ modelsĀ forĀ videoĀ recommendation.Ā TheĀ existingĀ rankingĀ modelsĀ mayĀ determineĀ aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ basedĀ onĀ featuresĀ ofĀ freshnessĀ ofĀ theĀ video,Ā popularityĀ ofĀ theĀ video,Ā clickĀ rateĀ ofĀ theĀ video,Ā videoĀ quality,Ā relevanceĀ betweenĀ contentĀ ofĀ theĀ videoĀ andĀ theĀ user’sĀ interests,Ā etc.Ā BesidesĀ theĀ featuresĀ adoptedĀ inĀ theĀ existingĀ rankingĀ models,Ā theĀ rankingĀ modelĀ 320Ā mayĀ furtherĀ adoptĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ andĀ atĀ leastĀ oneĀ referenceĀ factor,Ā i.e.,Ā theĀ indicationĀ ofĀ theĀ serviceĀ configurationĀ 310Ā inĀ FIG.Ā 3,Ā asĀ additionalĀ features.Ā ThatĀ is,Ā theĀ rankingĀ modelĀ 320Ā mayĀ determineĀ aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ aĀ contentĀ scoreĀ ofĀ theĀ candidateĀ videoĀ andĀ theĀ indicationĀ ofĀ theĀ serviceĀ configurationĀ 310.Ā ThroughĀ consideringĀ theĀ indicationĀ ofĀ theĀ serviceĀ configurationĀ 310,Ā theĀ rankingĀ modelĀ 320Ā  mayĀ acknowledgeĀ whatĀ typesĀ ofĀ candidateĀ videos,Ā e.g.,Ā whetherĀ visualĀ informationĀ isĀ importantĀ orĀ audioĀ informationĀ isĀ important,Ā shouldĀ beĀ givenĀ aĀ higherĀ rankingĀ inĀ theĀ followingĀ selectionĀ ofĀ recommendedĀ videos.Ā ThroughĀ consideringĀ theĀ contentĀ scoreĀ ofĀ theĀ candidateĀ video,Ā theĀ rankingĀ modelĀ 320Ā mayĀ decideĀ whetherĀ thisĀ candidateĀ videoĀ compliesĀ withĀ theĀ referenceĀ orĀ criteriaĀ acknowledgedĀ before.Ā Thus,Ā theĀ rankingĀ modelĀ 320Ā mayĀ determineĀ aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ inĀ aĀ considerationĀ ofĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ information,Ā e.g.,Ā giveĀ aĀ higherĀ rankingĀ scoreĀ toĀ aĀ candidateĀ videoĀ whichĀ hasĀ aĀ contentĀ scoreĀ complyingĀ withĀ theĀ indicationĀ ofĀ theĀ serviceĀ configurationĀ 310.Ā ThroughĀ theĀ rankingĀ modelĀ 320,Ā theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ rankingĀ scoresĀ 340Ā mayĀ beĀ obtained.
TheĀ rankingĀ modelĀ 320Ā mayĀ beĀ establishedĀ basedĀ onĀ variousĀ techniques,Ā e.g.,Ā machineĀ learning,Ā deepĀ learning,Ā etc.Ā FeaturesĀ adoptedĀ byĀ theĀ rankingĀ modelĀ 320Ā mayĀ compriseĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ video,Ā indicationĀ ofĀ aĀ serviceĀ configuration,Ā togetherĀ withĀ anyĀ featuresĀ adoptedĀ byĀ theĀ existingĀ rankingĀ models.Ā InĀ termsĀ ofĀ structure,Ā theĀ rankingĀ modelĀ 320Ā mayĀ beĀ basedĀ on,Ā e.g.,Ā aĀ linearĀ model,Ā aĀ logisticĀ model,Ā aĀ decisionĀ treeĀ model,Ā aĀ neuralĀ networkĀ model,Ā etc.
AccordingĀ toĀ theĀ processĀ 300,Ā afterĀ theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ rankingĀ scoresĀ 340Ā isĀ obtained,Ā recommendedĀ videosĀ 350Ā mayĀ beĀ selectedĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set.Ā ForĀ example,Ā aĀ pluralityĀ ofĀ highestĀ rankedĀ candidateĀ videosĀ mayĀ beĀ selectedĀ asĀ recommendedĀ videos.
TheĀ recommendedĀ videosĀ 350Ā mayĀ beĀ furtherĀ providedĀ toĀ theĀ userĀ throughĀ aĀ terminalĀ deviceĀ ofĀ theĀ user.
FIG.Ā 4Ā illustratesĀ anĀ exemplaryĀ processĀ 400Ā forĀ determiningĀ recommendedĀ videosĀ accordingĀ toĀ anĀ embodiment.Ā InĀ theĀ processĀ 400,Ā aĀ preferenceĀ scoreĀ ofĀ theĀ userĀ isĀ usedĀ asĀ aĀ referenceĀ factorĀ forĀ determiningĀ recommendedĀ videos.
AccordingĀ toĀ theĀ processĀ 400,Ā aĀ preferenceĀ scoreĀ 410Ā ofĀ theĀ userĀ mayĀ beĀ obtained.Ā TheĀ preferenceĀ scoreĀ mayĀ indicateĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ aĀ videoĀ toĀ beĀ recommended.Ā ThatĀ is,Ā theĀ preferenceĀ scoreĀ mayĀ indicateĀ whetherĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ informationĀ orĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.Ā AssumingĀ thatĀ theĀ preferenceĀ scoreĀ rangesĀ fromĀ 0Ā toĀ 1,Ā andĀ theĀ higherĀ theĀ scoreĀ is,Ā theĀ higherĀ importanceĀ ofĀ visualĀ  informationĀ theĀ userĀ expects,Ā whileĀ theĀ lowerĀ theĀ scoreĀ is,Ā theĀ higherĀ importanceĀ ofĀ audioĀ informationĀ theĀ userĀ expects.Ā AsĀ anĀ example,Ā assumingĀ thatĀ aĀ preferenceĀ scoreĀ ofĀ theĀ userĀ isĀ ā€œ0.9ā€Ā ,Ā sinceĀ thisĀ scoreĀ isĀ muchĀ closeĀ toĀ theĀ maximumĀ valueĀ ā€œ1ā€Ā ,Ā itĀ indicatesĀ thatĀ theĀ userĀ isĀ veryĀ expectingĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information.Ā TheĀ preferenceĀ scoreĀ mayĀ beĀ determinedĀ basedĀ onĀ atĀ leastĀ oneĀ of:Ā currentĀ time,Ā currentĀ location,Ā configurationĀ ofĀ theĀ terminalĀ deviceĀ ofĀ theĀ user,Ā operatingĀ stateĀ ofĀ theĀ terminalĀ device,Ā andĀ historicalĀ watchingĀ behaviorsĀ ofĀ theĀ user.
TheĀ ā€œcurrentĀ timeā€Ā refersĀ toĀ theĀ currentĀ timeĀ point,Ā timeĀ periodĀ ofĀ aĀ day,Ā date,Ā dayĀ ofĀ theĀ week,Ā etc.Ā whenĀ theĀ userĀ isĀ accessingĀ theĀ clientĀ applicationĀ orĀ serviceĀ providingĀ websiteĀ inĀ whichĀ videoĀ recommendationĀ isĀ provided.Ā DifferentĀ ā€œcurrentĀ timeā€Ā mayĀ reflectĀ differentĀ expectationsĀ ofĀ theĀ user.Ā ForĀ example,Ā ifĀ itĀ isĀ 11Ā PMĀ now,Ā theĀ userĀ mayĀ desireĀ recommendedĀ videosĀ withĀ lowĀ importanceĀ ofĀ audioĀ informationĀ soĀ asĀ toĀ avoidĀ disturbingĀ otherĀ sleepingĀ people.
TheĀ ā€œcurrentĀ locationā€Ā refersĀ toĀ whereĀ theĀ userĀ isĀ locatedĀ now,Ā e.g.,Ā home,Ā office,Ā subway,Ā street,Ā etc.Ā TheĀ currentĀ locationĀ ofĀ theĀ userĀ mayĀ beĀ detectedĀ throughĀ variousĀ existingĀ approaches,Ā e.g.,Ā throughĀ GPSĀ signalsĀ ofĀ theĀ terminalĀ device,Ā throughĀ locatingĀ aĀ WiFiĀ deviceĀ withĀ whichĀ theĀ terminalĀ deviceĀ isĀ connecting,Ā etc.Ā DifferentĀ ā€œcurrentĀ locationā€Ā mayĀ reflectĀ differentĀ expectationsĀ ofĀ theĀ user.Ā ForĀ example,Ā ifĀ theĀ userĀ isĀ atĀ homeĀ now,Ā theĀ userĀ mayĀ desireĀ recommendedĀ videosĀ withĀ bothĀ highĀ importanceĀ ofĀ visualĀ informationĀ andĀ highĀ importanceĀ ofĀ audioĀ information,Ā whileĀ ifĀ theĀ userĀ isĀ atĀ officeĀ now,Ā theĀ userĀ mayĀ notĀ desireĀ recommendĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ informationĀ becauseĀ itĀ isĀ inconvenientĀ toĀ hearĀ audioĀ informationĀ atĀ office.
TheĀ ā€œconfigurationĀ ofĀ theĀ terminalĀ deviceā€Ā mayĀ compriseĀ atĀ leastĀ oneĀ of:Ā screenĀ size,Ā screenĀ resolution,Ā loudspeakerĀ availableĀ orĀ not,Ā andĀ peripheralĀ earphoneĀ connectedĀ orĀ not,Ā etc.Ā TheĀ configurationĀ ofĀ theĀ terminalĀ deviceĀ mayĀ restrictĀ theĀ user’sĀ consumptionĀ ofĀ recommendedĀ videos.Ā ForĀ example,Ā ifĀ theĀ terminalĀ deviceĀ onlyĀ hasĀ aĀ smallĀ screenĀ sizeĀ orĀ aĀ lowĀ screenĀ resolution,Ā itĀ isĀ notĀ suitableĀ toĀ recommendĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information.Ā ForĀ example,Ā ifĀ theĀ loudspeakerĀ ofĀ theĀ terminalĀ deviceĀ isĀ offĀ now,Ā itĀ isĀ notĀ suitableĀ toĀ recommendĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.
TheĀ ā€œoperatingĀ stateĀ ofĀ theĀ terminalĀ deviceā€Ā mayĀ compriseĀ atĀ leastĀ oneĀ ofĀ  operatingĀ inĀ aĀ muteĀ mode,Ā operatingĀ inĀ aĀ non-muteĀ mode,Ā operatingĀ inĀ aĀ drivingĀ mode,Ā etc.Ā ForĀ example,Ā ifĀ theĀ terminalĀ deviceĀ isĀ inĀ aĀ muteĀ mode,Ā theĀ userĀ mayĀ desireĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ informationĀ insteadĀ ofĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.Ā IfĀ theĀ terminalĀ deviceĀ isĀ inĀ aĀ drivingĀ mode,Ā e.g.,Ā theĀ userĀ ofĀ theĀ terminalĀ deviceĀ isĀ drivingĀ aĀ car,Ā theĀ userĀ mayĀ desireĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.
TheĀ ā€œhistoricalĀ watchingĀ behaviorsĀ ofĀ theĀ userā€Ā refersĀ toĀ theĀ user’sĀ historicalĀ watchingĀ actionsĀ ofĀ previousĀ recommendedĀ videos.Ā ForĀ example,Ā ifĀ theĀ userĀ hasĀ watchedĀ fiveĀ recently-recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information,Ā itĀ isĀ veryĀ likelyĀ thatĀ theĀ userĀ mayĀ desireĀ toĀ obtainĀ moreĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information.Ā ForĀ example,Ā ifĀ duringĀ theĀ recentĀ week,Ā theĀ userĀ hasĀ watchedĀ mostĀ ofĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information,Ā itĀ mayĀ indicateĀ thatĀ theĀ userĀ mayĀ expectĀ toĀ obtainĀ moreĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.
ItĀ shouldĀ beĀ appreciatedĀ thatĀ anyĀ twoĀ orĀ moreĀ ofĀ theĀ aboveĀ discussedĀ currentĀ time,Ā currentĀ location,Ā configurationĀ ofĀ theĀ terminalĀ device,Ā operatingĀ stateĀ ofĀ theĀ terminalĀ device,Ā andĀ historicalĀ watchingĀ behaviorsĀ ofĀ theĀ userĀ mayĀ beĀ combinedĀ togetherĀ soĀ asĀ toĀ determineĀ theĀ preferenceĀ scoreĀ ofĀ theĀ user.Ā ForĀ example,Ā ifĀ theĀ currentĀ locationĀ isĀ theĀ office,Ā andĀ theĀ operatingĀ stateĀ ofĀ theĀ terminalĀ deviceĀ isĀ inĀ aĀ muteĀ mode,Ā thenĀ aĀ preferenceĀ scoreĀ indicatingĀ aĀ highĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ visualĀ informationĀ inĀ aĀ videoĀ toĀ beĀ recommendedĀ mayĀ beĀ determined.Ā ForĀ example,Ā ifĀ theĀ currentĀ timeĀ isĀ 11PM,Ā andĀ theĀ historicalĀ watchingĀ behaviorsĀ ofĀ theĀ userĀ showsĀ thatĀ theĀ userĀ hasĀ notĀ watchedĀ theĀ previously-recommendedĀ severalĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ informationĀ atĀ 11PM,Ā thenĀ aĀ preferenceĀ scoreĀ indicatingĀ aĀ highĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ visualĀ informationĀ inĀ aĀ videoĀ toĀ beĀ recommendedĀ mayĀ beĀ determined.Ā InĀ oneĀ case,Ā theĀ preferenceĀ scoreĀ mayĀ beĀ determinedĀ onlyĀ basedĀ onĀ userĀ state-relatedĀ information,Ā e.g.,Ā atĀ leastĀ oneĀ ofĀ theĀ currentĀ time,Ā theĀ currentĀ location,Ā historicalĀ watchingĀ behaviorsĀ ofĀ theĀ user,Ā etc.Ā InĀ oneĀ case,Ā theĀ preferenceĀ scoreĀ mayĀ beĀ determinedĀ onlyĀ basedĀ onĀ terminalĀ device-relatedĀ information,Ā e.g.,Ā atĀ leastĀ oneĀ ofĀ configurationĀ ofĀ theĀ terminalĀ device,Ā operatingĀ stateĀ ofĀ theĀ terminalĀ device,Ā etc.Ā InĀ oneĀ case,Ā theĀ preferenceĀ scoreĀ mayĀ alsoĀ beĀ determinedĀ basedĀ onĀ bothĀ theĀ userĀ state-relatedĀ informationĀ andĀ theĀ terminalĀ device-relatedĀ information.
InĀ anĀ implementation,Ā aĀ userĀ sideĀ modelĀ mayĀ beĀ adoptedĀ forĀ determiningĀ  theĀ preferenceĀ scoreĀ ofĀ theĀ userĀ asĀ discussedĀ above.Ā ForĀ example,Ā asĀ shownĀ inĀ FIG.Ā 4,Ā aĀ userĀ sideĀ modelĀ 420Ā isĀ usedĀ forĀ determiningĀ theĀ preferenceĀ scoreĀ 410.Ā TheĀ userĀ sideĀ modelĀ 420Ā mayĀ beĀ establishedĀ basedĀ onĀ variousĀ techniques,Ā e.g.,Ā machineĀ learning,Ā deepĀ learning,Ā etc.Ā FeaturesĀ adoptedĀ byĀ theĀ userĀ sideĀ modelĀ 420Ā mayĀ compriseĀ atĀ leastĀ oneĀ of:Ā time,Ā location,Ā configurationĀ ofĀ theĀ terminalĀ device,Ā operatingĀ stateĀ ofĀ theĀ terminalĀ device,Ā andĀ historicalĀ watchingĀ behaviorsĀ ofĀ theĀ user,Ā asĀ discussedĀ above.Ā InĀ termsĀ ofĀ function,Ā theĀ userĀ sideĀ modelĀ 420Ā mayĀ be,Ā e.g.,Ā aĀ regressionĀ model,Ā aĀ classificationĀ model,Ā etc.Ā InĀ termsĀ ofĀ structure,Ā theĀ userĀ sideĀ modelĀ 420Ā mayĀ beĀ basedĀ on,Ā e.g.,Ā aĀ linearĀ model,Ā aĀ logisticĀ model,Ā aĀ decisionĀ treeĀ model,Ā aĀ neuralĀ networkĀ model,Ā etc.Ā TrainingĀ dataĀ forĀ theĀ userĀ sideĀ modelĀ 420Ā mayĀ beĀ obtainedĀ fromĀ historicalĀ watchingĀ recordsĀ ofĀ theĀ user,Ā whereinĀ eachĀ historicalĀ watchingĀ recordĀ isĀ associatedĀ withĀ aĀ watchingĀ actionĀ ofĀ aĀ historicalĀ recommendedĀ videoĀ byĀ theĀ user.Ā InformationĀ correspondingĀ toĀ theĀ featuresĀ ofĀ theĀ userĀ sideĀ modelĀ mayĀ beĀ obtainedĀ fromĀ aĀ historicalĀ watchingĀ record,Ā andĀ aĀ preferenceĀ scoreĀ mayĀ alsoĀ beĀ labeledĀ forĀ thisĀ historicalĀ watchingĀ record.Ā TheĀ obtainedĀ informationĀ andĀ theĀ labeledĀ preferenceĀ scoreĀ togetherĀ mayĀ beĀ usedĀ asĀ aĀ pieceĀ ofĀ trainingĀ data.Ā InĀ thisĀ way,Ā aĀ setĀ ofĀ trainingĀ dataĀ mayĀ beĀ formedĀ basedĀ onĀ aĀ numberĀ ofĀ historicalĀ watchingĀ recordsĀ ofĀ theĀ user.
ItĀ shouldĀ beĀ appreciatedĀ thatĀ itĀ isĀ possibleĀ thatĀ theĀ userĀ possessesĀ moreĀ thanĀ oneĀ terminalĀ deviceĀ andĀ theĀ userĀ mayĀ useĀ anyĀ ofĀ theseĀ terminalĀ devicesĀ toĀ accessĀ theĀ clientĀ applicationĀ orĀ serviceĀ providingĀ website.Ā InĀ thisĀ case,Ā aĀ userĀ sideĀ modelĀ mayĀ beĀ establishedĀ forĀ eachĀ terminalĀ device.Ā ForĀ example,Ā assumingĀ thatĀ theĀ userĀ hasĀ twoĀ terminalĀ devices,Ā aĀ firstĀ userĀ sideĀ modelĀ mayĀ beĀ establishedĀ basedĀ onĀ userĀ state-relatedĀ informationĀ andĀ theĀ firstĀ terminalĀ device-relatedĀ information,Ā andĀ aĀ secondĀ userĀ sideĀ modelĀ mayĀ beĀ establishedĀ basedĀ onĀ userĀ state-relatedĀ informationĀ andĀ theĀ secondĀ terminalĀ device-relatedĀ information.Ā Thus,Ā theĀ preferenceĀ scoreĀ ofĀ theĀ userĀ mayĀ beĀ determinedĀ throughĀ aĀ userĀ sideĀ modelĀ correspondingĀ toĀ theĀ terminalĀ deviceĀ currently-usedĀ byĀ theĀ user.
AccordingĀ toĀ theĀ processĀ 400,Ā aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ mayĀ beĀ determinedĀ basedĀ atĀ leastĀ onĀ aĀ contentĀ scoreĀ ofĀ theĀ candidateĀ videoĀ andĀ theĀ preferenceĀ scoreĀ 410.Ā InĀ anĀ implementation,Ā theĀ preferenceĀ scoreĀ 410Ā ofĀ theĀ userĀ mayĀ beĀ providedĀ toĀ aĀ rankingĀ modelĀ 430Ā asĀ aĀ referenceĀ factor.Ā Moreover,Ā aĀ candidateĀ videoĀ setĀ withĀ contentĀ scoresĀ 440Ā mayĀ alsoĀ beĀ providedĀ toĀ theĀ rankingĀ modelĀ 430,Ā whereinĀ theĀ candidateĀ videoĀ setĀ withĀ contentĀ scoresĀ 440Ā correspondsĀ toĀ theĀ candidateĀ videoĀ setĀ  withĀ contentĀ scoresĀ 240Ā inĀ FIG.Ā 2.Ā TheĀ rankingĀ modelĀ 430Ā isĀ similarĀ withĀ theĀ rankingĀ modelĀ 320,Ā exceptĀ thatĀ theĀ referenceĀ factorĀ inĀ FIG.Ā 4Ā isĀ theĀ preferenceĀ scoreĀ 410Ā insteadĀ ofĀ theĀ serviceĀ configurationĀ 310.Ā BesidesĀ theĀ featuresĀ adoptedĀ inĀ theĀ existingĀ rankingĀ models,Ā theĀ rankingĀ modelĀ 430Ā mayĀ furtherĀ adoptĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ andĀ atĀ leastĀ oneĀ referenceĀ factor,Ā i.e.,Ā theĀ preferenceĀ scoreĀ 410Ā inĀ FIG.Ā 4,Ā asĀ additionalĀ features.Ā ThatĀ is,Ā theĀ rankingĀ modelĀ 430Ā mayĀ determineĀ aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ aĀ contentĀ scoreĀ ofĀ theĀ candidateĀ videoĀ andĀ theĀ preferenceĀ scoreĀ 410.Ā ThroughĀ consideringĀ theĀ preferenceĀ scoreĀ 410,Ā theĀ rankingĀ modelĀ 430Ā mayĀ acknowledgeĀ whatĀ typesĀ ofĀ candidateĀ videos,Ā e.g.,Ā whetherĀ visualĀ informationĀ isĀ importantĀ orĀ audioĀ informationĀ isĀ important,Ā areĀ expectedĀ byĀ theĀ user.Ā ThroughĀ consideringĀ theĀ contentĀ scoreĀ ofĀ theĀ candidateĀ video,Ā theĀ rankingĀ modelĀ 430Ā mayĀ decideĀ whetherĀ thisĀ candidateĀ videoĀ compliesĀ withĀ theĀ expectationĀ ofĀ theĀ user.Ā Thus,Ā theĀ rankingĀ modelĀ 430Ā mayĀ determineĀ aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ inĀ aĀ considerationĀ ofĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ information,Ā e.g.,Ā giveĀ aĀ higherĀ rankingĀ scoreĀ toĀ aĀ candidateĀ videoĀ whichĀ hasĀ aĀ contentĀ scoreĀ complyingĀ withĀ theĀ preferenceĀ scoreĀ 410.Ā ThroughĀ theĀ rankingĀ modelĀ 430,Ā theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ rankingĀ scoresĀ 450Ā mayĀ beĀ obtained.
AccordingĀ toĀ theĀ processĀ 400,Ā afterĀ theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ rankingĀ scoresĀ 450Ā isĀ obtained,Ā recommendedĀ videosĀ 460Ā mayĀ beĀ selectedĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set.Ā Moreover,Ā theĀ recommendedĀ videosĀ 460Ā mayĀ beĀ furtherĀ providedĀ toĀ theĀ userĀ throughĀ theĀ terminalĀ deviceĀ ofĀ theĀ user.
ItĀ shouldĀ beĀ appreciatedĀ thatĀ althoughĀ itĀ isĀ discussedĀ aboveĀ thatĀ theĀ preferenceĀ scoreĀ mayĀ beĀ determinedĀ basedĀ onĀ atĀ leastĀ oneĀ of:Ā currentĀ time,Ā currentĀ location,Ā configurationĀ ofĀ theĀ terminalĀ device,Ā operatingĀ stateĀ ofĀ theĀ terminalĀ device,Ā andĀ historicalĀ watchingĀ behaviorsĀ ofĀ theĀ user,Ā theĀ preferenceĀ scoreĀ mayĀ alsoĀ beĀ determinedĀ inĀ considerationĀ anyĀ otherĀ factorsĀ thatĀ mayĀ beĀ usedĀ forĀ indicatingĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ aĀ videoĀ toĀ beĀ recommended.Ā InĀ anĀ implementation,Ā theĀ preferenceĀ scoreĀ mayĀ beĀ determinedĀ furtherĀ basedĀ onĀ theĀ user’sĀ schedule,Ā whereinĀ eventsĀ inĀ theĀ scheduleĀ mayĀ indicateĀ whetherĀ theĀ userĀ desiresĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ informationĀ orĀ withĀ highĀ importanceĀ ofĀ audioĀ information.Ā ForĀ example,Ā ifĀ theĀ user’sĀ  scheduleĀ showsĀ thatĀ theĀ userĀ isĀ atĀ aĀ meetingĀ orĀ havingĀ lessonsĀ atĀ aĀ classroom,Ā thenĀ aĀ preferenceĀ scoreĀ indicatingĀ aĀ highĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ visualĀ informationĀ inĀ aĀ videoĀ toĀ beĀ recommendedĀ mayĀ beĀ determined.Ā InĀ anĀ implementation,Ā theĀ preferenceĀ scoreĀ mayĀ beĀ determinedĀ furtherĀ basedĀ onĀ theĀ user’sĀ physicalĀ condition,Ā whereinĀ theĀ physicalĀ conditionĀ mayĀ indicateĀ whetherĀ theĀ userĀ desiresĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ informationĀ orĀ withĀ highĀ importanceĀ ofĀ audioĀ information.Ā ForĀ example,Ā ifĀ theĀ userĀ isĀ havingĀ anĀ eyeĀ disease,Ā thenĀ aĀ preferenceĀ scoreĀ indicatingĀ aĀ highĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ audioĀ informationĀ inĀ aĀ videoĀ toĀ beĀ recommendedĀ mayĀ beĀ determined.
FIG.Ā 5Ā illustratesĀ anĀ exemplaryĀ processĀ 500Ā forĀ determiningĀ recommendedĀ videosĀ accordingĀ toĀ anĀ embodiment.Ā InĀ theĀ processĀ 500,Ā aĀ userĀ inputĀ fromĀ theĀ userĀ isĀ usedĀ asĀ aĀ referenceĀ factorĀ forĀ determiningĀ recommendedĀ videos.
AccordingĀ toĀ theĀ processĀ 500,Ā aĀ userĀ inputĀ 510Ā mayĀ beĀ obtainedĀ fromĀ theĀ user.Ā TheĀ userĀ inputĀ mayĀ indicateĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā ThatĀ is,Ā theĀ userĀ inputĀ mayĀ indicateĀ whetherĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ informationĀ orĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.
InĀ anĀ implementation,Ā theĀ userĀ inputĀ 510Ā mayĀ compriseĀ aĀ designationĀ ofĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā ForĀ example,Ā optionsĀ ofĀ preferredĀ importanceĀ mayĀ beĀ providedĀ inĀ aĀ userĀ interfaceĀ ofĀ theĀ clientĀ applicationĀ orĀ serviceĀ providingĀ website,Ā andĀ theĀ userĀ mayĀ selectĀ oneĀ ofĀ theĀ optionsĀ inĀ theĀ userĀ interfaceĀ soĀ asĀ toĀ designateĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā TheĀ designationĀ ofĀ preferredĀ importanceĀ byĀ theĀ userĀ mayĀ indicateĀ thatĀ whetherĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information,Ā and/orĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information.
InĀ anĀ implementation,Ā theĀ userĀ inputĀ 510Ā mayĀ compriseĀ aĀ designationĀ ofĀ categoryĀ ofĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā ForĀ example,Ā theĀ userĀ mayĀ designate,Ā inĀ aĀ userĀ interfaceĀ ofĀ theĀ clientĀ applicationĀ orĀ serviceĀ providingĀ website,Ā atĀ leastĀ oneĀ desiredĀ categoryĀ ofĀ theĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā TheĀ designatedĀ categoryĀ mayĀ be,Ā e.g.,Ā ā€œfunnyā€Ā ,Ā ā€œeducationā€Ā ,Ā ā€œtalkĀ showā€Ā ,Ā ā€œgameā€Ā ,Ā  ā€œmusicā€Ā ,Ā ā€œnewsā€Ā ,Ā etc.,Ā whichĀ mayĀ indicateĀ whetherĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information,Ā and/orĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information.Ā ForĀ example,Ā ifĀ aĀ categoryĀ ā€œtalkĀ showā€Ā isĀ designatedĀ byĀ theĀ user,Ā itĀ mayĀ indicateĀ thatĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.Ā ForĀ example,Ā ifĀ aĀ categoryĀ ā€œgameā€Ā isĀ designatedĀ byĀ theĀ user,Ā itĀ mayĀ indicateĀ thatĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information.
InĀ anĀ implementation,Ā theĀ userĀ inputĀ 510Ā mayĀ compriseĀ aĀ queryĀ forĀ searchingĀ videos.Ā ForĀ example,Ā whenĀ theĀ userĀ isĀ accessingĀ theĀ clientĀ applicationĀ orĀ serviceĀ providingĀ website,Ā theĀ userĀ mayĀ inputĀ aĀ queryĀ inĀ aĀ userĀ interfaceĀ ofĀ theĀ clientĀ applicationĀ orĀ serviceĀ providingĀ websiteĀ soĀ asĀ toĀ searchĀ oneĀ orĀ moreĀ videosĀ thatĀ theĀ userĀ isĀ interested.Ā ForĀ example,Ā anĀ exemplaryĀ queryĀ mayĀ beĀ ā€œAmericanĀ presidentialĀ electionĀ speechā€Ā whichĀ indicatesĀ thatĀ theĀ userĀ wantsĀ toĀ searchĀ someĀ speechĀ videosĀ relatedĀ toĀ theĀ AmericanĀ presidentialĀ election.Ā TheĀ queryĀ mayĀ explicitlyĀ orĀ implicitlyĀ indicateĀ whetherĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information,Ā and/orĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.Ā TakingĀ theĀ queryĀ ā€œAmericanĀ presidentialĀ electionĀ speechā€Ā asĀ anĀ example,Ā theĀ keywordĀ ā€œspeechā€Ā inĀ theĀ queryĀ mayĀ explicitlyĀ indicateĀ thatĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.Ā TakingĀ aĀ queryĀ ā€œfamousĀ magicĀ showsā€Ā asĀ anĀ example,Ā theĀ keywordĀ ā€œmagicĀ showā€Ā mayĀ explicitlyĀ indicateĀ thatĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information.Ā TakingĀ aĀ queryĀ ā€œsunsetĀ onĀ theĀ beachā€Ā asĀ anĀ example,Ā theĀ queryĀ mayĀ explicitlyĀ indicateĀ thatĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information.
ItĀ shouldĀ beĀ appreciatedĀ thatĀ theĀ userĀ inputĀ 510Ā isĀ notĀ limitedĀ toĀ compriseĀ anyĀ oneĀ orĀ moreĀ ofĀ theĀ designationĀ ofĀ preferredĀ importance,Ā theĀ designationĀ ofĀ category,Ā andĀ theĀ queryĀ asĀ discussedĀ above,Ā butĀ mayĀ compriseĀ anyĀ otherĀ typesĀ ofĀ inputĀ fromĀ theĀ userĀ whichĀ canĀ indicateĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.
AccordingĀ toĀ theĀ processĀ 500,Ā aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ mayĀ beĀ determinedĀ basedĀ atĀ leastĀ onĀ aĀ contentĀ scoreĀ ofĀ theĀ candidateĀ videoĀ andĀ theĀ userĀ inputĀ 510.Ā InĀ anĀ implementation,Ā theĀ userĀ inputĀ 510Ā ofĀ theĀ userĀ mayĀ beĀ providedĀ toĀ aĀ rankingĀ modelĀ 520Ā asĀ aĀ referenceĀ factor.Ā Moreover,Ā aĀ candidateĀ videoĀ setĀ withĀ contentĀ  scoresĀ 530Ā mayĀ alsoĀ beĀ providedĀ toĀ theĀ rankingĀ modelĀ 520,Ā whereinĀ theĀ candidateĀ videoĀ setĀ withĀ contentĀ scoresĀ 530Ā correspondsĀ toĀ theĀ candidateĀ videoĀ setĀ withĀ contentĀ scoresĀ 240Ā inĀ FIG.Ā 2.Ā TheĀ rankingĀ modelĀ 520Ā isĀ similarĀ withĀ theĀ rankingĀ modelĀ 320,Ā exceptĀ thatĀ theĀ referenceĀ factorĀ inĀ FIG.Ā 5Ā isĀ theĀ userĀ inputĀ 510Ā insteadĀ ofĀ theĀ serviceĀ configurationĀ 310.Ā BesidesĀ theĀ featuresĀ adoptedĀ inĀ theĀ existingĀ rankingĀ models,Ā theĀ rankingĀ modelĀ 520Ā mayĀ furtherĀ adoptĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ andĀ atĀ leastĀ oneĀ referenceĀ factor,Ā i.e.,Ā theĀ userĀ inputĀ 510Ā inĀ FIG.Ā 5,Ā asĀ additionalĀ features.Ā ThatĀ is,Ā theĀ rankingĀ modelĀ 520Ā mayĀ determineĀ aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ aĀ contentĀ scoreĀ ofĀ theĀ candidateĀ videoĀ andĀ theĀ userĀ inputĀ 510.Ā ThroughĀ consideringĀ theĀ userĀ inputĀ 510,Ā theĀ rankingĀ modelĀ 520Ā mayĀ acknowledgeĀ whatĀ typesĀ ofĀ candidateĀ videos,Ā e.g.,Ā whetherĀ visualĀ informationĀ isĀ importantĀ orĀ audioĀ informationĀ isĀ important,Ā areĀ expectedĀ byĀ theĀ user.Ā ThroughĀ consideringĀ theĀ contentĀ scoreĀ ofĀ theĀ candidateĀ video,Ā theĀ rankingĀ modelĀ 520Ā mayĀ decideĀ whetherĀ thisĀ candidateĀ videoĀ compliesĀ withĀ theĀ expectationĀ ofĀ theĀ user.Ā Thus,Ā theĀ rankingĀ modelĀ 520Ā mayĀ determineĀ aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ inĀ aĀ considerationĀ ofĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ information,Ā e.g.,Ā giveĀ aĀ higherĀ rankingĀ scoreĀ toĀ aĀ candidateĀ videoĀ whichĀ hasĀ aĀ contentĀ scoreĀ complyingĀ withĀ theĀ userĀ inputĀ 510.Ā ThroughĀ theĀ rankingĀ modelĀ 520,Ā theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ rankingĀ scoresĀ 540Ā mayĀ beĀ obtained.
AccordingĀ toĀ theĀ processĀ 500,Ā afterĀ theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ rankingĀ scoresĀ 540Ā isĀ obtained,Ā recommendedĀ videosĀ 550Ā mayĀ beĀ selectedĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set.Ā Moreover,Ā theĀ recommendedĀ videosĀ 550Ā mayĀ beĀ furtherĀ providedĀ toĀ theĀ userĀ throughĀ theĀ terminalĀ deviceĀ ofĀ theĀ user.
FIG.Ā 6Ā illustratesĀ anĀ exemplaryĀ processĀ 600Ā forĀ determiningĀ recommendedĀ videosĀ accordingĀ toĀ anĀ embodiment.Ā InĀ theĀ processĀ 600,Ā referenceĀ factorsĀ forĀ determiningĀ recommendedĀ videosĀ mayĀ compriseĀ serviceĀ configurationĀ ofĀ theĀ videoĀ recommendation,Ā aĀ preferenceĀ scoreĀ ofĀ theĀ userĀ andĀ aĀ userĀ inputĀ fromĀ theĀ user.Ā ThatĀ is,Ā theĀ processĀ 600Ā mayĀ beĀ deemedĀ asĀ aĀ combinationĀ ofĀ theĀ processĀ 300Ā inĀ FIG.Ā 3,Ā theĀ processĀ 400Ā inĀ FIG.Ā 4,Ā andĀ theĀ processĀ 500Ā inĀ FIG.Ā 5.
AccordingĀ toĀ theĀ processĀ 600,Ā serviceĀ configurationĀ 610Ā ofĀ theĀ videoĀ recommendationĀ mayĀ beĀ obtained,Ā whichĀ mayĀ correspondĀ toĀ theĀ serviceĀ configurationĀ 310Ā inĀ FIG.Ā 3.Ā AĀ preferenceĀ scoreĀ 620Ā ofĀ theĀ userĀ mayĀ beĀ obtained,Ā whichĀ mayĀ  correspondĀ toĀ theĀ preferenceĀ scoreĀ 410Ā inĀ FIG.Ā 4.Ā AĀ userĀ inputĀ 630Ā mayĀ beĀ obtained,Ā whichĀ mayĀ correspondĀ toĀ theĀ userĀ inputĀ 510Ā inĀ FIG.Ā 5.
AccordingĀ toĀ theĀ processĀ 600,Ā aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ mayĀ beĀ determinedĀ basedĀ atĀ leastĀ onĀ aĀ contentĀ scoreĀ ofĀ theĀ candidateĀ video,Ā theĀ serviceĀ configurationĀ 610,Ā theĀ preferenceĀ scoreĀ 620Ā andĀ theĀ userĀ inputĀ 630.Ā InĀ anĀ implementation,Ā theĀ serviceĀ configurationĀ 610,Ā theĀ preferenceĀ scoreĀ 620Ā andĀ theĀ userĀ inputĀ 630Ā mayĀ beĀ providedĀ toĀ aĀ rankingĀ modelĀ 640Ā asĀ referenceĀ factors.Ā Moreover,Ā aĀ candidateĀ videoĀ setĀ withĀ contentĀ scoresĀ 650Ā mayĀ alsoĀ beĀ providedĀ toĀ theĀ rankingĀ modelĀ 640,Ā whereinĀ theĀ candidateĀ videoĀ setĀ withĀ contentĀ scoresĀ 650Ā correspondsĀ toĀ theĀ candidateĀ videoĀ setĀ withĀ contentĀ scoresĀ 240Ā inĀ FIG.Ā 2.Ā BesidesĀ theĀ featuresĀ adoptedĀ inĀ theĀ existingĀ rankingĀ models,Ā theĀ rankingĀ modelĀ 640Ā mayĀ furtherĀ adoptĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ videoĀ andĀ atĀ leastĀ oneĀ referenceĀ factor,Ā i.e.,Ā theĀ serviceĀ configurationĀ 610,Ā theĀ preferenceĀ scoreĀ 620Ā andĀ theĀ userĀ inputĀ 630Ā inĀ FIG.Ā 6,Ā asĀ additionalĀ features.Ā ThatĀ is,Ā theĀ rankingĀ modelĀ 520Ā mayĀ determineĀ aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ aĀ contentĀ scoreĀ ofĀ theĀ candidateĀ videoĀ andĀ aĀ combinationĀ ofĀ theĀ serviceĀ configurationĀ 610,Ā theĀ preferenceĀ scoreĀ 620Ā andĀ theĀ userĀ inputĀ 630.Ā ThroughĀ consideringĀ theĀ combinationĀ ofĀ theĀ serviceĀ configurationĀ 610,Ā theĀ preferenceĀ scoreĀ 620Ā andĀ theĀ userĀ inputĀ 630,Ā theĀ rankingĀ modelĀ 640Ā mayĀ acknowledgeĀ whatĀ typesĀ ofĀ candidateĀ videos,Ā e.g.,Ā whetherĀ visualĀ informationĀ isĀ importantĀ orĀ audioĀ informationĀ isĀ important,Ā shallĀ beĀ recommendedĀ toĀ theĀ user.Ā Accordingly,Ā theĀ rankingĀ modelĀ 640Ā mayĀ determineĀ aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ inĀ aĀ considerationĀ ofĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ information,Ā e.g.,Ā giveĀ aĀ higherĀ rankingĀ scoreĀ toĀ aĀ candidateĀ videoĀ whichĀ hasĀ aĀ contentĀ scoreĀ complyingĀ withĀ theĀ combinationĀ ofĀ theĀ serviceĀ configurationĀ 610,Ā theĀ preferenceĀ scoreĀ 620Ā andĀ theĀ userĀ inputĀ 630.Ā ThroughĀ theĀ rankingĀ modelĀ 640,Ā theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ rankingĀ scoresĀ 660Ā mayĀ beĀ obtained.
AccordingĀ toĀ theĀ processĀ 600,Ā afterĀ theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ rankingĀ scoresĀ 660Ā isĀ obtained,Ā recommendedĀ videosĀ 670Ā mayĀ beĀ selectedĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set.Ā Moreover,Ā theĀ recommendedĀ videosĀ 670Ā mayĀ beĀ furtherĀ providedĀ toĀ theĀ userĀ throughĀ theĀ terminalĀ deviceĀ ofĀ theĀ user.
ItĀ shouldĀ beĀ appreciatedĀ thatĀ accordingĀ toĀ actualĀ requirements,Ā theĀ processĀ 600Ā mayĀ beĀ changedĀ inĀ variousĀ approaches.Ā ForĀ example,Ā anyĀ twoĀ ofĀ theĀ serviceĀ  configurationĀ 610,Ā theĀ preferenceĀ scoreĀ 620Ā andĀ theĀ userĀ inputĀ 630Ā mayĀ beĀ adoptedĀ asĀ referenceĀ factorsĀ forĀ theĀ videoĀ recommendation.Ā ThatĀ isĀ toĀ say,Ā theĀ embodimentsĀ ofĀ theĀ presentĀ disclosureĀ mayĀ utilizeĀ atĀ leastĀ oneĀ ofĀ serviceĀ configuration,Ā preferenceĀ scoreĀ andĀ userĀ inputĀ asĀ referenceĀ factorsĀ toĀ beĀ usedĀ forĀ furtherĀ determiningĀ recommendedĀ videos.
ItĀ isĀ discussedĀ aboveĀ inĀ connectionĀ withĀ FIG.Ā 2Ā toĀ FIG.Ā 6Ā thatĀ someĀ embodimentsĀ ofĀ theĀ presentĀ disclosureĀ mayĀ determineĀ recommendedĀ videosĀ fromĀ aĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ referenceĀ factorsĀ andĀ contentĀ scoresĀ ofĀ candidateĀ videos.Ā ForĀ example,Ā theĀ contentĀ scoresĀ ofĀ theĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ setĀ mayĀ beĀ firstlyĀ determinedĀ through,Ā e.g.,Ā aĀ contentĀ sideĀ model,Ā andĀ thenĀ theĀ contentĀ scoresĀ ofĀ theĀ candidateĀ videosĀ togetherĀ withĀ theĀ referenceĀ factorsĀ mayĀ beĀ usedĀ forĀ determiningĀ rankingĀ scoresĀ ofĀ theĀ candidateĀ videosĀ through,Ā e.g.,Ā aĀ rankingĀ model,Ā whereinĀ featuresĀ adoptedĀ byĀ theĀ rankingĀ modelĀ atĀ leastĀ compriseĀ atĀ leastĀ oneĀ referenceĀ factorĀ andĀ aĀ rankĀ scoreĀ ofĀ aĀ candidateĀ video.Ā However,Ā accordingĀ toĀ someĀ otherĀ embodimentsĀ ofĀ theĀ presentĀ disclosure,Ā theĀ processĀ ofĀ determiningĀ theĀ contentĀ scoresĀ ofĀ theĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ mayĀ beĀ omitted,Ā i.e.,Ā recommendedĀ videosĀ mayĀ beĀ determinedĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ referenceĀ factors.Ā AccordingĀ toĀ theseĀ embodiments,Ā aĀ rankingĀ modelĀ mayĀ beĀ usedĀ forĀ determiningĀ rankingĀ scoresĀ ofĀ theĀ candidateĀ videosĀ basedĀ atĀ leastĀ onĀ referenceĀ factors,Ā whereinĀ featuresĀ adoptedĀ byĀ theĀ rankingĀ modelĀ atĀ leastĀ compriseĀ atĀ leastĀ oneĀ referenceĀ factorĀ andĀ thoseĀ featuresĀ adoptedĀ byĀ theĀ contentĀ sideĀ modelĀ inĀ FIG.Ā 2Ā toĀ FIG.Ā 6.
FIG.Ā 7Ā illustratesĀ anĀ exemplaryĀ processĀ 700Ā forĀ determiningĀ recommendedĀ videosĀ accordingĀ toĀ anĀ embodiment.
AtĀ leastĀ oneĀ ofĀ aĀ serviceĀ configurationĀ 710Ā ofĀ theĀ videoĀ recommendation,Ā aĀ preferenceĀ scoreĀ 720Ā ofĀ theĀ userĀ andĀ aĀ userĀ inputĀ 730Ā fromĀ theĀ userĀ mayĀ beĀ obtained.Ā TheĀ serviceĀ configurationĀ 710,Ā theĀ preferenceĀ scoreĀ 720Ā andĀ theĀ userĀ inputĀ 730Ā mayĀ correspondĀ toĀ theĀ serviceĀ configurationĀ 310Ā inĀ FIG.Ā 3,Ā theĀ preferenceĀ scoreĀ 410Ā inĀ FIG.Ā 4Ā andĀ theĀ userĀ inputĀ 510Ā inĀ FIG.Ā 5Ā respectively.
AccordingĀ toĀ theĀ processĀ 700,Ā aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ mayĀ beĀ determinedĀ basedĀ atĀ leastĀ onĀ atĀ leastĀ oneĀ ofĀ theĀ serviceĀ configurationĀ 710,Ā theĀ preferenceĀ scoreĀ 720Ā andĀ theĀ userĀ inputĀ 730.
InĀ anĀ implementation,Ā atĀ leastĀ oneĀ ofĀ theĀ serviceĀ configurationĀ 710,Ā theĀ preferenceĀ scoreĀ 720Ā andĀ theĀ userĀ inputĀ 730Ā mayĀ beĀ providedĀ toĀ aĀ rankingĀ modelĀ 740Ā  asĀ referenceĀ factors.Ā Moreover,Ā aĀ candidateĀ videoĀ setĀ 750Ā mayĀ alsoĀ beĀ providedĀ toĀ theĀ rankingĀ modelĀ 740,Ā whereinĀ theĀ candidateĀ videoĀ setĀ 750Ā mayĀ correspondĀ toĀ theĀ candidateĀ videoĀ setĀ 220Ā inĀ FIG.Ā 2.
TheĀ rankingĀ modelĀ 740Ā mayĀ beĀ anĀ improvedĀ versionĀ ofĀ anyĀ existingĀ rankingĀ modelsĀ forĀ videoĀ recommendation.Ā BesidesĀ featuresĀ adoptedĀ inĀ theĀ existingĀ rankingĀ models,Ā theĀ rankingĀ modelĀ 740Ā mayĀ furtherĀ adoptĀ atĀ leastĀ oneĀ referenceĀ factor,Ā e.g.,Ā theĀ serviceĀ configurationĀ 710,Ā theĀ preferenceĀ scoreĀ 720Ā and/orĀ theĀ userĀ inputĀ 730Ā inĀ FIG.Ā 7,Ā asĀ additionalĀ features.Ā Moreover,Ā theĀ rankingĀ modelĀ 740Ā mayĀ furtherĀ adoptĀ thoseĀ featuresĀ adoptedĀ byĀ theĀ contentĀ sideĀ modelĀ inĀ FIG.Ā 2Ā toĀ FIG.Ā 6Ā asĀ additionalĀ features,Ā comprisingĀ atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ aĀ candidateĀ video.Ā DuringĀ determiningĀ aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ set,Ā atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ theĀ candidateĀ videoĀ mayĀ beĀ detected.Ā TheĀ detectedĀ informationĀ aboutĀ theĀ candidateĀ videoĀ togetherĀ withĀ theĀ atĀ leastĀ oneĀ referenceĀ factorĀ mayĀ beĀ furtherĀ usedĀ forĀ determiningĀ theĀ rankingĀ scoreĀ ofĀ theĀ candidateĀ video,Ā e.g.,Ā throughĀ theĀ rankingĀ modelĀ 740.Ā ThroughĀ consideringĀ theĀ atĀ leastĀ oneĀ referenceĀ factor,Ā theĀ rankingĀ modelĀ 740Ā mayĀ acknowledgeĀ whatĀ typesĀ ofĀ candidateĀ videos,Ā e.g.,Ā whetherĀ visualĀ informationĀ isĀ importantĀ orĀ audioĀ informationĀ isĀ important,Ā shallĀ beĀ recommendedĀ toĀ theĀ user.Ā ThroughĀ consideringĀ theĀ detectedĀ informationĀ aboutĀ theĀ candidateĀ video,Ā theĀ rankingĀ modelĀ 740Ā mayĀ decideĀ whetherĀ thisĀ candidateĀ videoĀ compliesĀ withĀ preferredĀ importanceĀ indicatedĀ byĀ theĀ atĀ leastĀ oneĀ referenceĀ factor.Ā Accordingly,Ā theĀ rankingĀ modelĀ 740Ā mayĀ determineĀ aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ inĀ aĀ considerationĀ ofĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ information.Ā ThroughĀ theĀ rankingĀ modelĀ 740,Ā theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ rankingĀ scoresĀ 760Ā mayĀ beĀ obtained.
AccordingĀ toĀ theĀ processĀ 700,Ā afterĀ theĀ candidateĀ videoĀ setĀ withĀ respectiveĀ rankingĀ scoresĀ 760Ā isĀ obtained,Ā recommendedĀ videosĀ 770Ā mayĀ beĀ selectedĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set.Ā Moreover,Ā theĀ recommendedĀ videosĀ 770Ā mayĀ beĀ furtherĀ providedĀ toĀ theĀ userĀ throughĀ theĀ terminalĀ deviceĀ ofĀ theĀ user.
ItĀ shouldĀ beĀ appreciatedĀ that,Ā inĀ someĀ implementations,Ā theĀ rankingĀ modelsĀ  inĀ FIG.Ā 3Ā toĀ FIG.Ā 7Ā mayĀ beĀ configuredĀ forĀ determiningĀ aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ furtherĀ basedĀ onĀ consumptionĀ conditionĀ ofĀ theĀ candidateĀ videoĀ byĀ aĀ numberĀ ofĀ otherĀ users.Ā TheĀ moreĀ timesĀ theĀ candidateĀ videoĀ isĀ consumedĀ byĀ otherĀ users,Ā theĀ higherĀ rankingĀ scoreĀ theĀ candidateĀ videoĀ mayĀ get.Ā InĀ someĀ implementations,Ā theĀ rankingĀ modelsĀ inĀ FIG.Ā 3Ā toĀ FIG.Ā 7Ā mayĀ beĀ configuredĀ forĀ determiningĀ aĀ rankingĀ scoreĀ ofĀ aĀ candidateĀ videoĀ furtherĀ basedĀ onĀ relevanceĀ betweenĀ contentĀ ofĀ theĀ candidateĀ videoĀ andĀ theĀ user’sĀ interests.Ā TheĀ user’sĀ interestsĀ mayĀ beĀ determinedĀ basedĀ on,Ā e.g.,Ā historicalĀ watchingĀ recordsĀ ofĀ theĀ user.Ā ForĀ example,Ā theĀ historicalĀ watchingĀ recordsĀ ofĀ theĀ userĀ mayĀ indicateĀ whatĀ categoriesĀ orĀ topicsĀ ofĀ videoĀ contentĀ theĀ userĀ isĀ interestedĀ in.Ā IfĀ theĀ contentĀ ofĀ theĀ candidateĀ videoĀ hasĀ aĀ higherĀ relevanceĀ withĀ theĀ user’sĀ interests,Ā aĀ higherĀ rankingĀ scoreĀ mayĀ beĀ determinedĀ forĀ theĀ candidateĀ video.Ā Moreover,Ā inĀ someĀ implementations,Ā whenĀ selectingĀ theĀ recommendedĀ videosĀ fromĀ theĀ candidateĀ videoĀ setĀ withĀ rankingĀ scores,Ā besidesĀ consideringĀ selectingĀ theĀ highestĀ rankingĀ candidateĀ videosĀ basedĀ onĀ theĀ rankingĀ scores,Ā diversityĀ ofĀ videoĀ recommendationĀ mayĀ alsoĀ beĀ consideredĀ suchĀ thatĀ theĀ selectedĀ recommendedĀ videosĀ couldĀ haveĀ diversityĀ inĀ termsĀ ofĀ content.
ItĀ shouldĀ beĀ appreciatedĀ thatĀ theĀ presentĀ disclosureĀ alsoĀ coversĀ anyĀ variantsĀ ofĀ theĀ methodsĀ forĀ providingĀ videoĀ recommendationĀ discussedĀ aboveĀ inĀ connectionĀ withĀ FIG.Ā 3Ā toĀ FIG.Ā 7.Ā ForĀ example,Ā inĀ anĀ implementation,Ā candidateĀ videosĀ inĀ aĀ candidateĀ videoĀ setĀ mayĀ beĀ firstlyĀ rankedĀ throughĀ anyĀ existingĀ rankingĀ modelsĀ forĀ videoĀ recommendation.Ā ThenĀ aĀ filteringĀ operationĀ mayĀ beĀ performedĀ onĀ theĀ rankedĀ candidateĀ videos,Ā whereinĀ theĀ filteringĀ operationĀ mayĀ considerĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā ForĀ example,Ā atĀ leastĀ oneĀ ofĀ theĀ serviceĀ configuration,Ā theĀ preferenceĀ scoreĀ andĀ theĀ userĀ inputĀ asĀ discussedĀ aboveĀ inĀ FIG.Ā 3Ā toĀ FIG.Ā 7Ā mayĀ beĀ usedĀ byĀ theĀ filteringĀ operationĀ forĀ filteringĀ outĀ thoseĀ candidateĀ videosĀ notĀ complyingĀ withĀ theĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā AfterĀ theĀ filteringĀ operation,Ā atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ obtained,Ā andĀ theĀ atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ furtherĀ providedĀ toĀ theĀ user.Ā InĀ anĀ implementation,Ā theĀ filteringĀ operationĀ mayĀ beĀ implementedĀ throughĀ aĀ filterĀ modelĀ whichĀ adoptsĀ featuresĀ comprisingĀ atĀ leastĀ oneĀ ofĀ serviceĀ configuration,Ā preferenceĀ scoreĀ andĀ userĀ input.
FIG.Ā 8Ā illustratesĀ aĀ flowchartĀ ofĀ anĀ exemplaryĀ methodĀ 800Ā forĀ providingĀ  videoĀ recommendationĀ accordingĀ toĀ anĀ embodiment.
AtĀ 810,Ā atĀ leastĀ oneĀ referenceĀ factorĀ forĀ theĀ videoĀ recommendationĀ mayĀ beĀ determined,Ā whereinĀ theĀ atĀ leastĀ oneĀ referenceĀ factorĀ indicatesĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.
AtĀ 820,Ā aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ aĀ candidateĀ videoĀ setĀ mayĀ beĀ determinedĀ basedĀ atĀ leastĀ onĀ theĀ atĀ leastĀ oneĀ referenceĀ factor.
AtĀ 830,Ā atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ selectedĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set.
AtĀ 840,Ā theĀ atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ providedĀ toĀ aĀ userĀ throughĀ aĀ terminalĀ device.
InĀ anĀ implementation,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ mayĀ compriseĀ aĀ preferenceĀ scoreĀ ofĀ theĀ user,Ā theĀ preferenceĀ scoreĀ indicatingĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ theĀ visualĀ informationĀ and/orĀ theĀ audioĀ informationĀ inĀ theĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā TheĀ preferenceĀ scoreĀ mayĀ beĀ determinedĀ basedĀ onĀ atĀ leastĀ oneĀ of:Ā currentĀ time,Ā currentĀ location,Ā configurationĀ ofĀ theĀ terminalĀ device,Ā operatingĀ stateĀ ofĀ theĀ terminalĀ device,Ā andĀ historicalĀ watchingĀ behaviorsĀ ofĀ theĀ user.Ā TheĀ configurationĀ ofĀ theĀ terminalĀ deviceĀ mayĀ compriseĀ atĀ leastĀ oneĀ of:Ā screenĀ size,Ā screenĀ resolution,Ā loudspeakerĀ availableĀ orĀ not,Ā andĀ peripheralĀ earphoneĀ connectedĀ orĀ not.Ā TheĀ operatingĀ stateĀ ofĀ theĀ terminalĀ deviceĀ mayĀ compriseĀ atĀ leastĀ oneĀ of:Ā operatingĀ inĀ aĀ muteĀ mode,Ā operatingĀ inĀ aĀ non-muteĀ modeĀ andĀ operatingĀ inĀ aĀ drivingĀ mode.Ā TheĀ preferenceĀ scoreĀ mayĀ beĀ determinedĀ throughĀ aĀ userĀ sideĀ model,Ā theĀ userĀ sideĀ modelĀ adoptingĀ atĀ leastĀ oneĀ ofĀ theĀ followingĀ features:Ā time,Ā location,Ā configurationĀ ofĀ theĀ terminalĀ device,Ā operatingĀ stateĀ ofĀ theĀ terminalĀ device,Ā andĀ historicalĀ watchingĀ behaviorsĀ ofĀ theĀ user.
InĀ anĀ implementation,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ mayĀ compriseĀ anĀ indicationĀ ofĀ aĀ defaultĀ orĀ currentĀ serviceĀ configurationĀ ofĀ theĀ videoĀ recommendation.Ā TheĀ defaultĀ orĀ currentĀ serviceĀ configurationĀ mayĀ compriseĀ providingĀ theĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommendedĀ inĀ aĀ muteĀ modeĀ orĀ inĀ aĀ non-muteĀ mode.
InĀ anĀ implementation,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ mayĀ compriseĀ aĀ userĀ inputĀ fromĀ theĀ user,Ā theĀ userĀ inputĀ indicatingĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ theĀ visualĀ informationĀ and/orĀ theĀ audioĀ informationĀ inĀ theĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā TheĀ userĀ inputĀ mayĀ compriseĀ atĀ leastĀ oneĀ of:Ā aĀ designationĀ ofĀ theĀ preferredĀ importanceĀ ofĀ theĀ visualĀ informationĀ and/orĀ theĀ audioĀ informationĀ inĀ theĀ atĀ  leastĀ oneĀ videoĀ toĀ beĀ recommended;Ā aĀ designationĀ ofĀ categoryĀ ofĀ theĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended;Ā andĀ aĀ queryĀ forĀ searchingĀ videos.
InĀ anĀ implementation,Ā theĀ methodĀ 800Ā mayĀ furtherĀ comprise:Ā determiningĀ aĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ set,Ā theĀ contentĀ scoreĀ indicatingĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ theĀ candidateĀ video.Ā TheĀ determiningĀ theĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ mayĀ beĀ furtherĀ basedĀ onĀ aĀ contentĀ scoreĀ ofĀ theĀ candidateĀ video.Ā TheĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ mayĀ beĀ determinedĀ basedĀ onĀ atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ theĀ candidateĀ video.Ā TheĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ mayĀ beĀ determinedĀ throughĀ aĀ contentĀ sideĀ model,Ā theĀ contentĀ sideĀ modelĀ adoptingĀ atĀ leastĀ oneĀ ofĀ theĀ followingĀ features:Ā shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadata.Ā Alternatively,Ā theĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ mayĀ beĀ determinedĀ throughĀ aĀ contentĀ sideĀ modelĀ whichĀ isĀ basedĀ onĀ deepĀ learning,Ā theĀ contentĀ sideĀ modelĀ beingĀ trainedĀ byĀ aĀ setĀ ofĀ trainingĀ data,Ā eachĀ trainingĀ dataĀ beingĀ formedĀ byĀ aĀ videoĀ andĀ aĀ labeledĀ contentĀ scoreĀ indicatingĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ theĀ video.Ā TheĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ mayĀ beĀ determinedĀ throughĀ aĀ rankingĀ model,Ā theĀ rankingĀ modelĀ atĀ leastĀ adoptingĀ theĀ followingĀ features:Ā atĀ leastĀ oneĀ referenceĀ factor;Ā andĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ video.
InĀ anĀ implementation,Ā theĀ methodĀ 800Ā mayĀ furtherĀ comprise:Ā detectingĀ atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ set.Ā TheĀ determiningĀ theĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ mayĀ beĀ furtherĀ basedĀ onĀ atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ theĀ candidateĀ video.Ā TheĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ mayĀ beĀ determinedĀ throughĀ aĀ rankingĀ model,Ā theĀ rankingĀ modelĀ atĀ leastĀ adoptingĀ theĀ followingĀ features:Ā atĀ leastĀ oneĀ referenceĀ factor;Ā andĀ atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ aĀ candidateĀ video.
InĀ anĀ implementation,Ā theĀ determiningĀ theĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ  videoĀ mayĀ beĀ furtherĀ basedĀ onĀ atĀ leastĀ oneĀ of:Ā consumptionĀ conditionĀ ofĀ theĀ candidateĀ videoĀ byĀ aĀ numberĀ ofĀ otherĀ users;Ā andĀ relevanceĀ betweenĀ contentĀ ofĀ theĀ candidateĀ videoĀ andĀ theĀ user’sĀ interests.
InĀ anĀ implementation,Ā theĀ videoĀ recommendationĀ mayĀ beĀ providedĀ inĀ aĀ clientĀ applicationĀ orĀ serviceĀ providingĀ website.
ItĀ shouldĀ beĀ appreciatedĀ thatĀ theĀ methodĀ 800Ā mayĀ furtherĀ compriseĀ anyĀ steps/processesĀ forĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ theĀ embodimentsĀ ofĀ theĀ presentĀ disclosureĀ asĀ mentionedĀ above.
FIG.Ā 9Ā illustratesĀ anĀ exemplaryĀ apparatusĀ 900Ā forĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ anĀ embodiment.
TheĀ apparatusĀ 900Ā mayĀ comprise:Ā aĀ referenceĀ factorĀ determiningĀ moduleĀ 910,Ā forĀ determiningĀ atĀ leastĀ oneĀ referenceĀ factorĀ forĀ theĀ videoĀ recommendation,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ indicatingĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended;Ā aĀ rankingĀ scoreĀ determiningĀ moduleĀ 920,Ā forĀ determiningĀ aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ aĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ theĀ atĀ leastĀ oneĀ referenceĀ factor;Ā aĀ recommendedĀ videoĀ selectingĀ moduleĀ 930,Ā forĀ selectingĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set;Ā andĀ aĀ recommendedĀ videoĀ providingĀ moduleĀ 940,Ā forĀ providingĀ theĀ atĀ leastĀ oneĀ recommendedĀ videoĀ toĀ aĀ userĀ throughĀ aĀ terminalĀ device.
InĀ anĀ implementation,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ mayĀ compriseĀ atĀ leastĀ oneĀ of:Ā aĀ preferenceĀ scoreĀ ofĀ theĀ user;Ā anĀ indicationĀ ofĀ aĀ defaultĀ orĀ currentĀ serviceĀ configurationĀ ofĀ theĀ videoĀ recommendation;Ā andĀ aĀ userĀ inputĀ fromĀ theĀ user.
Moreover,Ā theĀ apparatusĀ 900Ā mayĀ alsoĀ compriseĀ anyĀ otherĀ modulesĀ configuredĀ forĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ theĀ embodimentsĀ ofĀ theĀ presentĀ disclosureĀ asĀ mentionedĀ above.
FIG.Ā 10Ā illustratesĀ anĀ exemplaryĀ apparatusĀ 1000Ā forĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ anĀ embodiment.
TheĀ apparatusĀ 1000Ā mayĀ compriseĀ atĀ leastĀ oneĀ processorĀ 1010Ā andĀ aĀ memoryĀ 1020Ā storingĀ computer-executableĀ instructions.Ā WhenĀ executingĀ theĀ computer-executableĀ instructions,Ā theĀ atĀ leastĀ oneĀ processorĀ 1010Ā may:Ā determineĀ atĀ leastĀ oneĀ referenceĀ factorĀ forĀ theĀ videoĀ recommendation,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ indicatingĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ  inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended;Ā determineĀ aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ aĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ theĀ atĀ leastĀ oneĀ referenceĀ factor;Ā selectĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set;Ā andĀ provideĀ theĀ atĀ leastĀ oneĀ recommendedĀ videoĀ toĀ aĀ userĀ throughĀ aĀ terminalĀ device.
TheĀ atĀ leastĀ oneĀ processorĀ 1010Ā mayĀ beĀ furtherĀ configuredĀ forĀ performingĀ anyĀ operationsĀ ofĀ theĀ methodsĀ forĀ providingĀ videoĀ recommendationĀ accordingĀ toĀ theĀ embodimentsĀ ofĀ theĀ presentĀ disclosureĀ asĀ mentionedĀ above.
MethodsĀ andĀ apparatusesĀ forĀ providingĀ videoĀ recommendationĀ haveĀ beenĀ discussedĀ aboveĀ basedĀ onĀ variousĀ embodimentsĀ ofĀ theĀ presentĀ disclosure.Ā ItĀ shouldĀ beĀ appreciatedĀ thatĀ anyĀ additions,Ā deletions,Ā replacements,Ā reconstructions,Ā andĀ derivationsĀ ofĀ componentsĀ includedĀ inĀ theseĀ methodsĀ andĀ apparatusesĀ shallĀ alsoĀ beĀ coveredĀ byĀ theĀ presentĀ disclosure.
AccordingĀ toĀ anĀ exemplaryĀ embodiment,Ā aĀ methodĀ forĀ presentingĀ recommendedĀ videosĀ toĀ aĀ userĀ isĀ provided.
DuringĀ theĀ userĀ isĀ accessingĀ aĀ thirdĀ partyĀ applicationĀ orĀ websiteĀ whichĀ providesĀ videoĀ recommendationĀ service,Ā aĀ userĀ inputĀ mayĀ beĀ received.Ā TheĀ receivedĀ userĀ inputĀ mayĀ correspondĀ to,Ā e.g.,Ā theĀ userĀ inputĀ 510Ā inĀ FIG.Ā 5,Ā theĀ userĀ inputĀ 630Ā inĀ FIG.Ā 6,Ā theĀ userĀ inputĀ 730Ā inĀ FIG.Ā 7,Ā etc.Ā InĀ anĀ implementation,Ā theĀ operationĀ ofĀ receivingĀ theĀ userĀ inputĀ mayĀ compriseĀ receiving,Ā fromĀ theĀ user,Ā aĀ designationĀ ofĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā ForĀ example,Ā whenĀ theĀ userĀ selectsĀ oneĀ ofĀ optionsĀ ofĀ preferredĀ importanceĀ providedĀ inĀ aĀ userĀ interfaceĀ ofĀ theĀ thirdĀ partyĀ applicationĀ orĀ website,Ā aĀ designationĀ ofĀ theĀ preferredĀ importanceĀ mayĀ beĀ received.Ā InĀ anĀ implementation,Ā theĀ operationĀ ofĀ receivingĀ theĀ userĀ inputĀ mayĀ compriseĀ receiving,Ā fromĀ theĀ user,Ā aĀ designationĀ ofĀ categoryĀ ofĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā ForĀ example,Ā whenĀ theĀ userĀ selectsĀ orĀ inputs,Ā inĀ theĀ userĀ interfaceĀ ofĀ theĀ thirdĀ partyĀ applicationĀ orĀ website,Ā atĀ leastĀ oneĀ desiredĀ categoryĀ ofĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended,Ā aĀ designationĀ ofĀ theĀ categoryĀ mayĀ beĀ received.Ā InĀ anĀ implementation,Ā theĀ operationĀ ofĀ receivingĀ theĀ userĀ inputĀ mayĀ compriseĀ receiving,Ā fromĀ theĀ user,Ā aĀ queryĀ forĀ searchingĀ videos.Ā ForĀ example,Ā whenĀ theĀ userĀ inputsĀ aĀ queryĀ inĀ theĀ userĀ interfaceĀ ofĀ theĀ thirdĀ partyĀ applicationĀ orĀ websiteĀ soĀ asĀ toĀ searchĀ videosĀ thatĀ theĀ userĀ isĀ interested,Ā theĀ queryĀ mayĀ beĀ received.
AccordingĀ toĀ theĀ method,Ā theĀ receivedĀ userĀ inputĀ mayĀ beĀ usedĀ forĀ identifyingĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended,Ā e.g.,Ā expectationĀ degreeĀ ofĀ theĀ userĀ forĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā ForĀ example,Ā ifĀ aĀ categoryĀ ā€œtalkĀ showā€Ā isĀ designatedĀ inĀ theĀ userĀ input,Ā itĀ mayĀ beĀ identifiedĀ thatĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.Ā ForĀ example,Ā ifĀ aĀ queryĀ ā€œfamousĀ magicĀ showsā€Ā isĀ includedĀ inĀ theĀ userĀ input,Ā itĀ mayĀ beĀ identifiedĀ thatĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ information.
AccordingĀ toĀ theĀ method,Ā theĀ identifiedĀ preferredĀ importanceĀ mayĀ beĀ furtherĀ usedĀ forĀ determiningĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ aĀ candidateĀ videoĀ set.Ā ForĀ example,Ā thoseĀ rankingĀ approachesĀ discussedĀ aboveĀ inĀ FIG.Ā 3Ā toĀ FIG.Ā 7Ā mayĀ beĀ adoptedĀ hereĀ forĀ rankingĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ setĀ andĀ furtherĀ selectingĀ theĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ theĀ rankedĀ candidateĀ videos.
AccordingĀ toĀ theĀ method,Ā theĀ determinedĀ atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ presentedĀ toĀ theĀ userĀ throughĀ theĀ userĀ interface.Ā InĀ anĀ implementation,Ā aĀ recommendedĀ videoĀ listĀ mayĀ beĀ formedĀ andĀ presentedĀ toĀ theĀ user.Ā InĀ anĀ implementation,Ā ifĀ thereĀ isĀ aĀ recommendedĀ videoĀ listĀ alreadyĀ presentedĀ toĀ theĀ user,Ā theĀ determinedĀ atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ usedĀ forĀ updatingĀ theĀ recommendedĀ videoĀ list.
AnĀ apparatusĀ forĀ presentingĀ recommendedĀ videosĀ toĀ aĀ userĀ mayĀ beĀ provided,Ā whichĀ comprisesĀ variousĀ modulesĀ configuredĀ forĀ performingĀ anyĀ operationsĀ ofĀ theĀ aboveĀ methodĀ mayĀ beĀ provided.Ā Moreover,Ā anĀ apparatusĀ forĀ presentingĀ recommendedĀ videosĀ toĀ aĀ userĀ mayĀ beĀ provided,Ā whichĀ comprisesĀ atĀ leastĀ oneĀ processorĀ andĀ aĀ memoryĀ storingĀ computer-executableĀ instructions,Ā whereinĀ theĀ atĀ leastĀ oneĀ processorĀ mayĀ beĀ configuredĀ forĀ performingĀ anyĀ operationsĀ ofĀ theĀ aboveĀ method.
AccordingĀ toĀ anotherĀ exemplaryĀ embodiment,Ā aĀ methodĀ forĀ presentingĀ recommendedĀ videosĀ toĀ aĀ userĀ isĀ provided.
DuringĀ theĀ userĀ isĀ accessingĀ aĀ thirdĀ partyĀ applicationĀ orĀ websiteĀ whichĀ providesĀ videoĀ recommendationĀ service,Ā aĀ serviceĀ configurationĀ ofĀ videoĀ recommendationĀ mayĀ beĀ detected.Ā TheĀ detectedĀ serviceĀ configurationĀ mayĀ correspondĀ to,Ā e.g.,Ā theĀ serviceĀ configurationĀ 310Ā inĀ FIG.Ā 3.
AccordingĀ toĀ theĀ method,Ā theĀ detectedĀ serviceĀ configurationĀ mayĀ beĀ usedĀ forĀ identifyingĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.Ā ForĀ example,Ā ifĀ theĀ serviceĀ configurationĀ indicatesĀ thatĀ recommendedĀ videosĀ shallĀ beĀ providedĀ inĀ aĀ muteĀ mode,Ā itĀ mayĀ beĀ identifiedĀ thatĀ thoseĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ informationĀ areĀ preferredĀ toĀ beĀ recommended.
AccordingĀ toĀ theĀ method,Ā theĀ identifiedĀ preferredĀ importanceĀ mayĀ beĀ furtherĀ usedĀ forĀ determiningĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ aĀ candidateĀ videoĀ set.Ā ForĀ example,Ā thoseĀ rankingĀ approachesĀ discussedĀ aboveĀ inĀ FIG.Ā 3Ā toĀ FIG.Ā 7Ā mayĀ beĀ adoptedĀ hereĀ forĀ rankingĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ setĀ andĀ furtherĀ selectingĀ theĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ theĀ rankedĀ candidateĀ videos.
AccordingĀ toĀ theĀ method,Ā theĀ determinedĀ atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ presentedĀ toĀ theĀ userĀ throughĀ theĀ userĀ interface.Ā InĀ anĀ implementation,Ā aĀ recommendedĀ videoĀ listĀ mayĀ beĀ formedĀ andĀ presentedĀ toĀ theĀ user.Ā InĀ anĀ implementation,Ā ifĀ thereĀ isĀ aĀ recommendedĀ videoĀ listĀ alreadyĀ presentedĀ toĀ theĀ user,Ā theĀ determinedĀ atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ usedĀ forĀ updatingĀ theĀ recommendedĀ videoĀ list.
AnĀ apparatusĀ forĀ presentingĀ recommendedĀ videosĀ toĀ aĀ userĀ mayĀ beĀ provided,Ā whichĀ comprisesĀ variousĀ modulesĀ configuredĀ forĀ performingĀ anyĀ operationsĀ ofĀ theĀ aboveĀ methodĀ mayĀ beĀ provided.Ā Moreover,Ā anĀ apparatusĀ forĀ presentingĀ recommendedĀ videosĀ toĀ aĀ userĀ mayĀ beĀ provided,Ā whichĀ comprisesĀ atĀ leastĀ oneĀ processorĀ andĀ aĀ memoryĀ storingĀ computer-executableĀ instructions,Ā whereinĀ theĀ atĀ leastĀ oneĀ processorĀ mayĀ beĀ configuredĀ forĀ performingĀ anyĀ operationsĀ ofĀ theĀ aboveĀ method.
AccordingĀ toĀ anotherĀ exemplaryĀ embodiment,Ā aĀ methodĀ forĀ presentingĀ recommendedĀ videosĀ toĀ aĀ userĀ isĀ provided.
DuringĀ theĀ userĀ isĀ accessingĀ aĀ thirdĀ partyĀ applicationĀ orĀ websiteĀ whichĀ providesĀ videoĀ recommendationĀ service,Ā aĀ preferenceĀ scoreĀ ofĀ theĀ userĀ mayĀ beĀ determined.Ā TheĀ preferenceĀ scoreĀ mayĀ correspondĀ to,Ā e.g.,Ā theĀ preferenceĀ scoreĀ 410Ā inĀ FIG.Ā 4,Ā andĀ mayĀ beĀ determinedĀ inĀ aĀ similarĀ wayĀ asĀ thatĀ discussedĀ inĀ FIG.Ā 4.
AccordingĀ toĀ theĀ method,Ā theĀ determinedĀ preferenceĀ scoreĀ mayĀ beĀ usedĀ forĀ identifyingĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended,Ā e.g.,Ā expectationĀ degreeĀ ofĀ theĀ userĀ forĀ visualĀ  informationĀ and/orĀ audioĀ informationĀ inĀ aĀ videoĀ toĀ beĀ recommended.Ā ForĀ example,Ā theĀ preferenceĀ scoreĀ mayĀ indicateĀ whetherĀ theĀ userĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ visualĀ informationĀ orĀ expectsĀ toĀ obtainĀ recommendedĀ videosĀ withĀ highĀ importanceĀ ofĀ audioĀ information.
AccordingĀ toĀ theĀ method,Ā theĀ identifiedĀ preferredĀ importanceĀ mayĀ beĀ furtherĀ usedĀ forĀ determiningĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ aĀ candidateĀ videoĀ set.Ā ForĀ example,Ā thoseĀ rankingĀ approachesĀ discussedĀ aboveĀ inĀ FIG.Ā 3Ā toĀ FIG.Ā 7Ā mayĀ beĀ adoptedĀ hereĀ forĀ rankingĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ setĀ andĀ furtherĀ selectingĀ theĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ theĀ rankedĀ candidateĀ videos.
AccordingĀ toĀ theĀ method,Ā theĀ determinedĀ atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ presentedĀ toĀ theĀ userĀ throughĀ theĀ userĀ interface.Ā InĀ anĀ implementation,Ā aĀ recommendedĀ videoĀ listĀ mayĀ beĀ formedĀ andĀ presentedĀ toĀ theĀ user.Ā InĀ anĀ implementation,Ā ifĀ thereĀ isĀ aĀ recommendedĀ videoĀ listĀ alreadyĀ presentedĀ toĀ theĀ user,Ā theĀ determinedĀ atĀ leastĀ oneĀ recommendedĀ videoĀ mayĀ beĀ usedĀ forĀ updatingĀ theĀ recommendedĀ videoĀ list.
AnĀ apparatusĀ forĀ presentingĀ recommendedĀ videosĀ toĀ aĀ userĀ mayĀ beĀ provided,Ā whichĀ comprisesĀ variousĀ modulesĀ configuredĀ forĀ performingĀ anyĀ operationsĀ ofĀ theĀ aboveĀ methodĀ mayĀ beĀ provided.Ā Moreover,Ā anĀ apparatusĀ forĀ presentingĀ recommendedĀ videosĀ toĀ aĀ userĀ mayĀ beĀ provided,Ā whichĀ comprisesĀ atĀ leastĀ oneĀ processorĀ andĀ aĀ memoryĀ storingĀ computer-executableĀ instructions,Ā whereinĀ theĀ atĀ leastĀ oneĀ processorĀ mayĀ beĀ configuredĀ forĀ performingĀ anyĀ operationsĀ ofĀ theĀ aboveĀ method.
TheĀ embodimentsĀ ofĀ theĀ presentĀ disclosureĀ mayĀ beĀ embodiedĀ inĀ aĀ non-transitoryĀ computer-readableĀ medium.Ā TheĀ non-transitoryĀ computer-readableĀ mediumĀ mayĀ compriseĀ instructionsĀ that,Ā whenĀ executed,Ā causeĀ oneĀ orĀ moreĀ processorsĀ toĀ performĀ anyĀ operationsĀ ofĀ theĀ methodsĀ forĀ providingĀ videoĀ recommendationĀ orĀ forĀ presentingĀ recommendedĀ videosĀ accordingĀ toĀ theĀ embodimentsĀ ofĀ theĀ presentĀ disclosureĀ asĀ mentionedĀ above.
ItĀ shouldĀ beĀ appreciatedĀ thatĀ allĀ theĀ operationsĀ inĀ theĀ methodsĀ describedĀ aboveĀ areĀ merelyĀ exemplary,Ā andĀ theĀ presentĀ disclosureĀ isĀ notĀ limitedĀ toĀ anyĀ operationsĀ inĀ theĀ methodsĀ orĀ sequenceĀ ordersĀ ofĀ theseĀ operations,Ā andĀ shouldĀ coverĀ allĀ otherĀ equivalentsĀ underĀ theĀ sameĀ orĀ similarĀ concepts.
ItĀ shouldĀ alsoĀ beĀ appreciatedĀ thatĀ allĀ theĀ modulesĀ inĀ theĀ apparatusesĀ  describedĀ aboveĀ mayĀ beĀ implementedĀ inĀ variousĀ approaches.Ā TheseĀ modulesĀ mayĀ beĀ implementedĀ asĀ hardware,Ā software,Ā orĀ aĀ combinationĀ thereof.Ā Moreover,Ā anyĀ ofĀ theseĀ modulesĀ mayĀ beĀ furtherĀ functionallyĀ dividedĀ intoĀ sub-modulesĀ orĀ combinedĀ together.
ProcessorsĀ haveĀ beenĀ describedĀ inĀ connectionĀ withĀ variousĀ apparatusesĀ andĀ methods.Ā TheseĀ processorsĀ mayĀ beĀ implementedĀ usingĀ electronicĀ hardware,Ā computerĀ software,Ā orĀ anyĀ combinationĀ thereof.Ā WhetherĀ suchĀ processorsĀ areĀ implementedĀ asĀ hardwareĀ orĀ softwareĀ willĀ dependĀ uponĀ theĀ particularĀ applicationĀ andĀ overallĀ designĀ constraintsĀ imposedĀ onĀ theĀ system.Ā ByĀ wayĀ ofĀ example,Ā aĀ processor,Ā anyĀ portionĀ ofĀ aĀ processor,Ā orĀ anyĀ combinationĀ ofĀ processorsĀ presentedĀ inĀ theĀ presentĀ disclosureĀ mayĀ beĀ implementedĀ withĀ aĀ microprocessor,Ā microcontroller,Ā digitalĀ signalĀ processorĀ (DSP)Ā ,Ā aĀ field-programmableĀ gateĀ arrayĀ (FPGA)Ā ,Ā aĀ programmableĀ logicĀ deviceĀ (PLD)Ā ,Ā aĀ stateĀ machine,Ā gatedĀ logic,Ā discreteĀ hardwareĀ circuits,Ā andĀ otherĀ suitableĀ processingĀ componentsĀ configuredĀ toĀ performĀ theĀ variousĀ functionsĀ describedĀ throughoutĀ theĀ presentĀ disclosure.Ā TheĀ functionalityĀ ofĀ aĀ processor,Ā anyĀ portionĀ ofĀ aĀ processor,Ā orĀ anyĀ combinationĀ ofĀ processorsĀ presentedĀ inĀ theĀ presentĀ disclosureĀ mayĀ beĀ implementedĀ withĀ softwareĀ beingĀ executedĀ byĀ aĀ microprocessor,Ā microcontroller,Ā DSP,Ā orĀ otherĀ suitableĀ platform.
SoftwareĀ shallĀ beĀ construedĀ broadlyĀ toĀ meanĀ instructions,Ā instructionĀ sets,Ā code,Ā codeĀ segments,Ā programĀ code,Ā programs,Ā subprograms,Ā softwareĀ modules,Ā applications,Ā softwareĀ applications,Ā softwareĀ packages,Ā routines,Ā subroutines,Ā objects,Ā threadsĀ ofĀ execution,Ā procedures,Ā functions,Ā etc.Ā TheĀ softwareĀ mayĀ resideĀ onĀ aĀ computer-readableĀ medium.Ā AĀ computer-readableĀ mediumĀ mayĀ include,Ā byĀ wayĀ ofĀ example,Ā memoryĀ suchĀ asĀ aĀ magneticĀ storageĀ deviceĀ (e.g.,Ā hardĀ disk,Ā floppyĀ disk,Ā magneticĀ strip)Ā ,Ā anĀ opticalĀ disk,Ā aĀ smartĀ card,Ā aĀ flashĀ memoryĀ device,Ā randomĀ accessĀ memoryĀ (RAM)Ā ,Ā readĀ onlyĀ memoryĀ (ROM)Ā ,Ā programmableĀ ROMĀ (PROM)Ā ,Ā erasableĀ PROMĀ (EPROM)Ā ,Ā electricallyĀ erasableĀ PROMĀ (EEPROM)Ā ,Ā aĀ register,Ā orĀ aĀ removableĀ disk.Ā AlthoughĀ memoryĀ isĀ shownĀ separateĀ fromĀ theĀ processorsĀ inĀ theĀ variousĀ aspectsĀ presentedĀ throughoutĀ theĀ presentĀ disclosure,Ā theĀ memoryĀ mayĀ beĀ internalĀ toĀ theĀ processors,Ā e.g.,Ā cacheĀ orĀ register.
TheĀ previousĀ descriptionĀ isĀ providedĀ toĀ enableĀ anyĀ personĀ skilledĀ inĀ theĀ artĀ toĀ practiceĀ theĀ variousĀ aspectsĀ describedĀ herein.Ā VariousĀ modificationsĀ toĀ theseĀ aspectsĀ willĀ beĀ readilyĀ apparentĀ toĀ thoseĀ skilledĀ inĀ theĀ art,Ā andĀ theĀ genericĀ principlesĀ definedĀ hereinĀ mayĀ beĀ appliedĀ toĀ otherĀ aspects.Ā Thus,Ā theĀ claimsĀ areĀ notĀ intendedĀ toĀ beĀ  limitedĀ toĀ theĀ aspectsĀ shownĀ herein.Ā AllĀ structuralĀ andĀ functionalĀ equivalentsĀ toĀ theĀ elementsĀ ofĀ theĀ variousĀ aspectsĀ describedĀ throughoutĀ theĀ presentĀ disclosureĀ thatĀ areĀ knownĀ orĀ laterĀ comeĀ toĀ beĀ knownĀ toĀ thoseĀ ofĀ ordinaryĀ skilledĀ inĀ theĀ artĀ areĀ expresslyĀ incorporatedĀ hereinĀ byĀ referenceĀ andĀ areĀ intendedĀ toĀ beĀ encompassedĀ byĀ theĀ claims.

Claims (20)

  1. AĀ methodĀ forĀ providingĀ videoĀ recommendation,Ā comprising:
    determiningĀ atĀ leastĀ oneĀ referenceĀ factorĀ forĀ theĀ videoĀ recommendation,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ indicatingĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended;
    determiningĀ aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ aĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ theĀ atĀ leastĀ oneĀ referenceĀ factor;
    selectingĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set;Ā and
    providingĀ theĀ atĀ leastĀ oneĀ recommendedĀ videoĀ toĀ aĀ userĀ throughĀ aĀ terminalĀ device.
  2. TheĀ methodĀ ofĀ claimĀ 1,Ā whereinĀ theĀ atĀ leastĀ oneĀ referenceĀ factorĀ comprisesĀ aĀ preferenceĀ scoreĀ ofĀ theĀ user,Ā theĀ preferenceĀ scoreĀ indicatingĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ theĀ visualĀ informationĀ and/orĀ theĀ audioĀ informationĀ inĀ theĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.
  3. TheĀ methodĀ ofĀ claimĀ 2,Ā whereinĀ theĀ preferenceĀ scoreĀ isĀ determinedĀ basedĀ onĀ atĀ leastĀ oneĀ of:Ā currentĀ time,Ā currentĀ location,Ā configurationĀ ofĀ theĀ terminalĀ device,Ā operatingĀ stateĀ ofĀ theĀ terminalĀ device,Ā andĀ historicalĀ watchingĀ behaviorsĀ ofĀ theĀ user.
  4. TheĀ methodĀ ofĀ claimĀ 3,Ā wherein
    theĀ configurationĀ ofĀ theĀ terminalĀ deviceĀ comprisesĀ atĀ leastĀ oneĀ of:Ā screenĀ size,Ā screenĀ resolution,Ā loudspeakerĀ availableĀ orĀ not,Ā andĀ peripheralĀ earphoneĀ connectedĀ orĀ not,Ā and
    theĀ operatingĀ stateĀ ofĀ theĀ terminalĀ deviceĀ comprisesĀ atĀ leastĀ oneĀ of:Ā operatingĀ inĀ aĀ muteĀ mode,Ā operatingĀ inĀ aĀ non-muteĀ modeĀ andĀ operatingĀ inĀ aĀ drivingĀ mode.
  5. TheĀ methodĀ ofĀ claimĀ 3,Ā whereinĀ theĀ preferenceĀ scoreĀ isĀ determinedĀ throughĀ aĀ userĀ sideĀ model,Ā theĀ userĀ sideĀ modelĀ adoptingĀ atĀ leastĀ oneĀ ofĀ theĀ followingĀ features:Ā time,Ā location,Ā configurationĀ ofĀ theĀ terminalĀ device,Ā operatingĀ stateĀ ofĀ theĀ terminalĀ device,Ā andĀ historicalĀ watchingĀ behaviorsĀ ofĀ theĀ user.
  6. TheĀ methodĀ ofĀ claimĀ 1,Ā whereinĀ theĀ atĀ leastĀ oneĀ referenceĀ factorĀ comprisesĀ anĀ indicationĀ ofĀ aĀ defaultĀ orĀ currentĀ serviceĀ configurationĀ ofĀ theĀ videoĀ recommendation.
  7. TheĀ methodĀ ofĀ claimĀ 6,Ā whereinĀ theĀ defaultĀ orĀ currentĀ serviceĀ configurationĀ comprisesĀ providingĀ theĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommendedĀ inĀ aĀ muteĀ modeĀ orĀ inĀ aĀ non-muteĀ mode.
  8. TheĀ methodĀ ofĀ claimĀ 1,Ā whereinĀ theĀ atĀ leastĀ oneĀ referenceĀ factorĀ comprisesĀ aĀ userĀ inputĀ fromĀ theĀ user,Ā theĀ userĀ inputĀ indicatingĀ expectationĀ degreeĀ ofĀ theĀ userĀ forĀ theĀ visualĀ informationĀ and/orĀ theĀ audioĀ informationĀ inĀ theĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended.
  9. TheĀ methodĀ ofĀ claimĀ 8,Ā whereinĀ theĀ userĀ inputĀ comprisesĀ atĀ leastĀ oneĀ of:
    aĀ designationĀ ofĀ theĀ preferredĀ importanceĀ ofĀ theĀ visualĀ informationĀ and/orĀ theĀ audioĀ informationĀ inĀ theĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended;
    aĀ designationĀ ofĀ categoryĀ ofĀ theĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended;Ā and
    aĀ queryĀ forĀ searchingĀ videos.
  10. TheĀ methodĀ ofĀ claimĀ 1,Ā furtherĀ comprising:
    determiningĀ aĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ set,Ā theĀ contentĀ scoreĀ indicatingĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ theĀ candidateĀ video,Ā and
    whereinĀ theĀ determiningĀ theĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ isĀ furtherĀ basedĀ onĀ aĀ contentĀ scoreĀ ofĀ theĀ candidateĀ video.
  11. TheĀ methodĀ ofĀ claimĀ 10,Ā whereinĀ theĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ isĀ determinedĀ basedĀ onĀ atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ theĀ candidateĀ video.
  12. TheĀ methodĀ ofĀ claimĀ 10,Ā whereinĀ theĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ isĀ determinedĀ throughĀ aĀ contentĀ sideĀ model,Ā theĀ contentĀ sideĀ modelĀ adoptingĀ atĀ leastĀ  oneĀ ofĀ theĀ followingĀ features:Ā shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadata.
  13. TheĀ methodĀ ofĀ claimĀ 10,Ā whereinĀ theĀ contentĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ isĀ determinedĀ throughĀ aĀ contentĀ sideĀ modelĀ whichĀ isĀ basedĀ onĀ deepĀ learning,Ā theĀ contentĀ sideĀ modelĀ beingĀ trainedĀ byĀ aĀ setĀ ofĀ trainingĀ data,Ā eachĀ trainingĀ dataĀ beingĀ formedĀ byĀ aĀ videoĀ andĀ aĀ labeledĀ contentĀ scoreĀ indicatingĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ theĀ video.
  14. TheĀ methodĀ ofĀ claimĀ 10,Ā whereinĀ theĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ isĀ determinedĀ throughĀ aĀ rankingĀ model,Ā theĀ rankingĀ modelĀ atĀ leastĀ adoptingĀ theĀ followingĀ features:Ā atĀ leastĀ oneĀ referenceĀ factor;Ā andĀ aĀ contentĀ scoreĀ ofĀ aĀ candidateĀ video.
  15. TheĀ methodĀ ofĀ claimĀ 1,Ā furtherĀ comprising:
    detectingĀ atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ eachĀ candidateĀ videoĀ inĀ theĀ candidateĀ videoĀ set,Ā and
    whereinĀ theĀ determiningĀ theĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ isĀ furtherĀ basedĀ onĀ atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ theĀ candidateĀ video.
  16. TheĀ methodĀ ofĀ claimĀ 15,Ā whereinĀ theĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ isĀ determinedĀ throughĀ aĀ rankingĀ model,Ā theĀ rankingĀ modelĀ atĀ leastĀ adoptingĀ theĀ followingĀ features:Ā atĀ leastĀ oneĀ referenceĀ factor;Ā andĀ atĀ leastĀ oneĀ ofĀ shotĀ transition,Ā cameraĀ motion,Ā scene,Ā human,Ā humanĀ motion,Ā object,Ā objectĀ motion,Ā textĀ information,Ā audioĀ attribute,Ā andĀ videoĀ metadataĀ ofĀ aĀ candidateĀ video.
  17. TheĀ methodĀ ofĀ claimĀ 1,Ā whereinĀ theĀ determiningĀ theĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ isĀ furtherĀ basedĀ onĀ atĀ leastĀ oneĀ of:Ā consumptionĀ conditionĀ ofĀ theĀ candidateĀ videoĀ byĀ aĀ numberĀ ofĀ otherĀ users;Ā andĀ relevanceĀ betweenĀ contentĀ ofĀ theĀ candidateĀ videoĀ andĀ theĀ user’sĀ interests.
  18. TheĀ methodĀ ofĀ claimĀ 1,Ā whereinĀ theĀ videoĀ recommendationĀ isĀ providedĀ inĀ aĀ clientĀ applicationĀ orĀ serviceĀ providingĀ website.
  19. AnĀ apparatusĀ forĀ providingĀ videoĀ recommendation,Ā comprising:
    aĀ referenceĀ factorĀ determiningĀ module,Ā forĀ determiningĀ atĀ leastĀ oneĀ referenceĀ factorĀ forĀ theĀ videoĀ recommendation,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ indicatingĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended;
    aĀ rankingĀ scoreĀ determiningĀ module,Ā forĀ determiningĀ aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ aĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ theĀ atĀ leastĀ oneĀ referenceĀ factor;
    aĀ recommendedĀ videoĀ selectingĀ module,Ā forĀ selectingĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set;Ā and
    aĀ recommendedĀ videoĀ providingĀ module,Ā forĀ providingĀ theĀ atĀ leastĀ oneĀ recommendedĀ videoĀ toĀ aĀ userĀ throughĀ aĀ terminalĀ device.
  20. AnĀ apparatusĀ forĀ providingĀ videoĀ recommendation,Ā comprising:
    oneĀ orĀ moreĀ processors;Ā and
    aĀ memoryĀ storingĀ computer-executableĀ instructionsĀ that,Ā whenĀ executed,Ā causeĀ theĀ oneĀ orĀ moreĀ processorsĀ to:
    determineĀ atĀ leastĀ oneĀ referenceĀ factorĀ forĀ theĀ videoĀ recommendation,Ā theĀ atĀ leastĀ oneĀ referenceĀ factorĀ indicatingĀ preferredĀ importanceĀ ofĀ visualĀ informationĀ and/orĀ audioĀ informationĀ inĀ atĀ leastĀ oneĀ videoĀ toĀ beĀ recommended;
    determineĀ aĀ rankingĀ scoreĀ ofĀ eachĀ candidateĀ videoĀ inĀ aĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ theĀ atĀ leastĀ oneĀ referenceĀ factor;
    selectĀ atĀ leastĀ oneĀ recommendedĀ videoĀ fromĀ theĀ candidateĀ videoĀ setĀ basedĀ atĀ leastĀ onĀ rankingĀ scoresĀ ofĀ candidateĀ videosĀ inĀ theĀ candidateĀ videoĀ set;Ā and
    provideĀ theĀ atĀ leastĀ oneĀ recommendedĀ videoĀ toĀ aĀ userĀ throughĀ aĀ terminalĀ device.
PCT/CN2018/099914 2018-08-10 2018-08-10 Providing video recommendation Ceased WO2020029235A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US17/256,951 US20210144418A1 (en) 2018-08-10 2018-08-10 Providing video recommendation
PCT/CN2018/099914 WO2020029235A1 (en) 2018-08-10 2018-08-10 Providing video recommendation
CN201880069804.3A CN111279709B (en) 2018-08-10 2018-08-10 Providing video recommendations
EP18929802.9A EP3834424A4 (en) 2018-08-10 2018-08-10 PROVIDING A VIDEO RECOMMENDATION

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/099914 WO2020029235A1 (en) 2018-08-10 2018-08-10 Providing video recommendation

Publications (1)

Publication Number Publication Date
WO2020029235A1 true WO2020029235A1 (en) 2020-02-13

Family

ID=69415282

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/099914 Ceased WO2020029235A1 (en) 2018-08-10 2018-08-10 Providing video recommendation

Country Status (4)

Country Link
US (1) US20210144418A1 (en)
EP (1) EP3834424A4 (en)
CN (1) CN111279709B (en)
WO (1) WO2020029235A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111291201A (en) * 2020-03-06 2020-06-16 ē™¾åŗ¦åœØēŗæē½‘ē»œęŠ€ęœÆļ¼ˆåŒ—äŗ¬ļ¼‰ęœ‰é™å…¬åø Multimedia content score processing method and device and electronic equipment
CN112188295A (en) * 2020-09-29 2021-01-05 ęœ‰åŠå²›(åŒ—äŗ¬)äæ”ęÆē§‘ęŠ€ęœ‰é™å…¬åø Video recommendation method and device
CN113259727A (en) * 2021-04-30 2021-08-13 å¹æå·žč™Žē‰™ē§‘ęŠ€ęœ‰é™å…¬åø Video recommendation method, video recommendation device and computer-readable storage medium
CN114491094A (en) * 2022-01-12 2022-05-13 中国人民大学 Multimedia resource determination method, prediction model training method and device
US11568084B2 (en) 2020-11-12 2023-01-31 Tata Consultancy Services Limited Method and system for sequencing asset segments of privacy policy using optimization techniques
CN116150427A (en) * 2023-02-24 2023-05-23 åŒ—äŗ¬ē™¾åŗ¦ē½‘č®Æē§‘ęŠ€ęœ‰é™å…¬åø Video recommendation method, device, electronic device and storage medium
US11948213B2 (en) 2020-09-28 2024-04-02 Tata Consultancy Services Limited Method and system for sequencing asset segments of privacy policy

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11720793B2 (en) * 2019-10-14 2023-08-08 Google Llc Video anchors
US12231731B2 (en) * 2021-03-30 2025-02-18 Boe Technology Group Co., Ltd. Method and device for recommending real-time audios and/or videos, and computer storage medium
US11695992B2 (en) * 2021-07-15 2023-07-04 Rovi Guides, Inc. Content recommendations for users with disabilities
CN113987260A (en) * 2021-09-14 2022-01-28 åŒ—äŗ¬ē™¾åŗ¦ē½‘č®Æē§‘ęŠ€ęœ‰é™å…¬åø Video pushing method and device, electronic equipment and storage medium
CN114268815B (en) * 2021-12-15 2024-08-13 åŒ—äŗ¬č¾¾ä½³äŗ’č”äæ”ęÆęŠ€ęœÆęœ‰é™å…¬åø Video quality determining method, device, electronic equipment and storage medium
CN114417058B (en) * 2022-01-06 2025-11-04 ē™¾ęžœå›­ęŠ€ęœÆ(ę–°åŠ å”)ęœ‰é™å…¬åø A method, apparatus, computer device, and storage medium for selecting video footage.
CN114547396B (en) * 2022-02-28 2025-06-24 äŗæå’–é€š(ę¹–åŒ—)ęŠ€ęœÆęœ‰é™å…¬åø Method, device, equipment, medium and program product for processing information recall to be pushed
CN114697761B (en) * 2022-04-07 2024-02-13 č„øčŒęœ‰é™å…¬åø A processing method, device, terminal equipment and medium
CN115604538A (en) * 2022-10-09 2023-01-13 ęŠ–éŸ³č§†ē•Œęœ‰é™å…¬åøļ¼ˆCn) Video control method, device, electronic device and storage medium
CN116112710B (en) * 2023-02-17 2025-07-25 ęœŖę„ē”µč§†ęœ‰é™å…¬åø Information recommendation method, device and server

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050168436A1 (en) 2004-01-30 2005-08-04 International Business Machines Corporation Conveying the importance of display screen data using audible indicators
CN103239788A (en) 2012-02-07 2013-08-14 č””ęøŠ Mood regulating and emotion developing device and method
CN103634617A (en) 2013-11-26 2014-03-12 ä¹č§†č‡“ę–°ē”µå­ē§‘ęŠ€ļ¼ˆå¤©ę“„ļ¼‰ęœ‰é™å…¬åø Video recommending method and device in intelligent television
CN106131703A (en) 2016-06-28 2016-11-16 é’å²›ęµ·äæ”ä¼ åŖ’ē½‘ē»œęŠ€ęœÆęœ‰é™å…¬åø A kind of method of video recommendations and terminal

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030126130A1 (en) * 2001-12-31 2003-07-03 Koninklijke Philips Electronics N.V. Sort slider with context intuitive sort keys
JP2007519053A (en) * 2003-06-30 2007-07-12 ć‚³ćƒ¼ćƒ‹ćƒ³ć‚Æćƒ¬ćƒƒć‚« ćƒ•ć‚£ćƒŖćƒƒćƒ—ć‚¹ ć‚Øćƒ¬ć‚Æćƒˆćƒ­ćƒ‹ć‚Æć‚¹ ć‚ØćƒŒ ćƒ“ć‚£ System and method for generating a multimedia summary of a multimedia stream
JP4556752B2 (en) * 2005-04-18 2010-10-06 ę Ŗå¼ä¼šē¤¾ę—„ē«‹č£½ä½œę‰€ Recording / playback apparatus having commercial viewing control function
US8010645B2 (en) * 2006-05-12 2011-08-30 Sharp Laboratories Of America, Inc. Method and apparatus for providing feeds to users
CN104836720B (en) * 2014-02-12 2022-02-25 åŒ—äŗ¬äø‰ę˜Ÿé€šäæ”ęŠ€ęœÆē ”ē©¶ęœ‰é™å…¬åø Method and device for information recommendation in interactive communication
US9094730B1 (en) * 2014-06-19 2015-07-28 Google Inc. Providing timely media recommendations
US20160350658A1 (en) * 2015-06-01 2016-12-01 Microsoft Technology Licensing, Llc Viewport-based implicit feedback
US10659845B2 (en) * 2015-08-06 2020-05-19 Google Llc Methods, systems, and media for providing video content suitable for audio-only playback
CN105704331B (en) * 2016-04-26 2020-10-09 å±±äøœäŗ‘å°šå¤§ę•°ę®ęœ‰é™å…¬åø Application program recommendation method and system of mobile terminal

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050168436A1 (en) 2004-01-30 2005-08-04 International Business Machines Corporation Conveying the importance of display screen data using audible indicators
CN103239788A (en) 2012-02-07 2013-08-14 č””ęøŠ Mood regulating and emotion developing device and method
CN103634617A (en) 2013-11-26 2014-03-12 ä¹č§†č‡“ę–°ē”µå­ē§‘ęŠ€ļ¼ˆå¤©ę“„ļ¼‰ęœ‰é™å…¬åø Video recommending method and device in intelligent television
CN106131703A (en) 2016-06-28 2016-11-16 é’å²›ęµ·äæ”ä¼ åŖ’ē½‘ē»œęŠ€ęœÆęœ‰é™å…¬åø A kind of method of video recommendations and terminal

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3834424A4

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111291201A (en) * 2020-03-06 2020-06-16 ē™¾åŗ¦åœØēŗæē½‘ē»œęŠ€ęœÆļ¼ˆåŒ—äŗ¬ļ¼‰ęœ‰é™å…¬åø Multimedia content score processing method and device and electronic equipment
CN111291201B (en) * 2020-03-06 2023-10-03 ē™¾åŗ¦åœØēŗæē½‘ē»œęŠ€ęœÆļ¼ˆåŒ—äŗ¬ļ¼‰ęœ‰é™å…¬åø Multimedia content score processing method and device and electronic equipment
US11948213B2 (en) 2020-09-28 2024-04-02 Tata Consultancy Services Limited Method and system for sequencing asset segments of privacy policy
CN112188295A (en) * 2020-09-29 2021-01-05 ęœ‰åŠå²›(åŒ—äŗ¬)äæ”ęÆē§‘ęŠ€ęœ‰é™å…¬åø Video recommendation method and device
US12167059B2 (en) 2020-09-29 2024-12-10 Bigo Technology Pte. Ltd. Method and electronic device for recommending videos, and storage medium
US11568084B2 (en) 2020-11-12 2023-01-31 Tata Consultancy Services Limited Method and system for sequencing asset segments of privacy policy using optimization techniques
CN113259727A (en) * 2021-04-30 2021-08-13 å¹æå·žč™Žē‰™ē§‘ęŠ€ęœ‰é™å…¬åø Video recommendation method, video recommendation device and computer-readable storage medium
CN114491094A (en) * 2022-01-12 2022-05-13 中国人民大学 Multimedia resource determination method, prediction model training method and device
CN116150427A (en) * 2023-02-24 2023-05-23 åŒ—äŗ¬ē™¾åŗ¦ē½‘č®Æē§‘ęŠ€ęœ‰é™å…¬åø Video recommendation method, device, electronic device and storage medium

Also Published As

Publication number Publication date
CN111279709A (en) 2020-06-12
EP3834424A4 (en) 2022-03-23
CN111279709B (en) 2022-11-08
US20210144418A1 (en) 2021-05-13
EP3834424A1 (en) 2021-06-16

Similar Documents

Publication Publication Date Title
WO2020029235A1 (en) Providing video recommendation
US20220256210A1 (en) Video classification and preview selection
US10115433B2 (en) Section identification in video content
US10958981B2 (en) Optimizing timing of display of a video overlay
US9253511B2 (en) Systems and methods for performing multi-modal video datastream segmentation
US9892109B2 (en) Automatically coding fact check results in a web page
US9055343B1 (en) Recommending content based on probability that a user has interest in viewing the content again
US9465435B1 (en) Segmentation of a video based on user engagement in respective segments of the video
KR102795023B1 (en) Predicting topics of potential relevance based on retrieved/created digital media files
US20160014482A1 (en) Systems and Methods for Generating Video Summary Sequences From One or More Video Segments
US20150331856A1 (en) Time-based content aggregator
US11979465B2 (en) Recommending media content to a user based on information associated with a referral source
US20100088726A1 (en) Automatic one-click bookmarks and bookmark headings for user-generated videos
US20150127643A1 (en) Digitally displaying and organizing personal multimedia content
US20240087547A1 (en) Systems and methods for transforming digital audio content
US11126682B1 (en) Hyperlink based multimedia processing
CN110476162A (en) Use Navigation Mnemonics to Control Displayed Activity Information
US20190278801A1 (en) Methods, Systems, and Products for Recalling and Retrieving Documentary Evidence
US12568282B2 (en) Content display method, apparatus, device and medium
CN112445921B (en) Abstract generation method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18929802

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2018929802

Country of ref document: EP

Effective date: 20210310