WO2025096294A1 - Systèmes et procédés de classification de médicaments à doses multiples - Google Patents
Systèmes et procédés de classification de médicaments à doses multiples Download PDFInfo
- Publication number
- WO2025096294A1 WO2025096294A1 PCT/US2024/052929 US2024052929W WO2025096294A1 WO 2025096294 A1 WO2025096294 A1 WO 2025096294A1 US 2024052929 W US2024052929 W US 2024052929W WO 2025096294 A1 WO2025096294 A1 WO 2025096294A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- dosage
- observational
- compound
- sequence
- indication
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/20—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/16—Devices for psychotechnics; Testing reaction times ; Devices for evaluating the psychological state
- A61B5/165—Evaluating the state of mind, e.g. depression, anxiety
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H20/00—ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance
- G16H20/10—ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance relating to drugs or medications, e.g. for ensuring correct administration to patients
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/70—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/7264—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
- A61B5/7267—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems involving training the classification device
Definitions
- BACKGROUND [0003] The evaluation of compounds (e.g., as part of a drug development program) can involve non-human animal trials. Such animal trials can provide the safety and efficacy data necessary to support subsequent human trials. [0004] Observational assessments can be an important component of such trials. The effects of experimental substances on animal physiology or behavior can provide information about the potential clinical effects of the experimental substances on human subjects. For example, the discovery that chlorpromazine produces differential effects on avoidance and escape behavior in animals encouraged the evaluation of the behavioral effects of other experimental antipsychotic drugs. [0005] An observational assessment can include recording the physiology or behavior of an animal treated with a compound. Observational platforms can automatically collect such Attorney Docket No.16048.0009-00304 observational data.
- observational platforms can collect EEG signals from behaving animals administered varying dosages of a compound.
- observational platforms can collect EEG signals from behaving animals administered varying dosages of a compound.
- observational platforms can collect EEG signals from behaving animals administered varying dosages of a compound.
- observational platforms can collect EEG signals from behaving animals administered varying dosages of a compound.
- observational platforms can collect EEG signals from behaving animals administered varying dosages of a compound.
- observational platforms can collect EEG signals from behaving animals administered varying dosages of a compound.
- observational platforms can collect EEG signals from behaving animals administered varying dosages of a compound.
- observational platforms can collect EEG signals from behaving animals administered varying dosages of a compound.
- observational platforms can collect EEG signals from behaving animals administered varying dosages of a compound.
- observational platforms can collect EEG signals from behaving animals administered varying dosages of a compound.
- observational platforms can
- the instructions can cause the system to perform operations.
- the operations can include obtaining a sample corresponding to a compound.
- the sample can include electroencephalographic spectrograms corresponding to different positions in a dosage sequence.
- the operations can include generating an indication of a predicted therapeutic effect of the compound when administered to a human for each position in the dosage sequence by applying the sample to a machine learning model.
- the machine learning model can be configured to generate separate encodings for the electroencephalographic spectrograms and generate the indication using the separate encodings.
- the operations can include providing the indication.
- the disclosed embodiments include a system for classifying compounds.
- the system can include at least one processor and at least one non-transitory, computer-readable medium.
- the operations can further include providing the indication.
- the disclosed embodiments include a method of training a machine learning model to classify compounds.
- the method can include obtaining a first sample corresponding to a first compound.
- the first sample can include a first observational dataset corresponding to a first position in a first dosage sequence and acquired from a first non-human animal administered a first dosage of the first compound during a first trial.
- the method can include labeling the first sample with an indication of at least one first predicted therapeutic effect of the first compound when administered to a human.
- the method can include training the machine- learning model, using a training dataset including the first sample, to generate an indication of at least one predicted therapeutic effect of a compound when administered to a human using a sample including observational datasets corresponding to positions in a dosage sequence.
- the machine-learning model can be configured to generate separate encodings corresponding to different positions in the dosage sequence and generate the indication using the separate encodings.
- FIGs.1A to 1C depict a system for acquiring (an optionally analyzing) observational data concerning a non-human animal, consistent with disclosed embodiments.
- FIG.2 depicts observational datasets generated using data acquired during a set of trials, consistent with disclosed embodiments.
- FIG.3 depicts an exemplary method for training a machine learning model to predict therapeutic effect(s) of a compound when administered to a human, consistent with disclosed embodiments.
- FIG.4 depicts an exemplary method for predicting therapeutic effect(s) of a compound when administered to a human, consistent with disclosed embodiments.
- FIG.5 depicts an exemplary architecture of a machine learning model for predicting a therapeutic effect of a compound in humans using observational data obtained from non- human animals, consistent with disclosed embodiments.
- FIG.6 depicts an exemplary system for generating machine learning models, consistent with disclosed embodiments.
- Attorney Docket No.16048.0009-00304 DETAILED DESCRIPTION [0018]
- Machine learning models consistent with disclosed embodiments can predict the effect(s) of a compound in humans based on the effect(s) of the compound in non-human animals. Such machine learning models can generate a prediction using observational datasets acquired during multiple trials. The trials can involve administration of different dosages of the compound to one or more non-human animals. In some instances, the trial may be intended to investigate the effect of the compound on the behavior of the animal.
- the trial may be conducted as part of a drug discovery study of the compound.
- Compounds can exhibit differing relationships between dosage and observed effects. Such relationships can be predictive of the effect of the compound in humans. For example, as the dosage of an administered compound is increased, the effect of the compound on the behavior of a non-human animal can follow a characteristic trajectory in some feature space. A machine learning model can be trained to associate this characteristic trajectory with predicted therapeutic effect(s) in humans. Attorney Docket No.16048.0009-00304 [0021] Predictions generated using observational datasets corresponding to a sequence of dosage levels may therefore be more accurate than predictions generated using observational datasets corresponding to a single dosage level.
- dosage levels in the sequence of dosage levels can correspond to different dosages for different compounds, consecutive dosage levels can correspond to the same dosage, and the observational datasets corresponding to different dosage levels may be acquired from different animals.
- a machine learning model can be configured to use either increasing sequences of dosage levels (e.g., given a sequence of dosages ⁇ ⁇ ⁇ , ... , ⁇ ⁇ ⁇ and corresponding dosage levels ⁇ 1, ... , ⁇ , then ⁇ ⁇ ⁇ ⁇ ⁇ for ⁇ ⁇
- the disclosed embodiments are discussed with respect to increasing sequences of dosage levels. However, the disclosed embodiments are not so limited. [0023] The robustness and flexibility of the disclosed embodiments provide additional benefits beyond improved accuracy.
- the disclosed embodiments permit reuse of historical observational data acquired under a variety of trial protocols (e.g., protocols having differing dosage levels, differing numbers of dosage levels, different animals administered different dosage levels, or the like). Historical observational data can be reformulated into observational datasets corresponding to a sequence of dosage levels and used for training or inference.
- the disclosed embodiments permit reuse of existing libraries of trial data for drug discovery purposes. For at least these reasons, the disclosed embodiments amount to a technical improvement in, at least, the field of drug discovery.
- observational data can include physiological data (e.g., electrophysiological data, such as electroencephalography signals, electrocardiography signals, electromyography signals, neural recordings, or the like; heart Attorney Docket No.16048.0009-00304 rate; respiration; oxygenation; thermal data, such as core body temperature, ocular temperature, paw temperature, or the like; or other physiological data) activity data (e.g., position, motion, force, head location, body center, paw positions, or the like); and/or other suitable measures.
- physiological data e.g., electrophysiological data, such as electroencephalography signals, electrocardiography signals, electromyography signals, neural recordings, or the like; heart Attorney Docket No.16048.0009-00304 rate; respiration; oxygenation; thermal data, such as core body temperature, ocular temperature, paw temperature, or the like; or other physiological data
- activity data e.g., position, motion, force, head location, body center, paw positions, or the like
- observational data can be acquired and stored
- the observational data can be or include multiple channels of repeated measurements over time (e.g., time series date).
- the observational data can be or include multiple channels of EEG signals.
- EEG signals can be acquired at up to 20kHz.
- the number of measurements included in a channel can be extremely large.
- a channel of EEG data collected at a sampling rate of 1 kHz for 30 minutes can include nearly 2 million measurements.
- the observational data can use large amounts of memory, such as memory on a graphics processing unit.
- the observational data can be acquired during the trial using one or more testing systems.
- a testing system can include a controlled environment configured with sensors.
- the testing system can use the sensors to acquire observational data from a non-human animal disposed within the controlled environment during the trial.
- the non-human animal may be administered the compound by any suitable method (e.g., by injecting the compound or feeding the compound to the animal).
- the trials can be performed using the same non-human animal (or different animals of the same type of non-human animal).
- a trial can include a pre-dosing and post-dosing period.
- the pre-dosing period can be between five minutes and two hours, or Attorney Docket No.16048.0009-00304 more.
- the post-dosing period can be between 10 minutes and four hours, or more.
- observational data acquired in the pre-dosing period can be used to normalize observational data acquired in the post-dosing period.
- normalized observational data from the post-dosing period can be applied to a machine-learning model to predict the effect of the compound in humans.
- both pre-dosing and (unnormalized) post-dosing observational data can be applied to a machine-learning model to predict the effect of the compound in humans.
- the sensors of a testing system can include electrophysiological recording devices (e.g., electroencephalography (EEG) recording devices, or the like), cameras, piezoelectric sensors, infrared detectors, radiofrequency detectors, and the like.
- a sensor can include both an emitter such as an infrared beam, a radiofrequency, a source of heat, a source of optical signals, or other such source, as well as a receiver, such as a component configured to receive data or electronic communications.
- Multiple cameras can be used to acquire depth or 3D information.
- the particular configuration of sensors can be selected or adapted based on the subject and/or the compound administered.
- the sensors may be in contact with the research animal (e.g., electrophysiological sensor electrodes, actigraphy accelerometers, etc.), in the experimental chamber (cameras, piezoelectric sensors, accelerometers connected to the chamber, etc), or at a distance from the experimental chamber (cameras, infrared sensors, etc), or a combination thereof.
- FIG.1A depicts an exemplary schematic of a system 110 for acquiring (and optionally analyzing) observational data concerning a non-human animal 115, consistent with disclosed embodiments.
- System 110 may include a control computer 50 communicating 135 Attorney Docket No.16048.0009-00304 with a computer-controlled enclosure 120.
- System 110 can include sensors configurable to acquire data during an experiment on a non-human animal 115.
- System 110 can include a computer-controlled enclosure 120 configured to contain the non-human animal 115 during the experiment.
- the computer-controlled enclosure 120 can include actuators configurable to interact with non-human animal 115.
- sensors and/or actuators may include an aversive stimulus probe 118 that may be deployed and retracted, a motor challenge 126 that may be deployed or retracted so as to force non-human animal 115 to walk on a plurality of physical obstacles 124 arranged in an array with a predefined pitch to provide a motor challenge, lighting 104 that may be configured to change illumination intensity levels and/or light wavelengths as applied to the non-human animal 115, a tactile stimulator 116 to administer tactile stimuli to the non-human animal 115, a top camera 106, a top thermal camera 102, a first side camera 128, one or more second side camera(s) 114, a floor force sensor 122, waterers and feeders 129, and additional actuators for applying any additional suitable stimuli to the non-human animal 115 and/or any additional sensors.
- aversive stimulus probe 118 that may be deployed and retracted
- a motor challenge 126 that may be deployed or retracted so as to force non-human animal 115 to walk on
- computer-controlled enclosure 120 can include social pod 112 (e.g., shown as an opening in enclosure 120 in FIG.1A that enables social pod 112 to be affixed to enclosure 120 via the opening). Affixing social pod 112 to computer-controlled enclosure 120 can enable a second non-human animal in social pod 112 to interact with non- human animal 115.
- computer-controlled enclosure 120 can include a 3D camera.
- a 3D camera can be implemented using multiple 2D cameras configured and arranged around enclosure 120 to obtain 3D image data.
- the 3D camera can be implemented using at least one of top camera 106, the first side camera 128, the second side cameras 114, or additional cameras.
- the first side camera 128 and the second side cameras 114 may be oriented to capture images in the computer-controlled enclosure 120 from different perspectives.
- An angle between the orientation of the first side camera 128 and each of the one or more second side cameras 114 may be any suitable angle to capture movement throughout the computer-controlled enclosure 120.
- the angle may be 90 degrees, though other angles may be used, such as, e.g., any angle from about 1 degree to about 179 degrees, such that imagery from both the first side camera 128 and each of the one or more second side camera 114 may be processed to determine movement within the computer-controlled enclosure 120.
- FIG.1B depicts a view of computer-controlled enclosure 120, considered with disclosed embodiments. Depicted in this view are the non-human animal 115, the top camera 106, the thermal camera 102, the adverse stimulus probe 118, the tactile stimulator 116, waterers and feeders 129, motor challenge 126, and sensor interface circuitry 168. Also depicted is the location at which social pod 112 can be attached to computer-controlled enclosure 120. FIG.1C depicts a view of social pod 112, which can be attached to computer- controlled enclosure 120 at the location depicted in FIG.1B. [0035] In some embodiments, the plurality of sensors may include sensors associated with some actuators to capture specific responses to the actuators.
- the plurality of sensors may be combined with multiple actuators to challenge the test subject to react to various events, which are recorded and analyzed by the system.
- the resulting ethophysiogram, or collection of physiological and behavioral responses may create a dataset (e.g., a content-rich dataset) suitable for use with the disclosed systems and methods.
- control computer 150 may include at least one processor 160 for executing suitable computer applications as described herein for Attorney Docket No.16048.0009-00304 performing the analysis of the behavior of the non-human animal 115, at least one memory and/or suitable storage device denoted by memory 162 for storing the computer code and any databases used in the analyses of the acquired data over the predetermined time period, control circuitry 164 for controlling the plurality of actuators in accordance with the experimental plan, sensor interface circuitry 168 for outputting data from the plurality of sensors, image device interface circuitry 170 for receiving the output data from any or all of the cameras and/or thermal cameras, input and output (I/O) devices 172, and/or communication circuitry 192 to enable the control computer 150 to communicate over any suitable communication network.
- processor 160 for executing suitable computer applications as described herein for Attorney Docket No.16048.0009-00304 performing the analysis of the behavior of the non-human animal 115
- memory 162 for storing the computer code and any databases used in the analyses of the acquired data over the pre
- the I/O devices 172 may include, for example, a display 186 and/or a keyboard 184. Keyboard 184 may allow a user or operator of system 110 to input data to the control computer 150. At least one processor 160 may control a graphic user interface (GUI) 188 shown on the display 186. The GUI 188 may display any suitable parameters and/or data visualizations related to the experimental session and/or results of analyses of the data acquired by the plurality of sensors coupled to enclosure 120 in accordance with the experimental plan.
- a head mount 190 may be placed on the subject’s head (e.g., skull).
- the head mount may include at least one electrode to measure brain electrical activity such as electroencephalographic (EEG) signals, for example.
- the head mount 190 may also include at least one accelerometer.
- the signals from the at least one electrode and/or the at least one accelerometer in the head mount 190 may be coupled via wires to circuitry 108 that can relay the signals for processing to the at least one processor 160.
- system 110 can be configured to acquire electrophysiological data, such as pharmaco-EEG (pEEG) data or the like.
- System 110 can be configured to record actigraphy and quantitative pEEG from one or more brain regions of unanesthetized non- human animals before and after administration of a compound.
- electrophysiological data can be used, consistent with disclosed embodiments, to identify novel compounds that have a desired effect on electrophysiological activity (e.g., pEEG activity).
- system 110 can be configured to phenotype disease models including autism spectrum disorder, rare genetic epilepsies, Huntington Disease, Alzheimer’s Disease, or the like.
- electrophysiological data can yield pharmaco-dynamic signatures specific to pharmacological action, such data can be used to evaluate translational biomarkers in CNS disorders and rapidly screen compounds for potential activity at specific pharmacological targets to provide valuable information for guiding the early stages of drug development.
- the observational dataset can include or depend upon an EEG signal acquired from the non-human animal.
- the observational dataset can be or include the time-dependent frequency components of the EEG signal.
- the observational data can be or include EEG frequency band data (e.g., signal amplitude or power in the delta, theta, alpha, beta, or gamma bands) or spectrogram data.
- EEG frequency band data e.g., signal amplitude or power in the delta, theta, alpha, beta, or gamma bands
- spectrogram data e.g., signal amplitude or power in the delta, theta, alpha, beta, or gamma bands
- spectrogram data e.g., signal amplitude or power in the delta, theta, alpha, beta, or gamma bands
- the observational dataset can include EEG signals (e.g., times series of voltage or current amplitudes, or the like), or can be a multivariate dataset additionally including other physiological data or activity data of the non-human animal.
- EEG signals e.g., times series of voltage or current amplitudes, or the like
- the disclosed embodiments are not limited to any particular parameters for generating the spectrogram data from the EEG signal (e.g., frequency range, window duration or shape, sampling frequency, or window overlap). Suitable spectrogram generation parameters can be determined based on the animal and the observed frequency components of the EEG signal.
- the frequency range can be between a lower bound less than 1, 5, or 10 Hz and an upper bound greater than 5, 10, 20, 50, or 100 Hz.
- the spectrogram can include, for example, 32 to 512 frequency values, or more. These frequency values can cover a frequency range from a lower value (e.g., a value between 1 and 100 Hz) to a higher value (e.g., a value between 50 and 500 Hz). The frequency values can be distributed within the frequency range uniformly, logarithmically, or according to some other distribution.
- the spectrogram can include 30 to 480 samples, or more. These samples can cover a duration of 30 minutes to 4 hours (e.g., corresponding to a sample period ranging from 3 seconds to 10 minutes). In some embodiments, the samples can be distributed uniformly, logarithmically, or according to some other distribution within the duration of the trial.
- FIG.2 depicts observational datasets generated using data acquired during a set of trials, consistent with disclosed embodiments.
- a non-human animal was administered at different dosage of Psilocybin.
- a sample, as described herein, can include the observational datasets arranged in a sequence of dosage levels, based on the administered dosages of Psilocybin.
- the observational datasets are arranged in a sequence of increasing dosage levels.
- the order in which the trials were conducted need not match the sequence of dosage levels.
- the observational dataset comprises EEG spectrograms generated from EEG signals acquired during the trials. Each EEG spectrogram corresponds to approximate 60 mins of trial time.
- the EEG spectrograms begin upon the administration of the Psilocybin and depict the normalized power of the EEG Attorney Docket No.16048.0009-00304 signal in frequency bands from zero to 100 Hz.
- the power was normalized against a baseline recording taken prior to the administration of Psilocybin.
- the normalized power is depicted as the percent change in power as compared to the baseline recording.
- the response to Psilocybin in this set of trials was characterized by an increase in power at lower frequencies that developed over time following administration of the compound.
- FIG.3 depicts an exemplary method 300 for training a machine learning model to predict therapeutic effect(s) of a compound when administered to a human, consistent with disclosed embodiments.
- the machine learning model can be trained to predict the effect(s) of the compound when administered to a human based on the effect(s) of administering the compound at varying dosages to one or more non-human animals.
- the machine learning model can be trained to predict the effect(s) of the compound when administered to a human at a particular dosage.
- the input to the machine learning model can be a sequence of observational datasets corresponding to trials at a sequence of dosage levels of the compound. For convenience of description, such observational datasets are described as being EEG spectrogram(s).
- method 300 can include generating a suitable training dataset and training the machine learning model.
- method 300 is described as being performed by a training system. However, this description is not intended to be limiting.
- multiple systems can collectively perform method 300.
- multiple systems can collectively perform one or more steps of method 300 (e.g., multiple systems can coordinate obtaining a dataset and processing it into a training dataset).
- different systems can perform different steps of method 300 (e.g., a first system can generate a training dataset and a second system can train the machine learning model using the training dataset).
- the training system can obtain an original dataset 310, consistent with disclosed embodiments.
- the training system can receive or retrieve at least a portion of the original dataset.
- the training system can access memory, database, or another system to obtain suitable observational datasets.
- a user of the system can provide the training system with at least a portion of the original dataset.
- the observational datasets can include measurements obtained from non- human animals during trials in which compounds were administered to the non-human animals.
- the training system can create at least a portion of the original dataset.
- the training system can be or include one or more systems for acquiring observational data (e.g., system 110, or the like).
- a user of the system for acquiring observational data can conduct trials, which can provide observational data to the training system.
- the training system can create samples for a training dataset, consistent with disclosed embodiments.
- the samples can be created using the original dataset obtained in step 310.
- a sample can include a sequence of a predetermined number of observational datasets.
- the machine learning model can include a corresponding sequence of encoders.
- creating the samples can include converting or formatting the observational data into a format expected by the machine learning model.
- the training system can generate suitable EEG spectrograms from EEG signals.
- Such generation can include Attorney Docket No.16048.0009-00304 preprocessing the EEG signals, prior to generating suitable EEG spectrograms (e.g., cropping or zero-padding the EEG signal to accord for differences in trial length, filtering the EEG signal, removing, or correcting artifacts or noise, changing or normalizing EEG signal amplitudes, resampling the EEG signal, or other suitable operations).
- the training system can then generate EEG spectrograms from the processed EEG signals according to any suitable method.
- creating the samples can include performing data augmentation operations. Such operations can include generating observational datasets using other observational datasets, generating multiple samples using the same observational datasets, generating samples using observational datasets acquired from multiple different animals, or other suitable data augmentation operations. As may be appreciated, such data augmentation operations can improve the performance of the machine learning model.
- such data augmentation can reduce overfitting, address imbalances in amounts of training data for different therapeutic classes in the original dataset (e.g., arising from unequal numbers of trials having been performed with different compounds), improve the robustness of the machine learning model to inter-subject variability, improve the robustness of the machine learning model to missing observational data (e.g., missing dosage levels), and the like.
- the training system can be configured to generate observational datasets from other observational datasets.
- the training system can be configured to generate an observational dataset by combining other observational datasets.
- the generated observational dataset and the other observational datasets can correspond to the same combination of compound and dosage level.
- the training system can be configured to generate an EEG spectrogram Attorney Docket No.16048.0009-00304 corresponding to a 0.1 mg/kg dose of Psilocybin by averaging other EEG spectrograms corresponding to a 0.1 mg/kg dose of Psilocybin.
- the other EEG spectrograms can correspond to trials conducted in different non-human animals (e.g., a first trial may have been performed using a first rat and a second trial may have been performed using a second rat).
- the average can be a weighted average.
- the training system can generate additional observational datasets (e.g., to increase the number of samples for a class).
- the generated observational dataset and the other observational datasets can correspond to differing dosage levels of the same compound.
- the training system can be configured to generate an EEG spectrogram corresponding to a 0.32 mg/kg dose of Psilocybin by averaging an EEG spectrogram corresponding to a 0.1 mg/kg dose of Psilocybin (e.g., acquired from a non-human animal, such as a first rat) and an EEG spectrogram corresponding to a 1.0 mg/kg dose of Psilocybin (e.g., acquired from the same or a different non-human animal, such as a second rat).
- the average can be a weighted average.
- the training system can generate observational datasets corresponding to intermediate dosage levels (e.g., when such observational datasets are not available, or insufficient in number, in the original dataset).
- the original dataset may include observational datasets corresponding to three repeats for each of four of six dosage levels, but only a single repeat for each of the second and fourth dosage levels.
- the training system can generate two additional repeats for the second dosage level using observational datasets for the first and third dosage levels (e.g., an EEG spectrogram for the second level can be generated as a weighted average of EEG spectrograms obtained for the first and third dosage levels).
- the training system can also generate two additional repeats for the fourth dosage level using observational datasets for the third and fifth dosage levels (e.g., an EEG spectrogram for the fourth level can be generated as a weighted average of EEG spectrograms Attorney Docket No.16048.0009-00304 obtained for the third and fifth dosage levels). In this manner, the training system can ensure that there are three repeats for each of the six dosage levels.
- the training system can be configured to generate multiple samples using the same observational datasets.
- the original data set may include observational datasets corresponding to three repeats each for six dosage levels. The repeats may or may not have been performed in the same animal.
- the training system can use these observational datasets to generate, for example, 729 samples (e.g., a sample for each unique combination of repeat and dosage level).
- additional “repeats” can be generated by averaging pairs of samples within each dosage level, permitting the generation of additional samples.
- a compound has been administered at three dosages to two mice (e.g., ⁇ ⁇ for ⁇ ⁇ 1,2,3 and ⁇ ⁇ 1,2) in six trials.
- the training system can be configured to generate additional samples by blending samples at different dosages and/or in different animals.
- additional sample can be generated as a linear combination of existing samples acquired in trials using different dosages and/or in different animals: ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ where ⁇ ⁇ animals, ⁇ ⁇ dosages, and ⁇ ⁇ ⁇ ⁇ 0,1 ⁇ and ⁇ ⁇ ⁇ ⁇ 1.
- such a linear combination can include samples drawn from fewer than two animals subjects.
- when generating a sample for a dosage level such a linear combination can include only samples at the dosage level, or at adjacent dosage levels.
- the linear combination can only include samples obtained at the fourth, fifth, or sixth dosage levels.
- a compound has been administered at three dosages to two mice (e.g., ⁇ ⁇ for ⁇ ⁇ 1,2,3 and ⁇ ⁇ 1,2) in six trials.
- the training system can be configured to generate additional samples ⁇ ⁇ for ⁇ ⁇ 2 using ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ for ⁇ ⁇ 2.
- n is the number of dosages, e.g., three in this example
- m is the number of animals, e.g., two in this example.
- the blending matrix can be further parameterized. For ⁇ ⁇ ⁇ ⁇ ⁇ 1 ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇
- the first dosage of the third generated sample can be a combination of the observational data for the first and second dosages of the two measured samples, according to the blending coefficients 0 ⁇ ⁇ , ⁇ ⁇ 1.
- ⁇ controls the weight given to samples at the first dosage level versus the second dosage level
- ⁇ controls the weight given to the samples Attorney Docket No.16048.0009-00304 at the first animal versus the second animal.
- the value of ⁇ can be chosen randomly each time a new sample is generated.
- the training system can have a set of blending matrices for each dosage level and select among them (e.g., deterministically, or according to a probability distribution). For example, the training system can generate a second dosage level by selecting among three blending matrices ⁇ ⁇
- the disclosed embodiments are not limited to embodiments that expressly perform matrix multiplication using blending matrices (or expressly sample such blending matrices). Instead, the disclosed embodiments can include approaches that similarly generate new samples by combining observational data for different dosage levels and/or different animals.
- Attorney Docket No.16048.0009-00304 [0060]
- the training system may include multiple observational datasets corresponding to the same dosage in a sample (e.g., duplicate observational datasets) for a compound.
- the number of duplicate observational datasets included in the sample can depend on the number of dosages for which observational datasets are available for the compound in the original dataset.
- the number of duplicate observational datasets included in the sample can further depend on the architecture of the machine learning model (e.g., the number of encoders in the machine learning model). For example: ⁇ ⁇ ⁇ max ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ , 0 ⁇ where ⁇ ⁇ is the number of duplicate observational datasets, ⁇ ⁇ is the number of encoders, and ⁇ ⁇ is the number of dosage levels for which observational data is available in the original dataset.
- the training system can be configured to associate the same dosages with multiple differing dosage levels when creating a sample.
- the training system can be configured to map the same dosage to multiple dosage levels. For example, training system can map the lowest dosage to multiple dosage levels (e.g., ⁇ ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ ⁇ ).
- training system can map the highest dosage to multiple dosage levels (e.g., ⁇ ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ ⁇ ).
- the training system can repeat each dosage once (e.g., ⁇ ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ ⁇ ).
- the training system can select, for each dosage level, an observational dataset corresponding to the dosage associated with that dosage level. These observational datasets can be included in the sample.
- the machine learning model can be configured to accept a sample including observational data for six dosage levels.
- the training system can be configured to obtain a sequence of observational data for the first three dosage levels (e.g., ⁇ ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ ⁇ , with ⁇ ⁇ being observation data from a first animal at a first dosage level, ⁇ ⁇ being observation data from a second animal at a second dosage level, and ⁇ ⁇ being observation data for a third dosage level created using blending matrices as described herein, or the like).
- the training system can be configured to generate subsequences of these three dosage levels and then pad them out to the final sequence length of six samples. For example: ⁇ ⁇ ⁇ , ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ , ⁇ ⁇ ⁇ , ⁇ ⁇ ⁇ ⁇ , ⁇ , ⁇ ⁇ ⁇ ⁇ , ⁇ , ⁇ , ⁇ , ⁇ ⁇ ⁇ [0063]
- the same dosages can be associated with the same positions in the sequence of dosage levels for samples corresponding to the same compound.
- all samples corresponding to Psilocybin may include three dosage levels corresponding to dosages of 0.1 mg/kg, 0.32 mg/kg, and 1.0 mg/kg.
- different dosages can be associated with the same positions in the sequence of dosage levels for samples corresponding to the different compounds.
- a dosage of 0.1 mg/kg can correspond to the first dosage level for a sample corresponding to Psilocybin.
- dosages of 5 mg/kg, 10 mg/kg, and 20 mg/kg can correspond to the first, second, and third dosage levels for samples corresponding to Tiagabine.
- dosages of 5 mg/kg, 20 mg/kg, and 40 mg/kg can correspond to the first, second, and third dosage levels for samples corresponding to Nefazodone.
- dosages of 2 mg/kg, 5 mg/kg, and 10 mg/kg can correspond to the first, second, and third dosage levels for samples corresponding to Oxazepam.
- Attorney Docket No.16048.0009-00304 [0064] the training system (or a user thereof) may obtain a mapping of dosages to dosage levels for a compound. For example, historical trial data may exist for ten dosages of a compound, but the machine learning model may be configured to use only six dosage levels.
- the training system may therefore need to select a subset of the historical data for inclusion in the training dataset.
- a study may specify that trials will be performed at six dosage levels (e.g., because a machine learning model is configured to use only six dosage levels).
- the study may benefit from selecting, for the six dosages levels, dosages ranging from sub-therapeutic to supra-therapeutic dosages. Selecting such a range of dosages may enable the study to provide more suitable data for classification of therapeutic effects than a selection of dosages that are either entirely sub- therapeutic or entirely supra-therapeutic.
- a mapping for a compound from a dosage to a position in the sequence of dosage levels can be determined manually (or at least in part automatically).
- a user of the training system can specify the mapping based on prior knowledge of the effect of the compound in the non-human animal tested. For example, the user can map dosages to dosage levels based on a known sigmoidal relationship between dosage and observational data. The user can map dosages to dosage levels to cover the sub-linear, linear, and supra-linear regions of the sigmoidal relationship. In some instances, the training system can determine a relationship between dosage levels and observational data for one or more compounds (e.g., based on observational datasets in the original data). The training system may then create or suggest a mapping from dosage to dosage level for the compound.
- the training system can associate samples with therapeutic class(es), consistent with disclosed embodiments.
- the samples can include samples generated in step 320 of method 300 (or otherwise obtained).
- the training system can associate a sample corresponding to a compound with therapeutic class(es) based on known therapeutic effect(s) of the compound when administered to humans.
- a sample corresponding to Psilocybin can be associated with the therapeutic class “hallucinogen” based on the hallucinogenic effects of Psilocybin in humans.
- a sample corresponding to Sertraline can be associated with the therapeutic class “antidepressant” based on the antidepressant effects of Sertraline in humans.
- the training system can associate dosage levels in a sample with therapeutic class(es). As described herein, each dosage level can be associated with a dosage, and the dosage level can be associated with therapeutic class(es) based on known therapeutic effect(s) of the compound when administered to humans in that dosage. In some embodiments, the sample can therefore be associated with a vector of indications of therapeutic class(es).
- a compound when administered to humans at a first dosage, may act as an anxiolytic.
- the dosage level in the sample corresponding to the first dosage can be associated with an indication of the therapeutic class “anxiolytic.”
- the compound when administered to humans at a second, higher dosage, may act as a sedative.
- the dosage level in the sample corresponding to the second dosage can be associated with an indication of the therapeutic class “sedative.”
- the therapeutic class(es) can include antidepressant (MAO inhibitors, adrenergic and serotonergic receptors antagonists, mixed or specific serotonin and norepinephrine reuptake inhibitors, triple monoamine reuptake inhibitors, tricyclic Attorney Docket No.16048.0009-00304 antidepressants, etc.), analgesic (opiates, NSAIDs, etc.), anxiolytic (serotonin 1A receptor agonists, benzodiazepines, mGLUR5 antagonists, neurosteroids, etc.), antipsychotic (typical, atypical, and other novel antipsychotics), cognitive enhancer (nicotinic cholinergic agonists, cholinesterase inhibitors, phosphodiesterase-4 inhibitors, histamine receptor 3 ⁇ 4 antagonists, etc.), anticonvulsant (GABA), adrene
- the therapeutic class(es) can include a therapeutic antidepressant, high dose side-effect antidepressant, hallucinogen/psychedelic/treatment-resistant depression treatment, therapeutic antipsychotic, high dose antipsychotic, anxiolytic, sedative, cognitive enhancer/Alzheimer's treatment, psychostimulant/ADHD, mood stabilizer/anticonvulsant, side effect/toxic, other therapeutic area/inactive, sedative, vehicle, anxiogenic, analgesic, or the like.
- the disclosed embodiments are not limited to any particular enumeration of therapeutic classes.
- a sample can include observational datasets.
- a sample can be a data structure that contains the observational datasets.
- the sample can include one or more references to one or more observational datasets.
- the observational datasets can be stored in a database and a sample can include references to particular observational datasets stored in the database.
- the training system can train a machine learning model to predict therapeutic class(es) of a compound from a sample, consistent with disclosed embodiments.
- the predicted therapeutic class(es) can indicate therapeutic effect(s) of the compound when administered to a human.
- therapeutic class(es) can be predicted for the overall sample.
- therapeutic class(es) can be predicted for each dosage level in the sequence of dosage levels.
- the machine learning model can include an encoder portion and a classifier portion.
- the encoder portion can include one or more neural networks (e.g., feed-forward neural networks, recurrent neural networks, attention-based neural networks, or the like).
- the classifier portion can include one or more neural networks, random forests or decision trees, regression classifiers, support vector machine, nearest neighbor classifiers, or other suitable classifiers.
- the encoder portion can map the sample to a position in a high-dimensional space.
- the classifier portion can map the position in the high-dimensional space to the predicted therapeutic class(es).
- the encoder and classifier portions of the machine learning model can be trained together.
- the encoder and classifier portions can be trained at least partially separately.
- Attorney Docket No.16048.0009-00304 [0073] samples can include multiple portions of observational data acquired using multiple testing systems.
- the machine learning model can be adapted to accommodate the different portions of observational data. Such accommodation can occur in the encoder portion of the machine learning model or the classifier portion of the machine learning model.
- the encoder portion of the machine learning model can include multiple sub-encoders. Each sub-encoder can be configured to encode a portion of observational data acquired by a corresponding testing system.
- a first testing system can be configured to acquire first observational data using a first collection of sensors (e.g., electrophysiological data, or the like).
- Another testing system can be configured to acquire observational data using a second, differing set of sensors (e.g., visual or infrared cameras, accelerometers, piezoelectric sensors, or the like).
- a first sub-encoder can correspond to and process the first observational data into a first encoding.
- a second sub-encoder can correspond to and process the second observational data into a second encoding.
- a classifier portion of the machine learning model can be configured to generate a classification from the encodings generated by the multiple sub-encoders.
- sub-encoders can correspond to each combination of observational data portion and dosage level. For example, where there are two testing environments and six dosage levels, there may be 12 sub-encoders.
- the encoder portion of the machine learning model can include a single encoder.
- the single encoder can be configured to accept observational data acquired from any one of multiple testing systems.
- a source feature can be input to the encoder.
- the source feature can specify which testing system generated the observational data.
- the encoder can generate an encoding that is input to the classifier.
- Attorney Docket No.16048.0009-00304 [0076]
- the classifier portion of the machine learning model can include multiple sub-classifiers. In some embodiments, each sub-classifier can receive an encoding from a corresponding sub-encoder.
- a first portion of observational data generated using a first testing system can be input to a first sub-encoder to generate a first encoding.
- the first encoding can be input to a first sub-classifier.
- the first sub-classifier can be associated with the first testing system.
- the sub-encoders can share a common stem or base layer.
- one or more encodings can be input to a common base layer of the classifier.
- the classifier can branch into multiple sub-classifiers.
- the output of multiple sub-classifiers can be combined to generate a final classification output.
- the multiple outputs can be combined using one or more common output layers.
- the outputs of two sub-classifiers can be combined into an input for a final classification layer or set of classification layers.
- the multiple outputs can be combined using ensemble learning techniques (e.g., using a Bayesian classifier, bagging, boosting, model combination or averaging, or the like).
- the machine learning model can be configured to determine whether a compound has unspecified activity.
- the machine learning model can include an activity detector portion, in addition to the therapeutic prediction portion discussed above.
- the activity detector portion of the machine learning model can be implemented using a separate encoder and classifier, a separate classifier, or a separate classifier branch that shares one or more layers with other classifier(s) in the machine learning model.
- the activity detector portion can be configured to classify the compound as “active” (or an equivalent label) or “inactive” (e.g., vehicle, control, default, or some Attorney Docket No.16048.0009-00304 equivalent label).
- the output of the overall machine learning model can depend on the output of the therapeutic prediction portion (e.g., the therapeutic class(es) predicted for the different dosage levels) and the output of the activity detector portion. [0080] In some embodiments, the therapeutic prediction portion and the activity detector portion can output probability values.
- the overall output of the machine learning model can be the probabilities output by the therapeutic prediction portion. Otherwise, the overall output of the machine learning model can indicate that the class of the compound is unknown.
- a suitable probability value can be assigned to an “unknown – active” class, or a semantic equivalent. In some embodiments, a suitable probability value can be assigned to a “control” or semantically similar class.
- a machine learning model can be trained according to any suitable training arrangement, consistent with disclosed embodiments.
- the training dataset can be divided into at least a training portion and a validation portion.
- the machine learning model can be trained using the training portion of the training dataset and validated using the validation portion of the training dataset.
- the machine learning model can be initialized and training hyperparameters selected according to known methods.
- sub-machine learning models can be trained separately. For example, a first sub-machine learning model can be trained using the observational data from the first testing system and a second sub-machine learning model can be trained using observational data from the second testing system.
- the two sub-machine Attorney Docket No.16048.0009-00304 learning models can then be combined.
- a new architecture can be trained using a unified classifier and the two encoders of the sub-machine learning models.
- the outputs of the two existing classifiers can be combined (e.g., using ensemble techniques, or the like).
- the combined models can be further trained.
- Such training can be performed using a loss function configured to prioritize contributions from sub-machine learning models known to have superior performance with particular classes of outputs (e.g., anticonvulsants versus sedatives, or the like). [0083]
- training can be performed in batches.
- a set of samples can be applied to the machine learning model to generate a set of predicted therapeutic class(es).
- the predicted therapeutic classes can be compared to the “ground truth” therapeutic class(es) associated with the samples to generate an update.
- the machine learning model can then be updated based on the update. For example, weights or parameters of the machine learning model can be updated based on the update.
- the training system can train the machine learning model until a termination condition is satisfied.
- the termination condition can depend on the performance of the model (e.g., accuracy, precision, recall, a confusion matrix, a loss function value, or another suitable measure), a training time, an amount of data trained upon (e.g., a number of epochs, or the like), or any other suitable measure.
- the training system can store the trained machine learning model, use the trained machine learning model in a production environment (e.g., to generate predictions of therapeutic classes for compounds lacking known therapeutic effects), or provide the trained machine learning model to another system.
- FIG.4 depicts an exemplary method 400 for predicting a therapeutic effect of a compound on a human using a trained machine learning model, consistent with disclosed Attorney Docket No.16048.0009-00304 embodiments.
- the machine learning model can be trained as described with regards to method 300.
- the machine learning model can include an encoder portion and a classifier portion, as described herein.
- the machine learning model accepts as input a sample comprising a sequence of observational datasets.
- observational datasets are described as being EEG spectrogram(s).
- the observational datasets can alternatively or additionally include other physiological or activity data, as described herein.
- the machine learning model can predict therapeutic effect(s) based on the input sample.
- method 400 is described as being performed by a production system. However, this description is not intended to be limiting. In some embodiments, multiple systems can collectively perform method 400. In some embodiments, multiple systems can collectively perform one or more steps of method 400 (e.g., multiple systems can coordinate to generate encodings). In some embodiments, different systems can perform different steps of method 400 (e.g., a first system can obtain an observational dataset and a second system can generate encodings, generate an indication of therapeutic effect(s), and provide the indication of therapeutic effect). [0088] In step 410 of method 400, the production system can obtain a sample, consistent with disclosed embodiments.
- the sample can include a sequence of observational datasets corresponding to a sequence of dosage levels.
- the observational datasets can include or depend upon observational data acquired from non- human animals during trials in which the non-human animals were administered varying dosages of a compound.
- the observational datasets can be or depend upon observational data acquired from the same non-human animal (e.g., the same rodent).
- the observational dataset can include or depend upon observational data Attorney Docket No.16048.0009-00304 acquired from different non-human animals (e.g., different rodents of the same species, or same strain or type, or the like).
- the sequence of dosage levels can include observational datasets corresponding to a predetermined number of dosage levels. However, the dosage values corresponding to these dosage levels can differ from the dosage values used in training the machine learning model. As described herein, the machine learning model considers the sequence of dosage levels, not the dosage values.
- the production system can receive or retrieve the sample from a user of the production system, a memory, a database, or another system. In some embodiments, the production system can create the sample. In some embodiments, the production system can create the sample using observational data received or retrieved from a user, a memory, a database, or another system. In some embodiments, the production system can create the observational data.
- the production system can be or include a system for acquiring observational data (e.g., system 110, or the like).
- a user of the system for acquiring observational data can conduct trials, which can provide observational data to the production system.
- creating the sample can include converting or formatting the observational data into a format expected by the machine learning model, as described herein with regards to step 320.
- the training system can generate suitable EEG spectrograms from EEG signals.
- the production system can add duplicate observational datasets to the sample or interpolate additional levels between existing levels.
- the production system can generate a sequence of encodings using the sample, consistent with disclosed embodiments.
- the production system can generate the sequence of encodings by inputting the sample to an encoding portion of the machine learning model.
- the encoding portion of the machine learning model can include a sequence of encoders.
- each encoder in the sequence can correspond to a dosage level in the sequence of dosage levels.
- the sample can include observational datasets corresponding to the sequence of dosage levels. For each dosage level, the production system can apply the corresponding observational dataset to the corresponding encoder to generate a separate encoding in the sequence of encodings.
- the production system can apply the first observational dataset to the first encoder to generate a first encoding corresponding to the first dosage level in the sequence of dosage levels.
- the production system can predict therapeutic effect(s) of the compound, consistent with disclosed embodiments.
- the production system can generate the prediction by inputting the sequence of encodings to a classifier portion of the machine learning model.
- the classifier portion can generate a prediction of therapeutic effect(s) for the overall sample.
- the classifier portion can output a vector with elements corresponding to therapeutic classes. An element corresponding to a predicted therapeutic class can be one-valued. The remaining elements can be zero-valued.
- the elements corresponding to the therapeutic classes can have likelihood values. The greater the likelihood value, the more likely the compound belongs to the therapeutic class.
- the classifier portion can generate a prediction of therapeutic effects for each dosage level in the overall sample.
- the classifier portion can output a vector with elements corresponding to therapeutic classes for each dosage value.
- the elements of the vector for each dosage value can be zero or one-valued, or can be likelihood values.
- a compound may act as an anxiolytic at low dosages and a sedative at high dosages.
- the likelihood values for the element corresponding to the anxiolytic class can decrease as the dosage level increases, while the likelihood values for the element corresponding to the sedative class can increase as the dosage level increases.
- the element corresponding to the anxiolytic class can have the value one for lower dosage levels (e.g., zero otherwise) and the element corresponding to the sedative class can have the value one for higher dosage levels (e.g., zero otherwise).
- the production system can be configured to provide an indication of the predicted therapeutic effects.
- FIG.5 depicts an exemplary architecture 500 of a machine learning model for predicting a therapeutic effect of a compound in humans using observational data obtained from non-human animals, consistent with disclosed embodiments.
- a machine learning model trained according to method 300 or used to predict therapeutic effect(s) according to method 400 can be implemented using architecture 500.
- such a machine learning model can be configured to generate, using an encoder portion, n separate encodings (e.g., encodings 520-1 to 520-n) from a sample 501.
- n separate encodings e.g., encodings 520-1 to 520-n
- Such a machine learning model can further be configured to generate class label(s) 599 from the n separate encodings using a classifier 530.
- Sample 501 can include a sequence of n observational datasets corresponding to n dosage levels, where n can be greater than one. For example, n can be between 3 and 10, or more. In some embodiments, each of the observational datasets can include multiple channels.
- each of the observational datasets can correspond to different EEG signals obtained by different electrodes (or different electrode pairs, or the like).
- each of the observational datasets can include as many as 64 channels, or as many as 256 channels, or more channels. In some embodiments, fewer channels may be used.
- each of the observational datasets can include between 2 and 32 channels.
- each observational dataset can be applied to a corresponding encoder (e.g., encoder 510-1 to encoder 510-n). In some embodiments, each of the encoders can have the same or similar architectures.
- each of the encoders can be or include a convolutional neural network.
- an encoder can include an input component (e.g., input component 511-1), one or more repeats of a body component (e.g., body component 512-1), and an output component (e.g., output component 516-1).
- the input component can include one or more convolutional neural network layers configured to convert the observational dataset into a feature set.
- the feature set can include more channels, but lesser time and frequency resolution than the input observational dataset.
- the input component can include a normalization layer configured to batch normalize the output of the convolutional layers.
- the input component can include an activation Attorney Docket No.16048.0009-00304 layer configured to apply a suitable activation function to the output of the normalization layers. In some embodiments, the input component can apply a pooling or dropout layer to the output of the activation layer.
- each repeat of the body component can be configured to accept input features from a preceding component (e.g., the input component or another body component) and generate output features for provision to a subsequent component (e.g., the output component or another body component). In various embodiments, there can be between 1 and 10 repeats, or more, of the body component in the encoder.
- a body component can include two processing pathways.
- a first processing pathway can include one or more convolutional components (e.g., convolutional component 513-1).
- a convolutional component can include a convolutional layer.
- a convolutional component can further include additional batch normalization and activation layers.
- the output of the one or more convolutional components can be applied to an adaptive weighting block (e.g., weighting block 514-1) to reweight the channel in the output of the convolutional layer.
- the adaptive weighting block can be a squeeze and excitation block (or another suitable adaptive weighting block).
- a second pathway can include a second convolutional components (e.g., convolutional component 515-1).
- the second convolutional component can include a convolutional layer.
- the second convolutional component can further include additional batch normalization and activation layers.
- the second pathway can improve the training characteristics of the encoder by skipping the blocks in the first pathway.
- a combination component can generate the output of the body component by combining the outputs of the first and second pathway.
- the combination component can include a summation (or other function) layer (and optionally an activation layer).
- the body components can be configured to progressively reduce the time and frequency resolution of the feature sets, while increasing the number of channels.
- the input layer can generate a feature set having dimensions (16, 58, 62), where 16 is the number of channels and each channel has dimensions 58 by 62.
- a first body component can accept the feature set having dimensions (16, 58, 62) and produce a feature set having dimensions (64, 29, 31).
- a final body component can produce a feature set having dimensions (128, 4, 4).
- the output component can be configured to accept the multi- channel input and generate a single channel output.
- the output component can reshape the multi-channel input into a single channel and input that channel to at least one set of layers (e.g., one to ten sets of layers, or another suitable number).
- Each set of layers can include a feedforward layer (and optionally a batch normalization, activation, and/or dropout layer).
- the output of the output component can be the encoding (e.g., encoding 520-1 to 502-n).
- classifier 530 can be configured to accept the encodings.
- classifier 530 can be a convolutional neural network.
- the convolutional neural network can include convolution layers and activation layers.
- the convolutional neural network can further include batch normalization and dropout layers.
- classifier 530 can be configured to convert the set of n encodings into an m by 1 vector, where m is a number of possible therapeutic classes.
- the m by 1 vector can be a vector of likelihoods, indicating the likelihood that the compound associated with the sample belongs to each therapeutic class.
- an element of the m by 1 vector can be one-valued, indicating a most likely therapeutic class, and the remaining elements can be zero-valued.
- classifier 530 can be configured to convert the set of n encodings into n m by 1 vectors, where m is a number of possible therapeutic classes. Each of the n vectors can correspond to a dosage level in the sample.
- an m by 1 vector corresponding to a dosage can be a vector of likelihoods, indicating the likelihood that the compound associated with the sample, when administered at the dosage corresponding to the dosage level, belongs to each therapeutic class.
- an element of an m by 1 vector can be one-valued, indicating a most likely therapeutic class for the compound, when administered at the dosage corresponding to the dosage level, and the remaining elements can be zero-valued.
- architecture 500 is exemplary and not intended to be limiting.
- the encoders can be convolutional neural networks having a different internal structure than the structure disclosed in FIG.5.
- the classifier may not be a convolutional neural network.
- FIG.6 depicts an exemplary computing system 600 suitable for generating machine learning models, consistent with disclosed embodiments.
- the machine learning system described herein can be implemented using computing system 600.
- methods 300 or 400 can be performed, and/or architecture 500 can be implemented using computing system 600.
- the components and arrangement of components included in computing system 600 may vary.
- computing system 600 may include a larger or smaller number of processors, I/O devices, or memory units.
- computing system 600 may further include other components or devices not depicted that perform or assist in the performance of one or more processes consistent with the disclosed embodiments.
- the Attorney Docket No.16048.0009-00304 components and arrangements shown in FIG.6 are not intended to limit the disclosed embodiments, as the components used to implement the disclosed processes and features may vary.
- Processor 610 may comprise known computing processors, including a microprocessor. Processor 610 may constitute a single-core or multiple-core processor that executes parallel processes simultaneously. For example, processor 610 may be a single-core processor configured with virtual processing technologies. In some embodiments, processor 610 may use logical processors to simultaneously execute and control multiple processes.
- Processor 610 may implement virtual machine technologies, or other known technologies to provide the ability to execute, control, run, manipulate, store, etc., multiple software processes, applications, programs, etc.
- processor 610 may include a multiple-core processor arrangement (e.g., dual core, quad core, etc.) configured to provide parallel processing functionalities to allow execution of multiple processes simultaneously.
- processor arrangements e.g., dual core, quad core, etc.
- Processor 610 may execute various instructions stored in memory 630 to perform various functions of the disclosed embodiments described in greater detail below.
- Processor 610 may be configured to execute functions written in one or more known programming languages.
- Computer program code for carrying out operations may be written in any combination of one or more programming languages, including an object- oriented programming language such as PYTHON, JAVA, SMALLTALK, C++, or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages.
- the program code may execute entirely on the user’s computer, partly on the user’s computer, as a stand-alone software package, partly on the Attorney Docket No.16048.0009-00304 user’s computer and partly on a remote computer or entirely on the remote computer or server.
- I/O device 620 may include at least one of a display, an LED, a router, a touchscreen, a keyboard, a microphone, a speaker, a haptic device, a camera, a button, a dial, a switch, a knob, a transceiver, an input device, an output device, or another I/O device to perform methods of the disclosed embodiments.
- I/O device 620 may be configured to manage interactions between computing system 600 and other systems using a network.
- I/O device 620 may be configured to publish data received from other databases or systems not shown. This data may be published in a publication and subscription framework (e.g., using APACHE KAFKA), through a network socket, in response to queries from other systems, or using other known methods.
- I/O device 620 may be configured to provide data or instructions received from other systems.
- I/O device 620 may be configured to receive instructions for generating data models (e.g., type of data model, data model parameters, training data indicators, training parameters, or the like) from another system and provide this information to machine learning framework 635.
- data models e.g., type of data model, data model parameters, training data indicators, training parameters, or the like
- I/O device 620 may be configured to receive data from another system (e.g., in a file, a message in a publication and subscription framework, a network socket, or the like) and provide that data to programs or store that data in, for example, training data set 632, inference data 633, or model 634.
- I/O device 620 may include a user interface configured to receive user inputs and provide data to a user (e.g., a data manager).
- I/O device Attorney Docket No.16048.0009-00304 620 may include a display, a microphone, a speaker, a keyboard, a mouse, a track pad, a button, a dial, a knob, a printer, a light, an LED, a haptic feedback device, a touchscreen and/or other input or output devices.
- Memory 630 may be a volatile or non-volatile, magnetic, semiconductor, optical, removable, non-removable, or other type of storage device or tangible (i.e., non-transitory) computer-readable medium, consistent with disclosed embodiments.
- Memory 630 may include inference data 633 and training data set 632, including one of at least one of encrypted data or unencrypted data. Memory 630 may also include models 634, including weights and parameters of neural network models.
- Machine learning framework 635 may include one or more programs (e.g., modules, code, scripts, or functions) used to perform methods consistent with disclosed embodiments (e.g., TENSORFLOW, PYTORCH, KERAS, OPENCV, or the like). Programs may include operating systems (not shown) that perform known operating system functions when executed by one or more processors. Disclosed embodiments may operate and function with computer systems running any type of operating system. Machine learning framework 635 may be written in one or more programming or scripting languages.
- Machine learning framework 635 may include programs (scripts, functions, algorithms) to assist creation of, train, implement, store, receive, retrieve, and/or transmit one or more machine learning models.
- Machine learning framework 635 may be configured to assist creation of, train, implement, store, receive, retrieve, and/or transmit, one or more ensemble machine learning models (e.g., machine learning models comprised of a plurality of Attorney Docket No.16048.0009-00304 machine learning models).
- training of a model may terminate when a training criterion is satisfied.
- Training criteria may include number of epochs, training time, performance metric values (e.g., an estimate of accuracy in reproducing test data), or the like.
- Machine learning framework 635 may be configured to adjust model parameters and/or hyperparameters during training.
- machine learning framework 635 may be configured to modify model parameters and/or hyperparameters (i.e., hyperparameter tuning) using an optimization technique during training, consistent with disclosed embodiments.
- Hyperparameters may include training hyperparameters, which may affect how training of a model occurs, or architectural hyperparameters, which may affect the structure of a model.
- machine learning framework 635 may be configured to generate models based on instructions received from another component of computing system 600 and/or a computing component outside computing system 600.
- machine learning framework 635 can be configured to receive a visual (e.g., graphical) depiction of a machine learning model and parse that graphical depiction into instructions for creating and training a corresponding neural network.
- Machine learning framework 635 can be configured to select model training parameters.
- Machine learning framework 635 can be configured to provide trained models and descriptive information concerning the trained models to model memory 630.
- Any computer program instructions may also be stored in a computer readable medium that can direct one or more hardware processors of a computer, other programmable Attorney Docket No.16048.0009-00304 data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium form an article of manufacture including instructions that implement the function/act specified in the flowchart or block diagram block or blocks.
- the computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions that execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart or block diagram block or blocks.
- the embodiments may further be described using the following clauses: [0123] 1.
- providing the indication comprises providing, for each position in the dosage sequence, an indication that the compound, when administered to a human at a dosage corresponding to the position in the Attorney Docket No.16048.0009-00304 dosage sequence, is an antidepressant, anxiolytic, antipsychotic, cognitive enhancer, hallucinogen, mood stabilizer, or psychostimulant. [0125] 3.
- a system for classifying compounds comprising: at least one processor; and at least one non-transitory, computer-readable medium containing instructions that, when executed by the at least one processor, cause the system to perform operations comprising: obtaining a sample corresponding to a compound, the sample including observational datasets corresponding to positions in a dosage sequence, the observational datasets including a first observational dataset corresponding to a first position in the dosage sequence and acquired from a first non-human animal administered a first dosage of the compound; generating an indication of at least one predicted therapeutic effect of the compound when administered to a human by applying the sample to a machine learning model configured to: generate separate encodings corresponding to different positions in the dosage sequence, and generate the indication using the separate encodings, and provide the indication.
- the machine-learning model includes a sequence of encoders, the sequence of encoders including, at the first position, a first encoder; and generating the separate encodings includes applying the first observational dataset to the first encoder to generate a first separate encoding.
- the first encoder comprises a convolutional neural network.
- the machine-learning model includes a classifier; and generating the indication using the separate encodings includes inputting the separate encodings to the classifier.
- the classifier comprises a neural network.
- a method of training a machine learning model to classify compounds comprising: obtaining a first sample corresponding to a first compound, the first sample including a first observational dataset corresponding to a first position in a first dosage sequence and acquired from a first non-human animal administered a first dosage of the first Attorney Docket No.16048.0009-00304 compound during a first trial; labeling the first sample with an indication of at least one first predicted therapeutic effect of the first compound when administered to a human; and training the machine-learning model, using a training dataset including the first sample, to generate an indication of at least one predicted therapeutic effect of a compound when administered to a human using a sample including observational datasets corresponding to positions in a dosage sequence, the machine-learning model configured to: generate separate encodings corresponding to different positions in the dosage sequence, and generate the indication using the separate encodings.
- the first sample further includes a second observational dataset acquired from a second non-human animal.
- the first sample further includes: a second observational dataset generated using the first observational dataset and a third observational dataset corresponding to a third dosage of the first compound; and the second observational dataset corresponding to a second dosage of the first compound, the second dosage being greater than the first dosage and less than the third dosage.
- the indication of at least one predicted therapeutic effect comprises an indication of a predicted therapeutic effect for each position in the dosage sequence.
- the first observational dataset comprises a first electroencephalographic spectrogram.
- the machine learning model includes a number of encoders; and the first sample includes duplicate observational datasets, each duplicate corresponding to a different dosage of the first compound, the number of duplicates based in part on the number of encoders.
- Attorney Docket No.16048.0009-00304 [0141] 19.
- the training dataset includes samples corresponding to different compounds, the samples corresponding to different compounds including observational datasets corresponding to different dosage sequences, the different dosage sequences having equal lengths and including differing dosages. [0142] 20.
- the machine-learning model includes a sequence of encoders, the sequence of encoders including, at the first position, a first encoder; and training the machine-learning model, using the training dataset including the first sample, comprises applying the first observational dataset to the first encoder to generate a first separate encoding.
- a range defined by two endpoint values X and Y includes the endpoints X and Y, except where infeasible.
- the term “or” encompasses all possible combinations, except where infeasible.
- a component may include A or B, then, unless specifically stated otherwise or infeasible, the component may include A, or B, or A and B.
- the component may include A, or B, or C, or A and B, or A and C, or B and C, or A and B and C.
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Public Health (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Biomedical Technology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Primary Health Care (AREA)
- Software Systems (AREA)
- Epidemiology (AREA)
- Molecular Biology (AREA)
- Pathology (AREA)
- Evolutionary Computation (AREA)
- Mathematical Physics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- Biophysics (AREA)
- Databases & Information Systems (AREA)
- Computational Linguistics (AREA)
- Psychiatry (AREA)
- Surgery (AREA)
- Child & Adolescent Psychology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Veterinary Medicine (AREA)
- Developmental Disabilities (AREA)
- Educational Technology (AREA)
- Animal Behavior & Ethology (AREA)
- Social Psychology (AREA)
- Hospice & Palliative Care (AREA)
- Heart & Thoracic Surgery (AREA)
- Psychology (AREA)
- Chemical & Material Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Medicinal Chemistry (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
Un système d'apprentissage automatique peut prédire un effet thérapeutique d'un composé lorsqu'il est administré à des êtres humains. Le système d'apprentissage automatique peut également obtenir de multiples ensembles de données d'observation acquis pendant des essais dans lesquels un animal non humain a reçu des doses variables du composé. Le système d'apprentissage automatique peut aussi appliquer les multiples ensembles de données d'observation à un modèle d'apprentissage automatique entraîné pour prédire, sur la base de tels ensembles de données d'observation, un ou plusieurs effets thérapeutiques du composé lorsqu'il est administré à des êtres humains. Le modèle d'apprentissage automatique peut comprendre de multiples codeurs, chaque codeur correspondant à un niveau de dose. Le modèle d'apprentissage automatique peut en outre comprendre un classificateur conçu pour générer une ou plusieurs indications de classe(s) thérapeutique(s) à l'aide des sorties des codeurs.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202363594275P | 2023-10-30 | 2023-10-30 | |
| US63/594,275 | 2023-10-30 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025096294A1 true WO2025096294A1 (fr) | 2025-05-08 |
Family
ID=93455945
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2024/052929 Pending WO2025096294A1 (fr) | 2023-10-30 | 2024-10-25 | Systèmes et procédés de classification de médicaments à doses multiples |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2025096294A1 (fr) |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190142336A1 (en) * | 2016-05-19 | 2019-05-16 | The General Hospital Corporation | Systems and methods for determining response to anesthetic and sedative drugs using markers of brain function |
| CN113974557A (zh) * | 2021-10-28 | 2022-01-28 | 中国人民解放军陆军军医大学第二附属医院 | 基于脑电奇异谱分析的深度神经网络麻醉深度分析方法 |
| US20230083769A1 (en) * | 2021-09-14 | 2023-03-16 | City University Of Hong Kong | Machine learing based method of screening potential drug candidate, and a method thereof |
-
2024
- 2024-10-25 WO PCT/US2024/052929 patent/WO2025096294A1/fr active Pending
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190142336A1 (en) * | 2016-05-19 | 2019-05-16 | The General Hospital Corporation | Systems and methods for determining response to anesthetic and sedative drugs using markers of brain function |
| US20230083769A1 (en) * | 2021-09-14 | 2023-03-16 | City University Of Hong Kong | Machine learing based method of screening potential drug candidate, and a method thereof |
| CN113974557A (zh) * | 2021-10-28 | 2022-01-28 | 中国人民解放军陆军军医大学第二附属医院 | 基于脑电奇异谱分析的深度神经网络麻醉深度分析方法 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Goel et al. | Modified grasshopper optimization algorithm for detection of autism spectrum disorder | |
| Kiral-Kornek et al. | Epileptic seizure prediction using big data and deep learning: toward a mobile system | |
| Bergstrom et al. | Automated identification of multiple seizure-related and interictal epileptiform event types in the EEG of mice | |
| Chanu et al. | An automated epileptic seizure detection using optimized neural network from EEG signals | |
| US20240156414A1 (en) | Method for Predicting Age from Resting-State Scalp EEG Signals Using Deep Convolutional Neural Networks | |
| Maccione et al. | Microelectronics, bioinformatics and neurocomputation for massive neuronal recordings in brain circuits with large scale multielectrode array probes | |
| CN118551340B (zh) | 基于多尺度脑电特征融合的脑电信号分析方法及设备 | |
| Wang et al. | Automated rest EEG-based diagnosis of depression and schizophrenia using a deep convolutional neural network | |
| Raeisi et al. | EEG microstate biomarkers for schizophrenia: a novel approach using deep neural networks | |
| Aljalal et al. | Selecting EEG channels and features using multi-objective optimization for accurate MCI detection: validation using leave-one-subject-out strategy | |
| Said et al. | Spectral analysis and Bi-LSTM deep network-based approach in detection of mild cognitive impairment from electroencephalography signals | |
| Faiem et al. | Assessment of Parkinson’s disease severity using gait data: A deep learning-based multimodal approach | |
| Peterson et al. | Deep net detection and onset prediction of electrographic seizure patterns in responsive neurostimulation | |
| Sathishkumar et al. | Detection and Classification of Neuro-Degenerative Disease via EfficientNetB7 | |
| Shen et al. | A data-centric and interpretable EEG framework for depression severity grading using SHAP-based insights | |
| EP4631053A1 (fr) | Systèmes et procédés de conception de médicament phénotypique | |
| WO2025096294A1 (fr) | Systèmes et procédés de classification de médicaments à doses multiples | |
| Wang et al. | Deep learning-assisted non-invasive pediatric tic disorder diagnosis using EEG features extracted by residual neural networks | |
| WO2025101841A1 (fr) | Procédés et appareil de diagnostic de démence fronto-temporale à l'aide d'un marqueur d'eeg de cuir chevelu à l'état de repos d'interactions régionales dans le cerveau | |
| Chauhan et al. | A Hybrid CNN-BiLSTM Neural Network Architecture for Early Prediction of Parkinson's Disease | |
| Ranieri et al. | Unveiling Parkinson’s disease features from a primate model with deep neural networks | |
| Lee et al. | Machine learning for physical motion identification using EEG signals: a comparative study of classifiers and hyperparameter tuning | |
| EP4648674A1 (fr) | Systèmes et procédés d'utilisation de fonctions objectives neuronales pour une optimisation en boucle fermée | |
| Tengshe et al. | Automated algorithm for neurodegenerative disorder detection using gait-based features | |
| Mootoo et al. | Stochastic Sparse Sampling: A Framework for Variable-Length Medical Time Series Classification |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24804688 Country of ref document: EP Kind code of ref document: A1 |