Gating, also known as triggering, is a technique that acquires MRI data at a low motion state. An example of this could be acquiring an MRI slice only when the lung capacity is low (i.e. between large breaths). Gating is a very simple solution that can have a very large result.
Gating is best suited for mitigating breathing and cardiac artifacts. This is because these types of motion are repetitive, so we can leverage triggering acquisitions in a 'low motion state'. Gating is used for cine imaging, MRA, free-breathing chest scans, CSF flow imaging, and more.[13]
In order to gate correctly, the system needs to have knowledge of the patient's cardiac motion and breathing pattern. This is commonly done by using a pulse oximeter or EKG sensor to read a cardiac signal and/or a bellows to read the breathing signal.
A big disadvantage to gating is 'dead time', defined as time wasted due to waiting for a high motion state to pass. For example, we do not want to acquire an MRI image while someone is in the process of inhaling, since this would be a high motion state. So, we have many time periods where we are waiting for a high motion state to pass. This is even more prominent when we consider respiratory and cardiac gating together. The windows of time where the respiratory and cardiac motions are low are very infrequent, leading to high dead times. However, the advantage is that images acquired with both cardiac and respiratory gating have a significant improvement in image quality.[13]
The Pilot Tone method involves turning on a constant RF frequency to detect patient motion. More specifically, the MRI machine will detect the pilot tone signal when acquiring an image. The strength of the pilot tone signal at every TR will be proportional to the breathing/motion patterns of the patient. That is, the patient's movements will cause the received constant RF tone to be amplitude modulated. A very large advantage to the pilot tone is that it requires no contact with the patient.
Extracting a breathing signal using a pilot tone is simple in theory: One must place a constant frequency signal near the MRI bore, acquire an image, and take an FFT along the readout direction to extract the pilot tone. Technical considerations include choosing the RF frequency. The pilot tone must be detectable by the MRI machine, however must be carefully chosen not to interfere with the MRI image. The pilot tone shows up as a zipper (for a cartesian acquisition).[14]
The location of that line is determined by the frequency of the RF tone. For this reason, pilot tone acquisitions usually have slightly large FOVs, to make room for the pilot tone.
Once an image has been acquired, the pilot tone signal can be extracted by taking the FFT along the readout direction and plotting the amplitude of the resulting signal. The pilot tone will show up as a line (of varying amplitude) when taking an FFT along the readout direction. The pilot tone method can also be used prospectively to acquire cardiac images.[15]
The Pilot Tone method is great for detecting respiratory motion artifacts. This is because there is a very large and distinct modulation due to human breathing patterns. Heart signals are much more subtle and difficult to detect using a pilot tone. Retrospective techniques using the pilot tone are able to increase the level of detail and reduce blurring in free-breathing radial images.[14]
Targeted Motion Estimation and Reduction (TAMER) is a retrospective motion correction method developed by Melissa Haskell, Stephen Cauley, and Lawrence Wald. The method was first introduced in their paper Targeted Motion Estimation and Reduction (TAMER): Consistency Based Motion Mitigation for MRI using a Reduced Model Joint Optimization, as part of the IEEE Transactions on Medical Imaging Journal.[16] The method corrects motion-related artifacts by acquiring a joint estimation of the desired motion-free image and the associated motion trajectory by minimizing the data consistency error of a SENSE forward model that includes rigid-body subject motion.[16]
The TAMER Method utilizes the SENSE forward model (described below) that has been modified to include the effects of motion in a 2D multi-shot imaging sequence. Note: the following modified SENSE model is described in detail in Melissa Haskell's doctoral dissertation, Retrospective Motion Correction for Magnetic Resonance Imaging.[17]
Suppose that we have
coils. Let
be a
column vector of image voxel values where
is the number of k space samples acquired per shot and let
be the
signal data from
coils. Let
encoding matrix for a given
patient motion trajectory vector,
.
is composed of
many
sub-matrices
(encoding matrices for each shot
).
For each shot
, we have the sub-matrix
which is the encoding matrix for that particular shot
where:
is the
under-sampling operator
is the Fourier Encoding Operator
is the in-place translation operator
is the through-plane translation operator
is the rotation operator
SENSE Motion Forward Model: 
SENSE model Extended to describe a 2D multi-shot imaging sequence: 
The rigid-body motion forward model is nonlinear and the process of solving for estimations of both the motion trajectory and the image volume is computationally challenging and time-consuming. In the effort to speed up and simplify computations, the TAMER method separates the vector of image voxel values,
, into a vector of target voxel values,
, and a vector of fixed voxels,
. Given any choice of target voxels and fixed voxels, we have the following:


Note: The length of
only makes up about 5% of the total length of
.
Now the optimization can be reduced to fitting the signal contribution of the target voxels to the correct target voxel values
and the correct motion,
.
The TAMER algorithm has 3 main stages: Initialization, Jumpstart of Motion Parameter Search, and the Joint Optimization Reduced Model Search.
Initialization:
The first stage of the TAMER algorithm acquires the initial reconstruction of the full image volume,
, by assuming that all motion parameters are zero. One can solve for
by minimizing the least squared error of the SENSE forward model without motion i.e. solve the system
where
and
is the conjugate transpose of
.[16] We have discussed the notion of separating the sense model into
; however, we haven't yet discussed how the target voxels are chosen. Voxels that are strongly coupled together indicate motion. In a motion-free Cartesian acquisition, each voxel would only be coupled to itself, so our goal is to essentially un-couple these voxels. As described in the paper Targeted Motion Estimation and Reduction (TAMER): Consistency Based Motion Mitigation for MRI using a Reduced Model Joint Optimization, as part of the IEEE Transactions on Medical Imaging Journal, the TAMER algorithm converges fastest when choosing target voxels that are highly coupled.[16] The target voxels can be entirely determined by the sequence parameters and coil sensitivities.[16]
Target Voxel Selection Process:
- Group coils based on artifact properties. The model error is first computed assuming no motion. The model correlation is then computed across all channels. TAMER is applied to groups of coils with the largest correlation artifacts to attain the motion and image estimation.[17]
- The initial target voxels are selected by first choosing a root voxel (generally the center of the image). Once the root voxel is chosen, the correlation between the root voxel and all other voxels is determined by attaining the column vector of the correlation matrix
corresponding to the root voxel. The magnitude of the entries in this column vector represent the strength of interaction between the root voxel and all of the other voxels.[16] The root voxel along with the voxels that have the strongest interaction with the root voxel are then chosen to be the initial target voxels.
Note: For each iteration of the TAMER process, the target voxels are selected by shifting the target voxels from the previous iteration perpendicularly to the phase encode direction by a preset amount.
Jumpstart of Motion Parameter Search:
Now the initial guess of the patient's motion is determined by evaluating the data consistency metric over a range of values for each of the
motion parameters and the best value for each parameter is selected to construct the initial guess.[17]
Joint Optimization Reduced Model Search:
We now have the initial target voxels, motion estimate, and coil groupings. The following procedure is now executed.
Let
be the motion trajectory estimate for the
search step. Let
be the max number of iterations.
While
repeat the following:
- Solve for

- Solve for

- Set

- Set

- Set

Advantages:
- TAMER retrospectively corrects for motion, so modifications to the MRI exam procedure isn't necessary.
- TAMER doesn't alter the acquisition procedure, so it can be easily integrated into current clinical MRI scans.
- TAMER significantly reduces computation of the joint optimization model used to estimate motion parameters and image voxels.
Disadvantages:
- Current TAMER implementations have lengthy overall computation times.
- TAMER requires multi-channel data as the motion parameters need additional degrees of freedom which is provided by the multichannel acquisition.[16]
- The TAMER algorithm assumes static coil profiles that don't change with the motion of the patient. This assumption would be an issue for larger motion.
In recent years, neural networks have generated a great deal of interest by outperforming traditional methods[18] on longstanding problems across many fields. Machine learning, and by extension neural networks, have been used in many facets of MRI[19] — for instance, speeding up image reconstruction, or improving reconstruction quality when working with a lack of data.[20][21] Neural networks have also been used in motion artifact correction thanks to their ability to learn visual information from data,[18] as well as infer underlying, latent representations in data.[22]
Network Accelerated Motion Estimation and Reduction (NAMER)[23] is a retrospective motion correction technique that utilizes convolutional neural networks (CNNs), a class of neural networks designed to process and learn from visual information such as images. This is a follow-up from the authors of the TAMER paper titled Network Accelerated Motion Estimation and Reduction (NAMER): Convolutional neural network guided retrospective motion correction using a separable motion model.[23] Similar to TAMER, the paper aims to correct for motion-related artifacts by way of estimating a desired motion-free image and optimizing parameters for a SENSE forward model describing the relationship between raw k-space data and image space while factoring in rigid motion.
A SENSE forward model is used to induce synthetic motion artifacts in raw k-space data, allowing us access to both data with motion artifacts, as well as the ground-truth image without motion artifacts. This is important to the NAMER technique, because it utilizes a Convolutional Neural Network (CNN) to frontload image estimation and guide model parameter estimation. Convolutional Neural Networks leverage convolution kernels to analyze visual imagery. Here, a 27-layer network is used with multiple convolution layers, batch normalization, and ReLU activations. It uses a standard ADAM optimizer.[24]
The CNN attempts to learn the image artifacts from the motion-corrupted input data
. The estimate for these artifacts, denoted as
, are then subtracted from the motion-corrupted input data
in order to produce a best estimate for the motion-free image:[23]
This serves two purposes: First, it allows the CNN to perform backpropagation and update its model weights by using a mean square error loss function comparing the difference between
and the known ground-truth motion-free image.[23] Second, it gives us a good estimate of the motion-free image that gives us a starting point for model parameter optimization.
Using a CNN effectively allows us to bypass the second stage of TAMER by skipping the joint parameter search. This means that we can focus on solely estimating motion parameters
. Because
is really a vector of multiple, independent parameters, we can parallelize our optimization by estimating each parameter separately.[23]
Before, we used the following to optimize both the image
and parameters
at once. Now, we can optimize solely the
values:

On top of this, if a multi-shot acquisition was performed, we can estimate the parameters
for each of
shots separately, and go even further by estimating the parameters
for each line
in each shot
:[23]

This allows us to massively reduce computation time, from around 50 minutes with TAMER[16] to just 7 minutes with NAMER.[23]
The new model parameters are then used in a standard Least Squares optimization problem to reconstruct an image that minimizes the distance between the k-space data, and the result of applying the SENSE forward model
under our new parameter estimate
to our best estimate for the motion-free image:[23]
This process is repeated until a desired number of time steps, or when the change in reconstructed image is sufficiently low. The NAMER technique has shown itself to be very effective in correcting for rigid motion artifacts, and converges much faster than other methods including TAMER.[23] This illustrates the power of deep learning in improving results across myriad fields.
Other more advanced techniques take advantage of generative adversarial networks (GANs) which aim to learn the underlying latent representation of data in order to synthesize new examples that are indistinguishable from real data. Here, two neural networks, a Generator Network and a Discriminator Network, are modelled as agents competing in a game. The Generator Network's goal is to produce synthetic images that are as close as possible to images from the true distribution, while the Discriminator Network's goal is to distinguish generated synthetic images from the true data distribution. Specific to motion artifact correction in MRI, the Generator Network takes in an image with motion artifacts, and outputs an image without motion artifacts. The Discriminator Network then differentiates between the synthesized image and ground truth data. Various studies[25][26] have shown that GANs perform very well in correcting for motion artifacts.
B1 inhomogeneity due to constructive or destructive interference from the permittivity of body tissue can be mitigated using external objects with high dielectric constants and low conductivity.[27] These objects, called radiofrequency/dielectric cushion, can be placed over or near the imaging slice to improve B1 homogeneity. The combination of high dielectric constant and having low conductivity allows the cushion to alter the phase of the RF standing waves and has been shown to reduce signal loss due to B1 inhomogeneity. This correction method was shown to have the greatest effect on sequences that suffer from B1 inhomogeneity artifacts but has no effect on those with B0 inhomogeneity. In one study, the dielectric cushion improved image quality for turbo spin echo‐based T2‐weighted sequences but not on gradient echo‐based T2‐weighted sequences.[27]
B1 inhomogeneity has been successfully mitigated by adjusting coil type and configurations.
One method is as simple as using the same transmit and receive coil to improve homogeneity.[28] This method exploits the tradeoff between B1 dependence and coil sensitivity dependence in FLASH sequences and allows the user to select an optimized flip angle that will reduce B1 dependence. By using the same coil for transmitting and receiving, the receiver coil sensitivity can offset some of the nonuniformities in the transmitter coil, reducing the overall RF inhomogeneity. For anatomical studies using the FLASH sequence that can be performed with one transmit and receive coil, this method can be used to reduce B1 inhomogeneity artifacts. However, the method would not be suitable for exams under strict time constraints, since the user first needs to perform flip angle optimization.
Modifying the field distribution within the RF coils will create a more homogenous field. This can be done by changing the way that the RF coil is driven and excited. One method uses a four-port RF excitation that applies different phase shifts at each port.[29]
By implementing a four-port drive, the power requirement is decreased by 2, SNR is increased by √2, and the overall B1 homogeneity is improved.
Changing the shape of the coils can be used to reduce B1 inhomogeneity artifacts. The use of spiral coil instead of standard coils at higher fields has been shown to eliminate the effects of standing waves in larger samples.[30] This method can be effective when imaging large samples at 4T or higher; however, the proper equipment is required to implement this correction method. Unlike post-processing or sequence modulations, changing the coil shape is not feasible in all scanners.
Another method to correct for B1 inhomogeneity is to employ the infrastructure in place from a parallel system to generate multiple RF pulses of lower flip angles that, together, can result in the same flip angle as that created using a single transmit coil.[31] This method uses the multiple transmit coils from parallel imaging systems to reduce and better mitigate the RF power deposition by relying on shorter RF pulses. One advantage of using parallel excitation with coils is the potential to reduce scan time by combining the multiple short RF pulses and the parallel imaging capabilities to cut scan time. Overall, when this method is used with the correct selection of RF pulses and optimized for a low power deposition, the artifacts from B1 inhomogeneity can be greatly reduced.
Actively modulating the RF transmit power for each slice position compensates for B1 inhomogeneity.[32] This method focuses on inhomogeneity along the axial, or z axis, direction since it is the most dominant in terms of poor homogeneity and least sample dependent.
Prior to inhomogeneity correction, measurement of the B1 profile along the z-axis of the coil is necessary for calibration. Once calibrated, the B1 data can be used for active transmit power modulation. For a specific pulse sequence, the values of each slice position are pre-determined and the appropriate RF transmitter power scale values are read from a look-up table. Then, while the sequence runs, a real time slice counter varies the attenuation of the RF transmit power.[32]
This method is advantageous for reducing artifacts at the source, particularly when accurate flip angle is critical and for increasing signal to noise ratio. Even though this technique can only be used to compensate for the B1 variation along the z-axis in axially acquired images, it's still significant since B1 inhomogeneity is most dominant along this axis.
One way to achieve perfect spin inversion despite B1 inhomogeneity is to use adiabatic pulses. This correction method works by removing the source of the problem and applying pulses that will not generate flip angle errors. Specific sequences that employ adiabatic pulses for increased flip angle uniformity include a slice selective spin-echo pulse, adiabatic 180 degrees inversion RF pulses, and 180 degrees refocusing pulses.
[33]
[34]
[35]
Post-processing techniques correct for intensity inhomogeneity (IIH) of the same tissue over an image domain. This method applies a filter to the data, typically based on a pre-acquired IIH map of the B1 field. If a map of the IIH in the image domain is known, then the IIH can be corrected by division into the pre-acquired image.[36] This popular model in describing the IIH effect is:
[36]
Where
is the measured intensity,
is the true intensity,
is the IIH effect and ξ is the noise.
This method is advantageous because it can be conducted offline, i.e., the patient is not required to be in the scanner. Therefore, correction time is not an issue. However, this technique does not improve SNR and contrast of the image because it only utilizes information that was already acquired. Since the B1 field was not homogeneous when the images were acquired, the flip angles and subsequent acquired signals are imprecise.
The effects of an AI-based image post-scan processing denoising system in brain scans have been demonstrated to be effective in higher image quality and morphometric analysis. Post-scan image processing systems enable noise reduction while retaining contrast. The subsequent image enhancement can be processed with shorter scan times for higher throughput and plausible earlier detection.[37][38]
To correct RF inhomogeneity artifacts using post-processing corrections, there are a few methods to map the B1 field. Here is a short description of some common techniques.
A common and robust method that uses the results from two images acquired at flip angles of
and
.[39] The B1 map is then constructed using a ratio of the signal intensities of these two images. This method, although robust and accurate, requires a long TR and long scan time; therefore, the method is not optimal for imaging regions susceptible to motion.
Similar to the double angle method, the phase map method uses two images; however, this method relies on the accrual of phase to determine the real flip angle of each spin.[40] After applying a 180 degree rotation about the x-axis followed by a 90 degree rotation about the y axis, the resulting phase is then used to map the B1 field. By obtaining two images and subtracting one from the other, any phase from B0 inhomogeneity can be removed and only phase accumulated by the inhomogeneous RF field will be mapped. This method can be used to map 3D volumes but requires a long scan time, making it unsuitable for some scanning requirements.
This method is a multislice B1 mapping technique. DREAM can be used to acquire a 2D B1 map in 130 ms, making it insensitive to motion and feasible for scans that require breath holds, such as cardiac imaging.[41] The short acquisition also reduces effects of chemical shifts and susceptibility. Additionally, this method requires low SAR rates. Although not as accurate as the double angle method, DREAM achieves reliable B1 mapping during short acquisitions. T