matlabefd asked . 2022-04-09

Extract data from text file

I have this 'sample data' data not in the right form. I need to read this text file and extract the data and tabulate it in the order as shown in figure below. I am not sure how can I do it. 

=========== REPORT TEXT ==========







RADIOLOGICAL REPORT
Patient AAA 
Date of Birth 17/10/2020
Request Number 2020100000000 
RN Number 123456789123 / 123456789 
Referring Doctor 
Examination Date 09/10/2020 





MRI - Heart / Cardiac ( Contrast )


Indication
blablabla

Medication/Contrast
blablabla

Complications
Nil

Sequences
GRE 2C, 4C, SA - CINE
GRE LVOT
GRE 2C, 4C, SA - 
MDE

Findings
The LV longitudinal and circumferential function is reduced.
The right ventricular contractility is normal. The RVOT is normal.
No mitral regurgitation.
Aortic valve appears normal with 3 cusps seen, mobile and not calcified. No tricuspid regurgitation.
No thrombus seen. Mild aortic regurgitation.
Normal pericardium. No pericardial effusion.
The visualized liver and thoracic spine are normal.





Left Ventricular Segments No Wall Motion Perfusion At Rest Perfusion At Stress Delayed Gadolinium 
Enhancement 
Basal Anterior 1 Hypokinetic Nil Nil 50% 
Basal Anteroseptal 2 Dyskinetic Present Present Full thickness 
Basal Inferoseptal 3 Hypokinetic Present Present 50% 
Basal Inferior 4 Hypokinetic Nil Present 50% 
Basal Inferolateral 5 Normal Nil Nil Nil 
Basal Anterolateral 6 Normal Nil Nil Nil 
Mid Anterior 7 Hypokinetic Nil Nil <50% 
Mid Anteroseptal 8 Dyskinetic Present Present Full thickness 
Mid Inferoseptal 9 Akinetic Present Present Full thickness 
Mid Inferior 10 Hypokinetic Nil Present <50% 
Mid Inferolateral 11 Normal Nil Nil Nil 
Mid Anterolateral 12 Normal Nil Nil <50% 
Apical Anterior 13 Akinetic Nil Nil 50% 
Apical Septal 14 Akinetic Nil Nil < 50% 
Apical Inferior 15 Akinetic Nil Nil > 50% 
Apical Lateral 16 Hypokinetic Nil Nil Full thickness 
Apex 17 Akinetic Nil Nil Full thickness 

LVESV is 138 ml. 
LVEDV is 172 ml. 
LVEF is 20 %.
The anterior septal wall thickness in is 0.8 cm. (Diastole)
The posterior lateral wall thickness is 0.9 cm. (Diastole)


Impression
1. Poor LV systolic function.
2. Infarcted and nonviable LAD territory. 
3. Some scarring in the RCA territory. However, inferior wall myocardium has to be considered viable.
4. Stress induced ischaemia of RCA territory.
5. Mild AR.


Computer generated report. No signature required.




========== REPORT TEXT END =========

 

Really appreciate it if someone can help to guide me. Thank you.

text file , extract data , data extraction , AI, Data Science, and Statistics , Text Analytics Too

Expert Answer

Prashant Kumar answered . 2024-05-17 21:36:12

That is a very badly formatted file. For example, the field delimiters are space characters and space characters also occur within the fields (without any text delimiters to group the fields together). There is no robust general solution for parsing such a poorly formatted file, altough in some limited cases (such as with prior knowledge of the field contents) you might be able to parse it but parsing such files will always be fragile. On that basis I assumed that the fields contain only the text in the number and types that you have shown, i.e. each line contains exactly:
  1. 1 or 2 words (starts with 'Basal' or 'Mid' or 'Apical', or constitutes 'Apex')
  2. 1 number
  3. 1 word
  4. ('Nil' or 'Present')
  5. ('Nil' or 'Present')
  6. ('Nil' or 'Full thickness' or a percentage)
This matches all of the seventeen rows in your example data file:
 
=========== REPORT TEXT ==========







RADIOLOGICAL REPORT
Patient AAA 
Date of Birth 17/10/2020
Request Number 2020100000000 
RN Number 123456789123 / 123456789 
Referring Doctor 
Examination Date 09/10/2020 





MRI - Heart / Cardiac ( Contrast )


Indication
blablabla

Medication/Contrast
blablabla

Complications
Nil

Sequences
GRE 2C, 4C, SA - CINE
GRE LVOT
GRE 2C, 4C, SA - 
MDE

Findings
The LV longitudinal and circumferential function is reduced.
The right ventricular contractility is normal. The RVOT is normal.
No mitral regurgitation.
Aortic valve appears normal with 3 cusps seen, mobile and not calcified. No tricuspid regurgitation.
No thrombus seen. Mild aortic regurgitation.
Normal pericardium. No pericardial effusion.
The visualized liver and thoracic spine are normal.





Left Ventricular Segments No Wall Motion Perfusion At Rest Perfusion At Stress Delayed Gadolinium 
Enhancement 
Basal Anterior 1 Hypokinetic Nil Nil 50% 
Basal Anteroseptal 2 Dyskinetic Present Present Full thickness 
Basal Inferoseptal 3 Hypokinetic Present Present 50% 
Basal Inferior 4 Hypokinetic Nil Present 50% 
Basal Inferolateral 5 Normal Nil Nil Nil 
Basal Anterolateral 6 Normal Nil Nil Nil 
Mid Anterior 7 Hypokinetic Nil Nil <50% 
Mid Anteroseptal 8 Dyskinetic Present Present Full thickness 
Mid Inferoseptal 9 Akinetic Present Present Full thickness 
Mid Inferior 10 Hypokinetic Nil Present <50% 
Mid Inferolateral 11 Normal Nil Nil Nil 
Mid Anterolateral 12 Normal Nil Nil <50% 
Apical Anterior 13 Akinetic Nil Nil 50% 
Apical Septal 14 Akinetic Nil Nil < 50% 
Apical Inferior 15 Akinetic Nil Nil > 50% 
Apical Lateral 16 Hypokinetic Nil Nil Full thickness 
Apex 17 Akinetic Nil Nil Full thickness 

LVESV is 138 ml. 
LVEDV is 172 ml. 
LVEF is 20 %.
The anterior septal wall thickness in is 0.8 cm. (Diastole)
The posterior lateral wall thickness is 0.9 cm. (Diastole)


Impression
1. Poor LV systolic function.
2. Infarcted and nonviable LAD territory. 
3. Some scarring in the RCA territory. However, inferior wall myocardium has to be considered viable.
4. Stress induced ischaemia of RCA territory.
5. Mild AR.


Computer generated report. No signature required.




========== REPORT TEXT END =========
 
str = fileread('sample data.txt');
rgx = ['(Apex|(Basal|Mid|Apical)\s+[A-Z][a-z]+)\s+(\d+)\s+([A-Z][a-z]+)',...
	'\s+(Nil|Present)\s+(Nil|Present)\s+(Nil|Full thickness|([<>]\s?)?\d+\%)'];
tkn = regexpi(str,rgx,'tokens');
tkn = vertcat(tkn{:})

Giving:

tkn = 
    'Basal Anterior'         '1'     'Hypokinetic'    'Nil'        'Nil'        '50%'           
    'Basal Anteroseptal'     '2'     'Dyskinetic'     'Present'    'Present'    'Full thickness'
    'Basal Inferoseptal'     '3'     'Hypokinetic'    'Present'    'Present'    '50%'           
    'Basal Inferior'         '4'     'Hypokinetic'    'Nil'        'Present'    '50%'           
    'Basal Inferolateral'    '5'     'Normal'         'Nil'        'Nil'        'Nil'           
    'Basal Anterolateral'    '6'     'Normal'         'Nil'        'Nil'        'Nil'           
    'Mid Anterior'           '7'     'Hypokinetic'    'Nil'        'Nil'        '<50%'          
    'Mid Anteroseptal'       '8'     'Dyskinetic'     'Present'    'Present'    'Full thickness'
    'Mid Inferoseptal'       '9'     'Akinetic'       'Present'    'Present'    'Full thickness'
    'Mid Inferior'           '10'    'Hypokinetic'    'Nil'        'Present'    '<50%'          
    'Mid Inferolateral'      '11'    'Normal'         'Nil'        'Nil'        'Nil'           
    'Mid Anterolateral'      '12'    'Normal'         'Nil'        'Nil'        '<50%'          
    'Apical Anterior'        '13'    'Akinetic'       'Nil'        'Nil'        '50%'           
    'Apical Septal'          '14'    'Akinetic'       'Nil'        'Nil'        '< 50%'         
    'Apical Inferior'        '15'    'Akinetic'       'Nil'        'Nil'        '> 50%'         
    'Apical Lateral'         '16'    'Hypokinetic'    'Nil'        'Nil'        'Full thickness'
    'Apex'                   '17'    'Akinetic'       'Nil'        'Nil'        'Full thickness'
>> size(tkn)
ans =
    17     6
>>     

Clearly you can put that into a table if you really want to:

>> hdr = {'LeftVentricularSegments','No','WallMotion','PerfusionAtRest','PerfusionAtStress','DelayedGadoliniumEnhancement'};
>> T = cell2table(tkn,'VariableNames',hdr)
T = 
    LeftVentricularSegments     No      WallMotion      PerfusionAtRest    PerfusionAtStress    DelayedGadoliniumEnhancement
    _______________________    ____    _____________    _______________    _________________    ____________________________
    'Basal Anterior'           '1'     'Hypokinetic'    'Nil'              'Nil'                '50%'                       
    'Basal Anteroseptal'       '2'     'Dyskinetic'     'Present'          'Present'            'Full thickness'            
    'Basal Inferoseptal'       '3'     'Hypokinetic'    'Present'          'Present'            '50%'                       
    'Basal Inferior'           '4'     'Hypokinetic'    'Nil'              'Present'            '50%'                       
    'Basal Inferolateral'      '5'     'Normal'         'Nil'              'Nil'                'Nil'                       
    'Basal Anterolateral'      '6'     'Normal'         'Nil'              'Nil'                'Nil'                       
    'Mid Anterior'             '7'     'Hypokinetic'    'Nil'              'Nil'                '<50%'                      
    'Mid Anteroseptal'         '8'     'Dyskinetic'     'Present'          'Present'            'Full thickness'            
    'Mid Inferoseptal'         '9'     'Akinetic'       'Present'          'Present'            'Full thickness'            
    'Mid Inferior'             '10'    'Hypokinetic'    'Nil'              'Present'            '<50%'                      
    'Mid Inferolateral'        '11'    'Normal'         'Nil'              'Nil'                'Nil'                       
    'Mid Anterolateral'        '12'    'Normal'         'Nil'              'Nil'                '<50%'                      
    'Apical Anterior'          '13'    'Akinetic'       'Nil'              'Nil'                '50%'                       
    'Apical Septal'            '14'    'Akinetic'       'Nil'              'Nil'                '< 50%'                     
    'Apical Inferior'          '15'    'Akinetic'       'Nil'              'Nil'                '> 50%'                     
    'Apical Lateral'           '16'    'Hypokinetic'    'Nil'              'Nil'                'Full thickness'            
    'Apex'                     '17'    'Akinetic'       'Nil'              'Nil'                'Full thickness' 

 


Not satisfied with the answer ?? ASK NOW

Frequently Asked Questions

MATLAB offers tools for real-time AI applications, including Simulink for modeling and simulation. It can be used for developing algorithms and control systems for autonomous vehicles, robots, and other real-time AI systems.

MATLAB Online™ provides access to MATLAB® from your web browser. With MATLAB Online, your files are stored on MATLAB Drive™ and are available wherever you go. MATLAB Drive Connector synchronizes your files between your computers and MATLAB Online, providing offline access and eliminating the need to manually upload or download files. You can also run your files from the convenience of your smartphone or tablet by connecting to MathWorks® Cloud through the MATLAB Mobile™ app.

Yes, MATLAB provides tools and frameworks for deep learning, including the Deep Learning Toolbox. You can use MATLAB for tasks like building and training neural networks, image classification, and natural language processing.

MATLAB and Python are both popular choices for AI development. MATLAB is known for its ease of use in mathematical computations and its extensive toolbox for AI and machine learning. Python, on the other hand, has a vast ecosystem of libraries like TensorFlow and PyTorch. The choice depends on your preferences and project requirements.

You can find support, discussion forums, and a community of MATLAB users on the MATLAB website, Matlansolutions forums, and other AI-related online communities. Remember that MATLAB's capabilities in AI and machine learning continue to evolve, so staying updated with the latest features and resources is essential for effective AI development using MATLAB.

Without any hesitation the answer to this question is NO. The service we offer is 100% legal, legitimate and won't make you a cheater. Read and discover exactly what an essay writing service is and how when used correctly, is a valuable teaching aid and no more akin to cheating than a tutor's 'model essay' or the many published essay guides available from your local book shop. You should use the work as a reference and should not hand over the exact copy of it.

Matlabsolutions.com provides guaranteed satisfaction with a commitment to complete the work within time. Combined with our meticulous work ethics and extensive domain experience, We are the ideal partner for all your homework/assignment needs. We pledge to provide 24*7 support to dissolve all your academic doubts. We are composed of 300+ esteemed Matlab and other experts who have been empanelled after extensive research and quality check.

Matlabsolutions.com provides undivided attention to each Matlab assignment order with a methodical approach to solution. Our network span is not restricted to US, UK and Australia rather extends to countries like Singapore, Canada and UAE. Our Matlab assignment help services include Image Processing Assignments, Electrical Engineering Assignments, Matlab homework help, Matlab Research Paper help, Matlab Simulink help. Get your work done at the best price in industry.