Identify the type of data in the given table.
| Scenario | Type of Data |
|---|---|
| Recording of telephonic conversation | |
| Images of car and bus | |
| Clippings of a movie | |
| Newspaper content | |
| Clippings of TV news channel | |
| Radio news | |
| Content of an article |
Answer
| Scenario | Type of Data |
|---|---|
| Recording of telephonic conversation | Audio data |
| Images of car and bus | Image data |
| Clippings of a movie | Video data |
| Newspaper content | Text data |
| Clippings of TV news channel | Video data |
| Radio news | Audio data |
| Content of an article | Text data |
From the following sentences, identify the sentences that depict data and information. Write them under the proper heading.
- The temperature is 25 degrees Celsius.
- A patient's blood pressure is 120/80 mmHg.
- The patient's blood pressure is 120/80 mmHg, which is in the normal range.
- The stock price of the Company XYZ has increased.
- The city received a significant amount of rainfall, which may lead to improved water reserves and agricultural productivity.
- The customer rating for the new phone is good, indicating high satisfaction and desirability.
- Number of website visitors are 10,000.
Answer
| Data | Information |
|---|---|
| The temperature is 25 degrees Celsius. | The patient's blood pressure is 120/80 mmHg, which is in the normal range. |
| A patient's blood pressure is 120/80 mmHg. | The stock price of the Company XYZ has increased. |
| Number of website visitors are 10,000. | The city received a significant amount of rainfall, which may lead to improved water reserves and agricultural productivity. |
| The customer rating for the new phone is good, indicating high satisfaction and desirability. |
Write the names of any two online surveying tools.
Answer
The names of two online surveying tools are:
- Web forms (Google Form)
- Emails
Information is the processed form of data.
Answer
True
Reason — Information is the processed form of data.
Video data is not useful for developing AI applications.
Answer
False
Reason — Data available in the form of video is considered as video data, and video data is used in developing AI applications such as online lectures, videos captured through mobile phones, and YouTube videos.
Data from X (previously known as Twitter) can be collected with the help of an API.
Answer
True
Reason — X (previously known as Twitter) API helps in extracting tweets for data analysis purpose.
Kaggle is a platform which provides plenty of datasets for machine learning purpose.
Answer
True
Reason — Platforms like Kaggle offer a huge number of datasets for different topics and provide datasets for machine learning purpose.
Storage of video data take more space in memory.
Answer
True
Reason — Video data contains images and audio together, therefore storage of video data takes more space in memory.
Collection of some digits is an example of information.
Answer
False
Reason — Data refers to the raw facts that are generated by almost all activities of our daily routine. A collection of some digits is raw data and does not have meaning or context, therefore it is not information.
Data should be relevant for developing a successful AI system.
Answer
True
Reason — To design an accurate AI system, relevant data must be identified, collected, and explored.
Data is used to identify patterns.
Answer
True
Reason — To learn patterns and relationships, AI models, such as deep learning neural networks, require extensive training with the help of data.
AI is not useful for medical domain.
Answer
False
Reason — AI models can analyse medical imaging data, such as X-rays, CT scans, and MRIs, to help in disease detection and diagnosis in the healthcare industry.
Survey is a method to collect desired data.
Answer
True
Reason — Surveys can be done online or offline using web forms or emails. Users are asked a number of important questions, and their replies are recorded as text, audio, or video.
Source(s) of data can be ............... .
- Mobile phone
- Devices with sensors
- Data log
- All of these
Answer
All of these
Reason — We are living in a digital world where electronic gadgets like mobile phones, sensors, and machines are generating and collecting a huge amount of data.
Information can be defined as ............... .
- Raw facts
- Abrupt sentences
- Processed data
- Collection of dates only
Answer
Processed data
Reason — Information is the processed form of data.
The primary purpose of data in AI is ............... .
- To train AI models
- To generate revenue
- To develop algorithms
- None of these
Answer
To train AI models
Reason — The primary purpose of data in AI is to train AI models. AI algorithms use huge amount of data to learn and make predictions.
Phone number of a student is an example of ............... .
- Data
- Information
- Facts
- None of these
Answer
Information
Reason — A student’s phone number is meaningful data related to a specific person and can be used for communication. Therefore, it is considered information.
List of student names of a class is an example of ............... .
- Data
- Information
- Educational data
- None of these
Answer
Data
Reason — A list of student names of a class does not have meaning or context and therefore it is considered as data.
Data can be collected through ............... .
- Newspapers
- Audio files
- Images
- All of these
Answer
All of these
Reason — Data is represented in many forms such as text, image, audio, and video, and can be collected through newspapers, audio files, and images.
Which technique would be suitable to process the data from an article?
- Computer vision
- Natural Language Processing
- Speech recognition
- None of these
Answer
Natural Language Processing
Reason — Natural Language Processing would be suitable to process the data from an article. Large text data collections, such as books and articles, are utilised to train AI models to understand language and context.
Which platform(s) use(s) recommendation system?
- YouTube
- Netflix
- Amazon
- All of these
Answer
All of these
Reason — Platforms such as Netflix, Amazon, Spotify, YouTube, etc., use AI-based recommendation systems to recommend personalised content to users.
Which of the following is not used in data visualisation?
- Images
- Graphs
- Charts
- Text
Answer
Text
Reason — Data visualisation allows the graphical representation of data in the form of charts, graphs, or diagrams.
Data continuity means ............... .
- To get huge amount of data
- To get image data
- To get new and updated data
- None of these
Answer
To get new and updated data
Reason — Data continuity means to get new and updated data.
Fill in the blanks:
- ................ is used to collect data from a large number of people.
- ................ helps in getting useful information from websites and other online tools.
- ................ Machine Learning Repository provides many datasets.
- ................ and ................ are examples of descriptive statistics.
Answer
- Crowdsourcing is used to collect data from a large number of people.
- Web scraping helps in getting useful information from websites and other online tools.
- UCI Machine Learning Repository provides many datasets.
- Mean and median are examples of descriptive statistics.
How is data important in AI-based medical imaging systems?
Answer
Data is important in AI-based medical imaging systems because AI models can analyse medical imaging data, such as X-rays, CT scans, and MRIs, to help in disease detection and diagnosis in the healthcare industry. By training on labelled medical images, AI algorithms can learn to recognise patterns associated with various conditions and help radiologists and physicians in making more precise decisions.
How do some APIs help collect relevant data? Explain.
Answer
Some APIs help collect relevant data by allowing developers to get and use specific data from online platforms. APIs can be used to get data from social media sites, weather services, banking databases, and public data sources. For example, X (previously known as Twitter) API helps in extracting tweets for data analysis purpose.
How can data be collected through crowdsourcing? Explain.
Answer
Data can be collected through crowdsourcing by using crowdsourcing tools that let people from all over the world work together to acquire data. Tasks and instructions are given for collecting data, and people who are part of the crowd help in collecting data. This is a good method to collect a huge amount of data.
What do you understand by data? Explain its various types.
Answer
Data refers to the raw facts that are generated by almost all activities of our daily routine. Data is everything that we can get by reading, writing, speaking, and seeing. It can be made up of numbers, alphabets, symbols, or a mix of any or all of these elements. The collection of data or facts is considered as a dataset.
Various types of data are as follows:
- Text Data: Data available in the form of text is considered as text data. Examples of text data are research articles, any information available on websites in text form, information available in books, newspapers, etc.
- Image Data: Pictures are images that can be processed as image data. Examples of image data are the images related to a specific subject.
- Audio Data: This type of data contains only audio files. Examples of audio data are audio clips of songs, recordings of speeches, recordings of lectures, news, etc.
- Video Data: Data available in the form of video is considered as video data. Examples of video data are movies, online lectures, videos captured through mobile phones, YouTube videos, etc.
What are the various methods of collecting relevant data in AI project development?
Answer
The various methods of collecting relevant data in AI project development are as follows:
- Survey: Surveys can be done online or offline using web forms or emails. Users are asked a number of important questions, and their replies are recorded as text, audio, or video.
- Observations: Data can be collected through observations by keeping track of people, things, or events in their natural environment and recording the observations.
- Data Logging and Sensor Deployment: Sensors or data loggers are used to collect useful environmental data for AI projects that use the Internet of Things (IoT) or sensor-based data collection.
- Web Scraping: Web scraping helps in getting useful information from websites and other online tools by crawling web pages and extracting structured or unstructured data.
- Application Programming Interfaces (APIs): APIs allow developers to get and use specific data from online platforms such as social media sites, weather services, banking databases, and public data sources.
- Collaborations and Data Partnerships: Data can be collected by making agreements to share data or by building partnerships with organisations that possess proprietary or specialised datasets.
- Crowdsourcing: Crowdsourcing tools let people from all over the world work together to acquire data by performing assigned tasks and following given instructions.
- Existing Datasets: Data can be collected from publicly available datasets provided by platforms such as Kaggle, UCI Machine Learning Repository, government data portals, and research institutions.
What are the various applications of data in AI? Explain.
Answer
The various applications of data in AI are as follows:
- Natural Language Processing (NLP): NLP applications rely extensively on data. Large text data collections, such as books, articles, and social media postings, are utilised to train AI models to understand language and context. This enables the AI system to generate accurate translations, understand sentiments, and answer queries based on the data’s knowledge.
- Autonomous Vehicles: Data is essential for the development of autonomous vehicles. Sensors and cameras collect a huge amount of data about the surrounding environment, such as road conditions, other vehicles, and pedestrians. This data is used to train AI models to perceive and understand their surroundings, make decisions in real time, and drive safely.
- Recommendations: Data plays a vital role in designing AI-based recommendation systems. Platforms such as Netflix, Amazon, Spotify, and YouTube use AI-based recommendation systems to recommend personalised content to users based on viewing history, purchase behaviour, and preferences.
- Fraud Detection: AI models are used to analyse a huge amount of transactional data in fraud detection systems. By analysing data patterns and anomalies, these models help in identifying potentially fraudulent activities and generating alerts.
- Medical Imaging: AI models can analyse medical imaging data, such as X-rays, CT scans, and MRIs, to help in disease detection and diagnosis in the healthcare industry. By training on labelled medical images, AI algorithms learn to recognise patterns associated with various conditions.
Suppose you want to collect data on your classmate's career choices. Write down the steps of identifying and collecting relevant data for the same.
Answer
To collect data on classmate's career choices, the steps of identifying and collecting relevant data are as follows:
Identifying Relevant Data
- Defining Data Requirement: The first step is to define the data requirements by understanding the problem to be solved. In this case, data related to classmates’ career choices is required.
- Acquiring Domain Expertise: Discussion with teachers or career counsellors who have in-depth understanding of career options can help in identifying relevant data and deciding what information is important.
- Reviewing Literature: Reviewing articles, reports, or studies related to career choices can help in understanding what type of data should be collected.
Collecting Relevant Data
- Survey: A survey can be conducted by asking classmates questions through questionnaires, online forms, or interviews. Their responses can then be recorded and used as data.
Suppose you have a collection of X-ray images for a disease. Which AI technique would be suitable for identifying the disease through these images and why?
Answer
The most suitable AI technique for identifying a disease from X-ray images is Machine Learning used with image recognition.
This technique is suitable because:
- X-ray images are a type of unstructured data, which means they cannot be analysed using simple rules or formulas.
- Machine Learning allows a computer to learn patterns from examples. By showing the AI many X-ray images of healthy patients and diseased patients, it learns how they look different.
- The AI system compares shapes, shadows, and patterns in the X-ray images to find signs of disease.
- Once trained, the AI can identify whether a new X-ray image shows the disease or not.