Skip to content

Latest commit

 

History

History
332 lines (222 loc) · 19.2 KB

File metadata and controls

332 lines (222 loc) · 19.2 KB

GCP

What is Google Cloud Platform (GCP)?

Google Cloud Platform is a suite of cloud computing services provided by Google, offering a wide range of infrastructure and platform services for building, deploying, and managing applications and data.

Table of Contents

What are the key components of GCP?

GCP comprises various services such as compute, storage, networking, databases, big data, machine learning, and management tools.

Table of Contents

What are the advantages of using GCP for data engineering?

GCP provides scalability, flexibility, high-performance computing, data storage options, managed services, security, and integration with other Google services.

Table of Contents

How does GCP store data?

GCP offers various storage options, including Cloud Storage for object storage, Cloud SQL for managed relational databases, Bigtable for NoSQL wide-column store, and Cloud Firestore for document-based NoSQL data.

Table of Contents

What is Google BigQuery?

BigQuery is a fully-managed, serverless data warehouse provided by GCP. It enables you to analyze massive datasets using SQL queries with high speed and scalability.

Table of Contents

How can you move data into GCP for analysis?

You can use services like Cloud Storage, Cloud Pub/Sub, Data Transfer Service, or third-party tools to import data into GCP for analysis.

Table of Contents

Explain the concept of dataflow in GCP.

Dataflow is a serverless data processing service that allows you to build, deploy, and execute data processing pipelines in GCP. It supports both batch and stream processing.

Table of Contents

What is Cloud Dataproc?

Cloud Dataproc is a fully-managed service in GCP for running Apache Spark and Apache Hadoop clusters. It provides a scalable and cost-effective way to process big data workloads.

Table of Contents

How does GCP ensure data security?

GCP employs various security measures, including encryption at rest and in transit, identity and access management, network security, and DDoS protection.

Table of Contents

What is Cloud Pub/Sub?

Cloud Pub/Sub is a messaging service in GCP that enables asynchronous communication between independent applications. It allows you to build scalable event-driven architectures.

Table of Contents

Explain the role of Cloud Storage in GCP.

Cloud Storage is an object storage service provided by GCP. It offers scalable, durable, and highly available storage for objects of any size. It can be used for storing files, backups, and serving static content.

Table of Contents

What is Cloud Composer in GCP?

Cloud Composer is a managed workflow orchestration service based on Apache Airflow. It allows you to create, schedule, and monitor workflows composed of tasks, such as data processing or ETL pipelines.

Table of Contents

How can you monitor and analyze GCP resources and services?

GCP provides various monitoring and logging tools, such as Cloud Monitoring, Cloud Logging, and Stackdriver, which allow you to collect, analyze, and visualize metrics, logs, and traces.

Table of Contents

What is Cloud Spanner in GCP?

Cloud Spanner is a horizontally scalable, globally distributed relational database service provided by GCP. It combines the benefits of relational databases with the scalability of NoSQL.

Table of Contents

How can you ensure high availability and fault tolerance in GCP?

GCP offers regional and multi-regional options for deploying services across multiple zones and regions. Load balancing, auto-scaling, and managed instance groups help ensure high availability and fault tolerance.

Table of Contents

Explain the use of Cloud Dataflow in GCP.

Cloud Dataflow is used for building data pipelines that transform and process data in parallel. It supports both batch and stream processing and can be integrated with other GCP services like BigQuery.

Table of Contents

What is Cloud SQL and how is it used in GCP?

Cloud SQL is a managed database service in GCP that provides MySQL, PostgreSQL, and SQL Server databases. It simplifies database management tasks, offers high availability, and scales seamlessly.

Table of Contents

Explain the use of Cloud Datastore in GCP.

Cloud Datastore is a NoSQL document database provided by GCP. It is schemaless, automatically scales, and is suitable for applications that require low-latency reads and writes.

Table of Contents

How can you securely transfer data to and from GCP?

GCP supports secure data transfer over encrypted connections using protocols like HTTPS and SSL/TLS. It also provides Cloud VPN and Dedicated Interconnect for private network connections.

Table of Contents

What is the purpose of Cloud Machine Learning Engine in GCP?

Cloud Machine Learning Engine is a managed service in GCP that allows you to build, train, and deploy machine learning models at scale. It supports popular frameworks like TensorFlow and scikit-learn.

Table of Contents

How does GCP handle disaster recovery?

GCP provides disaster recovery solutions through features like regional and multi-regional deployments, automated backups, and replication across zones and regions.

Table of Contents

Explain the concept of VPC (Virtual Private Cloud) in GCP.

VPC allows you to create a virtual private network within GCP, providing isolation and control over network resources. It enables you to define IP ranges, subnets, firewall rules, and routing tables.

Table of Contents

What is the purpose of Cloud Identity and Access Management (IAM) in GCP?

IAM allows you to manage access control and permissions for GCP resources. It helps you define who has access to which resources and what actions they can perform.

Table of Contents

How does GCP handle data redundancy and backup?

GCP offers redundancy and backup options such as regional and multi-regional storage, snapshot-based backups, and data replication across multiple zones and regions.

Table of Contents

Explain the use of Cloud Composer in GCP.

Cloud Composer is used for orchestrating workflows and managing dependencies between tasks. It allows you to create, schedule, and monitor complex data pipelines and ETL workflows.

Table of Contents

What is the purpose of Cloud Load Balancing in GCP?

Cloud Load Balancing distributes incoming traffic across multiple instances or backend services to ensure high availability, scalability, and fault tolerance.

Table of Contents

How does GCP handle data encryption?

GCP supports encryption at rest and in transit. It provides key management services like Cloud Key Management Service (KMS) for managing encryption keys and Cloud HSM for hardware security modules.

Table of Contents

What is Memorystore in GCP?

Memorystore is a fully managed in-memory data store service provided by GCP. It supports Redis, offering high-performance caching for applications.

Table of Contents

Explain the use of Cloud CDN (Content Delivery Network) in GCP.

Cloud CDN is a global content delivery network that caches and delivers content from GCP to users with low latency and high bandwidth. It improves the performance of web applications and reduces serving costs.

Table of Contents

How can you monitor and analyze GCP costs?

GCP provides tools like Cloud Billing, Cost Management, and budgets to monitor, analyze, and optimize costs associated with your GCP resources and services.

Table of Contents

Explain the use of Cloud AutoML in GCP.

Cloud AutoML is a suite of machine learning products in GCP that allows you to train custom machine learning models with minimal coding. It supports vision, natural language, and tabular data.

Table of Contents

What is the purpose of Cloud Deployment Manager in GCP?

Cloud Deployment Manager is an infrastructure deployment service in GCP. It allows you to define and manage resources as code, making it easier to create and maintain infrastructure configurations.

Table of Contents

How does GCP ensure compliance and data privacy?

GCP complies with various industry standards and regulations. It offers features like data location controls, access transparency, and compliance certifications to meet data privacy and compliance requirements.

Table of Contents

Explain the use of Cloud Machine Learning Engine in GCP.

Cloud Machine Learning Engine is a managed service that allows you to train, deploy, and serve machine learning models at scale. It integrates with other GCP services like BigQuery and Cloud Storage.

Table of Contents

What is the purpose of Cloud Monitoring in GCP?

Cloud Monitoring is a monitoring and observability service provided by GCP. It allows you to collect and analyze metrics, create dashboards, set up alerts, and visualize the health and performance of your resources.

How can you optimize data ingestion in GCP?

GCP provides services like Cloud Pub/Sub and Cloud Dataflow for efficient and scalable data ingestion. You can also optimize ingestion by using partitioning, batching, and compression techniques.

Table of Contents

What is Cloud Memorystore for Redis in GCP?

Cloud Memorystore for Redis is a fully-managed in-memory data store service provided by GCP. It offers high-performance, scalable Redis instances for caching and data storage.

Table of Contents

Explain the use of Cloud Datalab in GCP.

Cloud Datalab is an interactive data exploration and visualization tool in GCP. It provides a Jupyter notebook interface for analyzing and visualizing data using Python, SQL, and BigQuery.

Table of Contents

What is the purpose of Cloud DNS in GCP?

Cloud DNS is a scalable and reliable domain name system (DNS) service provided by GCP. It allows you to manage and resolve domain names to IP addresses.

Table of Contents

How can you ensure data integrity in GCP?

GCP provides various mechanisms for ensuring data integrity, including checksums, encryption, redundancy, and regular backups.

Table of Contents

Explain the use of Cloud Security Command Center in GCP.

Cloud Security Command Center is a security and risk management service provided by GCP. It helps you discover, monitor, and manage security vulnerabilities and threats across your GCP resources.

Table of Contents

What is the purpose of Cloud SQL Proxy in GCP?

Cloud SQL Proxy is a secure client-side proxy for connecting to Cloud SQL instances from external applications or local development environments without exposing the database to the internet.

Table of Contents

Explain the use of Cloud Run in GCP.

Cloud Run is a fully managed serverless execution environment in GCP. It allows you to run containers without worrying about infrastructure provisioning or scaling.

Table of Contents

What is the purpose of Cloud Security Scanner in GCP?

Cloud Security Scanner is a web application vulnerability scanning service provided by GCP. It helps you identify security vulnerabilities in your web applications by crawling your website.

Table of Contents

What is the purpose of Cloud Composer in GCP?

Cloud Composer is a fully managed workflow orchestration service based on Apache Airflow. It allows you to create, schedule, and monitor complex data pipelines and ETL workflows.

Table of Contents

How does GCP handle data archiving and long-term storage?

GCP offers services like Cloud Storage Coldline and Archive for long-term data storage, providing cost-effective options for archiving data with high durability and availability.

Table of Contents

How can you manage data access and permissions in GCP?

GCP provides Cloud Identity and Access Management (IAM) for managing access control and permissions to GCP resources. IAM allows you to define fine-grained access policies and grant access to specific users or groups.

Table of Contents

How can you optimize data processing in GCP?

GCP offers optimization techniques like data partitioning, distributed processing, caching, and using appropriate data storage and processing services to achieve efficient and scalable data processing.

Table of Contents

What is the purpose of Cloud NAT in GCP?

Cloud NAT is a service in GCP that allows your virtual machine instances to send outbound traffic to the internet without exposing their IP addresses. It provides network address translation capabilities.

Table of Contents

Explain the use of Cloud IoT Core in GCP.

Cloud IoT Core is a fully managed service in GCP for securely connecting, managing, and ingesting data from IoT devices at scale. It provides device management, data ingestion, and integration with other GCP services.

Table of Contents

How does GCP handle data replication and synchronization?

GCP offers data replication and synchronization capabilities through services like Cloud Storage, Cloud Datastore, Cloud Spanner, and database-specific replication features.

Table of Contents

Explain the use of Cloud VPN in GCP.

Cloud VPN is a service in GCP that provides a secure and encrypted connection between your on-premises network and GCP Virtual Private Cloud (VPC) network.

Table of Contents

What is the purpose of Cloud Functions in GCP?

Cloud Functions is a serverless compute service in GCP that allows you to run event-driven code in response to events from various GCP services or HTTP requests.

Table of Contents

How can you monitor and troubleshoot performance issues in GCP?

GCP provides monitoring tools like Cloud Monitoring and logging tools like Cloud Logging and Stackdriver, which allow you to monitor and troubleshoot performance issues by collecting and analyzing metrics, logs, and traces.

Table of Contents

Explain the use of Cloud Key Management Service (KMS) in GCP.

Cloud KMS is a managed service in GCP for generating, using, and managing encryption keys. It helps you encrypt data and control access to sensitive information.

Table of Contents