web2vec package
Subpackages
- web2vec.crawlers package
- web2vec.extractors package
- Subpackages
- Submodules
- web2vec.extractors.dns_features module
- web2vec.extractors.html_body_features module
  - HtmlBodyFeatures
  - body_length()
  - body_to_special_char_ratio()
  - check_obfuscated_scripts()
  - check_suspicious_keywords()
  - detect_api_endpoints()
  - detect_likely_js_spa()
  - find_copyright()
  - find_favicon()
  - find_logo()
  - get_html_body_features()
  - hidden_elements()
  - iframe_redirection()
  - is_external_url()
  - mouse_over_effect()
  - num_email_forms()
  - num_external_iframes()
  - num_external_scripts()
  - num_external_styles()
  - num_forms()
  - num_forms_external_action()
  - num_forms_get()
  - num_forms_post()
  - num_iframes_http()
  - num_images()
  - num_internal_links()
  - num_links()
  - num_media_external()
  - num_media_http()
  - num_meta_tags()
  - num_safe_anchors()
  - num_scripts_http()
  - num_styles_http()
  - num_titles()
  - right_click_disabled()
  - script_length()
  - script_to_body_ratio()
  - script_to_special_chars_ratio()
  - special_characters()
- web2vec.extractors.http_response_features module
  - HttpResponseFeatures
  - body_length()
  - body_to_special_char_ratio()
  - check_forms()
  - check_header_content_security_policy()
  - check_header_strict_transport_security()
  - check_header_x_content_type_options()
  - check_header_x_frame_options()
  - check_header_x_xss_protection()
  - check_https()
  - check_obfuscated_scripts()
  - check_redirects()
  - check_server_version()
  - check_suspicious_keywords()
  - count_redirects()
  - get_http_response_features()
  - is_live()
  - num_images()
  - num_links()
  - num_titles()
  - script_length()
  - script_to_body_ratio()
  - script_to_special_chars_ratio()
  - special_characters()
- web2vec.extractors.network_features module
- web2vec.extractors.ssl_certification_features module
- web2vec.extractors.url_geo_features module
- web2vec.extractors.url_lexical_features module
- web2vec.extractors.whois_features module
- Module contents