Get JS var value in HTML source using BeautifulSoup in Python

weixin_39979948 · published 2020-11-26 00:15:11 · 1366 views

I'm trying to get the value of a JavaScript var from HTML source code using BeautifulSoup.

For example, I have:

    <script>
    [other code]
    var my = 'hello';
    var name = 'hi';
    var is = 'halo';
    [other code]
    </script>

I want something that returns the value of the var "my" in Python.

How can I achieve that?

Source: https://stackoverflow.com/questions/41020606

2020-09-23 10:09

Accepted answer

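The simplest approach is to use one regular expression pattern twice: first to locate the script element via BeautifulSoup, then to extract the desired substring from that element's text:

    import re

    from bs4 import BeautifulSoup

    data = """
    <script>
    [other code]
    var my = 'hello';
    var name = 'hi';
    var is = 'halo';
    [other code]
    </script>
    """

    soup = BeautifulSoup(data, "html.parser")

    # MULTILINE lets "$" match at the end of each line, not only at the end of the text.
    pattern = re.compile(r"var my = '(.*?)';$", re.MULTILINE | re.DOTALL)

    # Locate the <script> element whose text matches the pattern
    # (the "string" argument was called "text" before Beautiful Soup 4.4.0).
    script = soup.find("script", string=pattern)

    # Apply the same pattern again to extract the captured value.
    print(pattern.search(script.text).group(1))

Prints hello.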
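If you need more than one variable, the same idea can be wrapped in a small helper. The following is only a sketch: the function name `get_js_var` and the sample markup are made up for illustration, and it assumes the value is a plain quoted string literal with no escaped quotes inside it. It accepts single or double quotes and returns None instead of raising when no script matches.

```python
import re
from typing import Optional

from bs4 import BeautifulSoup


def get_js_var(html: str, name: str) -> Optional[str]:
    """Return the string literal assigned to `var <name>` in a <script>, or None.

    Hypothetical helper for illustration; not part of BeautifulSoup.
    """
    # re.escape keeps characters such as "$" (legal in JS identifiers) literal.
    # Group 1 captures the opening quote; \1 requires the same closing quote.
    pattern = re.compile(
        r"\bvar\s+" + re.escape(name) + r"\s*=\s*(['\"])(.*?)\1\s*;"
    )
    soup = BeautifulSoup(html, "html.parser")
    # Check every <script> element rather than assuming one exists.
    for script in soup.find_all("script"):
        # .string is None for empty scripts (e.g. ones that only have src=...).
        match = pattern.search(script.string or "")
        if match:
            return match.group(2)
    return None


# Made-up sample input mixing both quote styles.
sample = """
<script>
var my = 'hello';
var name = "hi";
</script>
"""

print(get_js_var(sample, "my"))       # hello
print(get_js_var(sample, "name"))     # hi
print(get_js_var(sample, "missing"))  # None
```

For values that are not simple string literals (numbers, objects, concatenated strings), a regex like this will likely fall short, and a real JavaScript parser would be a better fit.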

2016-12-07
